期刊文献+

解卷积混合语音频域盲分离的次序问题新方法 被引量:1

Approach to Permutation Alignment of Blind Source Separation in Frequency-Domain for Convolutive Mixing Speech
下载PDF
导出
摘要 多通道语音信号的混合往往是卷积混合,瞬时盲分离方法不能获得好的分离效果,而频域方法由于频率次序的问题使性能下降。本文采用时频掩模的方法得到各频点上具有确定次序的、但带有失真的分离信号,将其作为参考,与频域上解得的次序不定信号进行相关,从而获得精确的语音分离信号。实验表明:本文提出的方法能有效地解决频域盲分离的次序不确定性问题,得到精度更高的分离卷积混合的语音信号。 In real world, multi-channel speech sources are usually in convolutive mixing environment. Instantaneous blind source separation(BSS) cannot separate the speech sources well. In frequecy-domain the synthetical performance declines due to the permutation problem at different frequency bins. This paper uses time-frequency mask idea to find some rough separated speech sources as references, in which there are some distortions without permutation ambiguity. Then the correlation among the references and the recovered sources by independent component analysis (ICA) is used to solve the permutation problem. Experimental results show that the proposed algorithm can solve permutation problem of recovered sources in frequency-domain and separate the mixed speech sources.
出处 《数据采集与处理》 CSCD 北大核心 2008年第6期734-739,共6页 Journal of Data Acquisition and Processing
基金 国家自然科学基金(NSF60571052)资助项目
关键词 盲信号分离 独立元分析 波达方向 时频二元掩模 blind source separation(BSS) independent component analysis(ICA) direction of arrival(DOA) time-frequency binary mask
  • 相关文献

参考文献15

  • 1Yang H H, Amari S. Adaptive on-line learning for blind separation-maximum entropy and minimum mutual information[J].Neural Computation, 1997, 9(5): 1457-1482.
  • 2Ozgur Y, Scott R. Blind separation of speech mixtures via time-frequency masking [J].IEEE Trans Signal Processing, 2004, 52(7) : 1830-1847.
  • 3姜卫东,陆佶人,张宏滔,高明生.基于相邻频点幅度相关的语音信号盲源分离[J].电路与系统学报,2005,10(3):1-4. 被引量:13
  • 4Smaragdis P. Blind separation of convolved mixtures in the frequency domain[J].Neural Compution, 1998, 22(6): 21-34.
  • 5Ikram M Z,Morgan D R. A beamforming approach to permutation alignment for multichannel frequency-domain blind speech separation[C]//Proc IEEE ICASSP. Orlando, Florida, USA: IEEE, 2002:881- 888.
  • 6吴文妍,张立明.带约束的卷积语音信号频域盲分离方法[J].系统工程与电子技术,2006,28(11):1617-1620. 被引量:2
  • 7Sawada H, Mukai R, Araki S,et al. A robust and precise method for solving the permutation problem of frequency-domain blind source separation [J]. IEEE Trans Speech and Audio Processing, 2004, 12 (5) : 530-538.
  • 8Bell A, Sejnowski T. An information-maximization approach to blind separation and blind deconvolution [J]. Neural Computation, 1995, 7(6) : 1129-1159.
  • 9Hyvtirinen A, Karhunen J O. Independent component analysis[M]. New York: J Wiley, 2001.
  • 10Comon P. Idependent component analysis-a new concept[J]. Signal Processing, 1994, 34: 287-314.

二级参考文献18

  • 1姜卫东,陆佶人,张宏滔.基于单个频点的水声信号盲源分离[J].电子与信息学报,2005,27(4):532-535. 被引量:5
  • 2姜卫东,陆佶人,张宏滔,高明生.基于相邻频点幅度相关的语音信号盲源分离[J].电路与系统学报,2005,10(3):1-4. 被引量:13
  • 3Sawada H, Mukai R, et al. A polar-coordinate based activation function for frequency domain blind source separation[A]. Proc. of ICA2001[C]. California, USA, 2001. 663-668.
  • 4Anemuller J, Kollmeier B. Amplitude modulation decorrelation for convolutive blind source separation[A]. Proc. of ICA2000[C]. Helsinki, Finland, 2000. 215-220.
  • 5Smaragdis P. Blind separation of convolved mixtures in the frequency domain[J]. Neurocomputing, 1998, 22: 21-34.
  • 6Cardoso J F. Souloumiac A. Blind beamforming for non-Gaussian signals[J]. IEE Proc.-F, , 1993, 140(6): 362-370.
  • 7Hyvarinen A, Oja E. A fast fixed-point algorithm for independent component analysis[J]. Neural Computation, 1997, 9: 1493-1492.
  • 8Ikram M Z, Morgan D R. Exploring permutation inconsistency in blind separation of speech signals in a reverberant environment[A]. Proc. ICASSP2000[C]. 2000. 1041-1044.
  • 9Kurita S, Saruwatari H, et al. Evaluation of blind separation method using directivity pattern under reverberant conditions[A]. Proc. ICASSP2000[C]. Istanbul, Turkey, 2000. 3140-3143.
  • 10Dapena A, Serviere C, Castedo L. Inversion of the sliding Fourier transform using only two frequency bins and its application to source separation[J]. Signal processing, 2003, 83(2): 453-457.

共引文献13

同被引文献17

引证文献1

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部