摘要
在W-分离正交性假设的语音盲分离方法中,由于没有考虑多个源信号同时存在的情况,导致分离信号中不可避免地存在音乐噪声。针对这种部分W-分离正交情况,提出了基于信道估计的语音盲分离方法。该方法先检测只有一个源信号存在的时频点并进行归一化处理,使得处理后的结果与频率无关,克服了W-分离正交性假设的不足以及频率置换问题,通过K-means聚类估计出信道,再结合信号子空间方法重构源信号。仿真结果表明,提出的方法可以有效减少分离语音中的音乐噪声,与典型的时频二元掩蔽方法相比,其平均信号失真比提高3.02dB,同时平均信干比提高4.61dB。
In blind speech separation methods based on the assumption of W-disjoint orthogonality (W-DO), musical noise is inevitable in separated signals because the assumption does not include the case of existing multiple source signals in the time-frequency domain. A blind speech separation method based on channel estimation is proposed for partial approximate W-disjoint orthogonality. The time-frequency cells with only one source are detected and normalized to be independent of frequency, which overcomes not only the shortcoming of W-DO property but also the frequency permutation problem, and then the channel estimation is obtained by K-means clustering. Finally, signal subspace method is exploited to reconstruct sources. Simulation results demonstrate that the novel method can effectively reduce the musical noise in the separated speech signals, and it outperforms the typical time frequency binary masking method, the averaged signal to distortion ratio (SDR) is improved by 3.02 dB and the averaged signal to interference ratio (S/R) is improved by 4.61 dB.
出处
《电子科技大学学报》
EI
CAS
CSCD
北大核心
2010年第2期186-190,共5页
Journal of University of Electronic Science and Technology of China
基金
国家自然科学基金(60672157
60672158)
关键词
信道估计
部分W-分离正交
主分量分析:语音分离
channel estimation
partial W-disjoint orthogonality
principal component analysis
speech separation