Robust Speaker Identification Algorithm Based on Adaptive Parallel Model Combination (cited by: 6)
Abstract: The performance of speaker recognition systems degrades sharply in real applications because of environmental noise. This paper proposes a robust speaker identification method based on the Gaussian Mixture Model-Universal Background Model (GMM-UBM) and adaptive parallel model combination (APMC). APMC is a noise-robust feature compensation algorithm that effectively reduces the mismatch between the training and testing environments, thereby improving recognition accuracy and noise robustness. First, the algorithm estimates the noise features from the test speech. Second, it fits a single Gaussian model to those features to obtain the noise mean and covariance. Finally, using the noise mean and covariance, it adjusts the mean vectors and covariance matrices of the trained GMM so that they match the testing environment as closely as possible. Experimental results indicate that the method can accurately reconstruct the clean-speech GMM parameters and significantly improves speaker identification accuracy, especially at low signal-to-noise ratios (SNR).
Authors: LI Cong (李聪); GE Hong-wei (葛洪伟) (Ministry of Education Key Laboratory of Advanced Process Control for Light Industry, Jiangnan University, Wuxi, Jiangsu 214122, China; School of Internet of Things, Jiangnan University, Wuxi, Jiangsu 214122, China)
Source: Journal of Signal Processing (《信号处理》), CSCD, PKU Core, 2018, No. 7, pp. 867-875 (9 pages)
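Funding: Jiangsu Province Graduate Research and Innovation Program for Regular Universities (KYLX16_0781, KYLX16_0782); Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD)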
Keywords: speaker recognition; feature compensation; parallel model combination (PMC); Gaussian mixture model-universal background model (GMM-UBM); noise
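The three steps named in the abstract (estimate noise from the test speech, fit a single Gaussian, adjust the GMM means and covariances) can be illustrated with a minimal sketch. This is not the authors' adaptive algorithm, whose details are not given in this record. It assumes the standard log-normal form of parallel model combination, applied directly to log-spectral features with diagonal covariances, and it uses a hypothetical lowest-energy heuristic to pick noise frames. All function names and the `frac` and `floor` parameters are illustrative.

```python
import numpy as np

def pick_noise_frames(frames, frac=0.1):
    """Hypothetical noise estimate: keep the lowest-energy fraction of frames."""
    energy = frames.sum(axis=1)
    k = max(1, int(len(frames) * frac))
    return frames[np.argsort(energy)[:k]]

def fit_noise_gaussian(noise_frames, floor=1e-6):
    """Fit a single diagonal Gaussian; the variance floor is an assumption."""
    return noise_frames.mean(axis=0), noise_frames.var(axis=0) + floor

def log_to_linear(mean, var):
    """Map a log-domain Gaussian to linear-domain moments (log-normal)."""
    lin_mean = np.exp(mean + 0.5 * var)
    lin_var = lin_mean ** 2 * (np.exp(var) - 1.0)
    return lin_mean, lin_var

def linear_to_log(lin_mean, lin_var):
    """Inverse of log_to_linear under the log-normal assumption."""
    var = np.log(lin_var / lin_mean ** 2 + 1.0)
    mean = np.log(lin_mean) - 0.5 * var
    return mean, var

def pmc_adapt(gmm_means, gmm_vars, noise_mean, noise_var):
    """Combine each clean-speech component with additive noise.

    gmm_means, gmm_vars: arrays of shape (n_components, n_dims).
    noise_mean, noise_var: arrays of shape (n_dims,).
    Speech and noise are assumed independent and additive in the
    linear spectral domain, so their means and variances add there.
    """
    s_mean, s_var = log_to_linear(gmm_means, gmm_vars)
    n_mean, n_var = log_to_linear(noise_mean, noise_var)
    return linear_to_log(s_mean + n_mean, s_var + n_var)
```

A call would look like `pmc_adapt(means, variances, *fit_noise_gaussian(pick_noise_frames(test_frames)))`, where `means` and `variances` come from an already trained speaker GMM. A cepstral front end such as MFCC would additionally need an inverse DCT before the combination and a DCT afterwards, which this sketch omits.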