期刊文献+

具有环境自学习机制的鲁棒说话人识别算法 被引量:2

Robust Speaker-Recognition Algorithm with Environmental Self-Learning Mechanism
下载PDF
导出
摘要 说话人识别系统实际应用时,一旦应用环境和训练环境不一致,系统的性能会急剧下降。由于环境噪声的多变性,系统训练时无法预测实际应用中的环境噪声。因此,引入环境自学习和自适应思想,通过改进的矢量泰勒级数(Vector Taylor Series,VTS)刻画环境噪声模型和说话人语音模型之间的统计关系,提出一种具有环境自学习能力的鲁棒说话人识别算法。系统应用中每当环境变化时利用语音输入前采集到的环境噪声信号来迭代更新环境噪声模型参数,进一步基于VTS确立的统计关系,将说话人语音模型自适应到实际应用环境来补偿环境失配的影响。说话人辨认实验结果表明,提出的方法在低信噪比条件下对于不同种类的噪声都能显著提升系统的识别性能。 In the actual application of the speaker recognition system,once application environment and the training environment are inconsistent,the performance of the system will drop significantly.Due to the variability of environmental noise,the environmental noise in practical applications cannot be predicted during system training.Therefore,the environment self-learning and adaptive ideas are introduced to describe the statistical relationship between the environmental noise model and the speaker’s speech model through the improved VTS(Vector Taylor Series),and a robust speaker-recognition algorithm with environmental self-learning ability is proposed.In system application,when environment changes,the environment noise before speech input is collected to iteratively update the model parameters of environment noise,and further adapt the speaker model to the application environment to compensate for the environmental mismatch based on the statistical relationship established by VTS.The speaker-recognition experiment results indicate that the proposed method can significantly improve the recognition performance of the system for different kinds of noise under low SNR conditions.
作者 张靖 俞一彪 ZHANG Jing;YU Yi-biao(School of Electronic Information,Soochow University,Suzhou Jiangsu 215000,China)
出处 《通信技术》 2020年第3期618-624,共7页 Communications Technology
关键词 说话人识别 自学习 自适应 矢量泰勒级数 环境噪声 speaker recognition self-learning self-adaptation VTS(Vector Taylor Series) environmental noise
  • 相关文献

参考文献3

二级参考文献36

  • 1俞一彪,王朔中.基于互信息匹配模型的说话人识别[J].声学学报,2004,29(5):462-466. 被引量:8
  • 2刘海滨,吴镇扬,赵力,曾毓敏.噪声环境下基于最大后验非线性变换的隐马尔可夫模型自适应算法[J].声学学报,2004,29(5):467-471. 被引量:4
  • 3赵蕤,王作英.语音识别中信道和噪音的联合补偿[J].声学学报,2006,31(5):466-470. 被引量:11
  • 4Chen C T, Chen C. Efficient genetic algorithm of codebook design for text-independent speaker recognition. IEICE,2002, E85-A(11): 2529-2531.
  • 5Lee Y -T. Information-theoretic distortion measures for speech recognition. IEEE-ASSP, 1991; 39:330-335.
  • 6Okawa S, Kobayashi T, Shirai K. Automatic training of phoneme dictionary based on mutual information criterion.ICASSP, 1994:241-244.
  • 7Bahl L R, Brown P F. Maximum mutual information estimation of hidden Markov model parameters for speech recognition. ICASSP, 1986:49-52.
  • 8Shaughnessy D O. Speech communications-human and machine. IEEE Press, NJ., 2000:378-383.
  • 9Naik J. Speaker verification: A tutorial. IEEE Commun.Mag., 1990; 28(1): 42-48.
  • 10Campbell J P. Speaker recognition: A tutorial. IEEE Proc., 1997; 85(9): 1436-1462.

共引文献16

同被引文献11

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部