期刊文献+

干扰空间投影在本征音说话人自适应中的应用

APPLICATION OF NUISANCE SPACE PROJECTION IN EIGENVOICE SPEAKER ADAPTATION
下载PDF
导出
摘要 本征音自适应是一种快速高效的自适应算法,它被广泛应用到说话人识别中,但由于同一个说话人的本征音自适应的说话人因子之间的信道特征和噪声存在差异,导致了算法的识别精度降低。针对这一问题,提出基于干扰空间投影的本征音说话人识别(EV-NSP)算法。将训练语音通过主成分分析(PCA)方法计算得到干扰投影矩阵;将投影矩阵应用到生成本征音矢量算法中;利用最大似然估计算法自适应地得到说话人因子的估计值。实验结果表明,EV-NSP算法相对于传统的本征音自适应算法识别性能有了较大的提高。 However, Eigenvoice adaption is a fast and efficient adaptive algorithm which is widely used the speaker factor of eigenvoice adaption from the same speaker has different channel c in speaker recognition. haracteristies and noise feature, thus reduced the recognition accuracy. To solve this problem, we propose an eigenvoice speaker recognition algorithm based on nuisance space projection (EV-NSP). We calculated the nuisance space projection matrix by principal component analyzing training data, and then run the eigenvoice vector algorithm by using projection matrix. The maximum likelihood estimation algorithm was used to adaptively estimate the speaker factor. Experimental results indicate that EV-NSP algorithm have better performance compared with traditional adaptive eigenvoice algorithm.
出处 《计算机应用与软件》 2017年第11期188-191,263,共5页 Computer Applications and Software
基金 国家自然科学基金青年基金项目(61601519 61402433) 山东省自然科学基金项目(ZR2014FM017) 青岛市科技创新计划项目(15-9-80-jch)
关键词 本征音自适应 干扰空间投影 主成分分析 信道失配 Eigenvoice adaption Nuance space projection PCA Channel mismatch
  • 相关文献

参考文献7

二级参考文献58

  • 1LIN Wei YANG Lili XU Boling.A new frequency scale of Chinese whispered speech in the application of speaker identification[J].Progress in Natural Science:Materials International,2006,16(10):1072-1078. 被引量:5
  • 2张昊天.[D].北京:清华大学电子工程系,2000.
  • 3Reynolds D A, Quatieri T F, Dunn R B. Speaker verification using adapted Gaussian mixture models. Digital Signal Processing, 2000, 10(1): 19-41
  • 4Campbell W M, Sturim D E, Reynolds D A. Support vector machines using GMM supervectors for speaker verification. IEEE Signal Processing Letters, 2006, 13(5): 308-311
  • 5Campbell W M, Sturim D E, Reynolds D A, Solomonoff A. SVM based speaker verification using a GMM supervector kernel and NAP variability compensation. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. Toulouse, France: IEEE, 2006. 97-100
  • 6Deng J, Zheng T F, Wu W H. Session variability subspace projection based model compensation for speaker verificatio. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. Washington D. C., USA: IEEE, 2007. 47-50
  • 7Reynolds D A. Channel robust speaker verification via feature mapping. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. Hong Kong, China: IEEE, 2003. 53-56
  • 8Matejka P, Burget L, Schwarz P, Glembek O, Karafiat M, Grezl F. STBU system for the NIST 2006 speaker recognition evaluation. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. Washington D. C., USA: IEEE, 2007. 221-224
  • 9NIST. The NIST year 2006 speaker recognition evaluation plan [Online], available: http://www.nist.gov/speech/tests/ spk/2006/sre-06-evalplan-v9.pdf, February, 2007
  • 10Lamel L, Rabiner L, Rosenberq A, Wilpon J. An improved endpoint detector for isolated word recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1981, 29(4): 777-785

共引文献53

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部