期刊文献+

基于LDOF准则的自适应高斯后端语种识别方法 被引量:3

Adaptive Gaussian back-end based on LDOF criterion for language recognition
下载PDF
导出
摘要 针对由语种类内多样性引起的测试样本和训练模型不匹配的问题,提出一种基于局部距离离群因子准则(LDOF,local distance-based outlier factor)的自适应高斯后端语种识别方法。定义LDOF准则,实现有效的参数寻优过程并动态地在多类语种训练集上挑选出与测试样本特性相近的训练样本,调整原高斯后端,进而得到改进的语种识别方法。在NIST LRE 2009的6个易混淆语种任务集上的实验结果表明,所提方法的等错误概率(EER,equal error rate)和平均检测代价有显著提升。 In order to alleviate the mismatch in model between training and testing samples caused by inter-language variations, adaptive Gaussian back-end based on LDOF criterion was proposed for language recognition. The local distance-based outlier factor(LDOF) criterion was defined to find the appropriate model parameters and dynamically select the training data subset similar to the testing samples from multiple class training sets. Then original back-end was adjusted to obtain a more matched recognition model. Experimental results on NIST LRE 2009 easily-confused language data set show that proposed method achieves an obvious performance improvement on both the equal error rate(ERR) and average decision cost function.
作者 叶中付 戚婷 李赛峰 宋彦 YE Zhong-fu QI Ting LI Sai-feng SONG Yan(School of Information Science and Technology, University of Science and Technology of China, Hefei 230027, China National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027, China State Key Laboratory of Mathematical Engineering and Advanced Computing, Wuxi 214125, China)
出处 《通信学报》 EI CSCD 北大核心 2017年第4期17-24,共8页 Journal on Communications
基金 数学工程与先进计算国家重点实验室开放基金资助项目(No.2015A15)~~
关键词 语种识别 类内多样性 自适应高斯后端 LDOF language recognition inter-language variations adaptive Gaussian back-end LDOF
  • 相关文献

参考文献2

二级参考文献33

  • 1Zissman M A. Comparison of four approaches to automatic language identification of telephone speech. IEEE Transac- tions Speech and Audio Process, 1996, 4(3): 31-44.
  • 2Campbell W M, Sturim D E, Reynolds D A. Support vector machine using GMM supervectors for speaker verification. IEEE Signal Processing Letters, 2006, 13(5): 308-311.
  • 3Kenny P. Factor Analysis of Speaker and Session Variability Theory and Algorithms, Technical Report CRIM-06/08-13 Montreal, CRIM, 2005.
  • 4Kenny P, Boulianne G, Oullet P, Dumouchel P. Joint factor analysis versus eigenchannels in speaker recognition. IEEE Transactions on Audio, Speech, and Language Processing, 2007, 15(4): 1435-1447.
  • 5Martinez D, Plchot O, Burget L, Glembek O, Matejka P. Language Recognition in iVectors Space. In: INTER- SPEECH. Florence, Italy: ISCA, 2011. 861-864.
  • 6Dehak N, Torres P A, Reynolds D, Dehak R. Language recognition via iVectors and dimensionality reduction. In: INTERSPEECH. Florence, Italy: ISCA, 2011. 857-860.
  • 7Tipping M E, Bishop C M. Probabilistic principal compo- nent analysis. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 1999, 61(3): 611-622.
  • 8Turk M, Pentland A P. Face recognition using eigenfaces. In: Proceedings of the IEEE Conference on Computer Vi- sion and Pattern Recognition. Maui, Hawaii: IEEE, 1991. 586-591.
  • 9曾宪华.流形学习的谱方法相关问题研究[博士学位论文],北京交通大学,中国,2009.
  • 10Yang J C, Liang C Y, Yang L, Suo H B, Wang J J, Yan Y H. Factor analysis of Laplacian approach for speaker recogni- tion. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Ky- oto, Japan: IEEE, 2012. 4221-4224.

共引文献5

同被引文献14

引证文献3

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部