期刊文献+

基于i-vector局部加权线性判别分析的说话人识别 被引量:6

I-vector based speaker recognition using local weighted linear discriminant analysis
下载PDF
导出
摘要 基于i-vector的说话人识别系统通常采用LDA来消除训练和测试语音之间信道失配,不能保证样本在待识别语音近邻区域内具有最佳的分离度,这就使得目标说话人和其近邻间的得分差异较小,进而导致识别准确性下降。针对该问题,提出基于i-vector局部加权线性判别分析的说话人识别方法(LWLDA)。在计算类内和类间散度时,增加待识别语音近邻样本权重。在此基础上,通过提高待识别语音近邻域局部类间的分辨能力,尽可能减少因信道差异而产生的识别错误。在不同语音库上的实验结果表明:LWLDA在复杂信道环境下能够保持良好的鲁棒性;在交叉信道条件下的识别准确率比LDA平均提高3.6%。 Linear discriminant analysis( LDA) is often employed to eliminate the channel mismatch between training and testing speeches in identity vector( i-vector) based speaker recognition systems,which can not provide optimum separation of the samples in the near region of the utterance to be identified. In particular,there is small score difference between the target speaker and corresponding near neighbors,which results in the degradation of recognition accuracy. Aiming at this problem,the i-vector based speaker recognition method with local weighted linear discriminant analysis( LWLDA) is proposed. In the calculation of inter-class scatter and intra-class scatter,we increase the weights of the samples near the utterance to be identified; based on which,through enhancing the local inter-class discrimination ability in the near region of the utterance to be identified,the recognition errors caused by channel difference are reduced as much as possible. The experiments on different speech databases were conducted. The results demonstrate that,the LWLDA achieves good robustness under complex channel noise environment,and the recognition accuracy ratio is increased by 3. 6% under cross channel conditions compared with that of LDA method.
出处 《仪器仪表学报》 EI CAS CSCD 北大核心 2015年第12期2842-2848,共7页 Chinese Journal of Scientific Instrument
基金 国家自然科学基金(61473154)项目资助
关键词 语音处理 说话人识别 身份认证向量 局部加权线性判别分析 speech processing speaker recognition identity vector(i-vector) local weighted linear discriminant analysis(LWLDA)
  • 相关文献

参考文献19

  • 1KINNUNEN T, LI H ZH. An overview of text-independent speaker recognition: From features to super-vectors [ J ]. Speech Communication, 2010, 52( 1 ) : 12- 40.
  • 2LIU Shuang,MENG Jiayuan,ZHAO Xin,YANG Jiajia,HE Feng,QI Hongzhi,ZHOU Peng,HU Yong,MING Dong.Cross-task emotion recognition using EEG measures: first step towards practical application[J].Instrumentation,2014,1(3):17-24. 被引量:2
  • 3许良凤,李艳秋,胡敏,刘盛中,王晓华.基于I-DCV的人脸识别方法[J].电子测量与仪器学报,2015,29(1):106-110. 被引量:8
  • 4毕雪芹,苏艳娟,王琪.嵌入式指纹识别系统的设计及试验研究[J].国外电子测量技术,2015,34(2):50-53. 被引量:22
  • 5邱政权,范小春,王俊年.基于维纳滤波和混合模型的说话人识别[J].仪器仪表学报,2009,30(7):1436-1440. 被引量:5
  • 6GONZALEZ-RODRIGUEZ J. Evaluating automatic speaker recognition systems: An overview of the NIST speaker recognition evaluations ( 1996-2014 ) [ J ]. Lo- quens, 2014, 1 ( 1 ) : 1-15.
  • 7KHOURY E, VESNICER B, FRANCO-PEDROSO J, et al. The 2013 speaker recognition evaluation in mobile en- vironment[ C ]. Proceedings of IAPR International Con- ference on Biometrics (ICB), 2013: 1-8.
  • 8KENNY P, BOULIANNE G, OUELLET P, et al. Joint factor analysis versus eigenchannels in speaker recogni- tion[J]. IEEE Transactions on Audio, Speech and Lan- guage Processing, 2007, 15(4) : 1435-1447.
  • 9DEHAK N, KENNY P, DEHAK R, et al. Front-end factor analysis for speaker verification [ J ]. IEEE Trans-actions on Audio, Speech, and Language Processing, 2011, 19(4) : 788-798.
  • 10MCLAREN M, LEEUWEN D V. Source normalised and weighted LDA for robust speaker recognition u- sing i-veetors[ C ]. IEEE International Conferenee on Acoustics Speech and Signal Processing (ICASSP) , 2011:5456 -5459.

二级参考文献117

共引文献88

同被引文献23

引证文献6

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部