期刊文献+

一种基于深度神经网络的话者确认方法 被引量:4

A SPEAKER VERIFICATION METHOD BASED ON DEEP NEURAL NETWORK
下载PDF
导出
摘要 主要研究基于深度神经网络的话者确认方法。在训练阶段,以语音倒谱特征参数作为输入,说话人标签作为输出有监督的训练DNN;在话者注册阶段,从已训练的DNN最后一个隐藏层抽取与说话人相关的特征矢量,称为d-vector,作为话者模型;在测试阶段,从测试语音中抽取其d-vector与注册的话者模型相比较然后做出判决。实验结果表明,基于DNN的话者确认方法是可行的,并且在噪声环境及低的错误拒绝率的条件下,基于DNN的话者确认系统性能比i-vector基线系统性能更优。最后,将两个系统进行融合,融合后的系统相对于i-vector基线系统在干净语音和噪声语音条件下等误识率(EER)分别下降了13%和27%。 In this paper we mainly investigate the method of using deep neural network ( DNN) for speaker verification. At the stage of training, the DNN is trained under supervision using the feature parameter of speech cepstrum as input and the label of speaker as output. At the stage of speaker registration, an eigenvector correlated to the speaker, namely d-vector, is extracted from the last hidden layer of the trained DNN and is used as the model of speaker. At test stage, from testing speech a d-vector is extracted to compare it with the model of the registered speaker and then to make the verification decision. Experimental results show that the DNN-based speaker verification method is feasible. Moreover, under the condition of noisy environment and low error-rejection rate, the DNN-based speaker verification system outperforms the i-vector base line system in performance. Finally, we integrate these two systems, relative to the i-vector base line system,the integrated system reduces the equal error rate (EER) by 1 3 % and 2 7 % for clean speech and noisy speck conditions respectively.
作者 吴明辉 胡群威 李辉 Wu Minghui;Hu Qunwei;Li Hui(Department of Electronic Science and Technology, University of Science and Technology of China, Hefei 230027, Anhui, China)
出处 《计算机应用与软件》 CSCD 2016年第6期159-162,共4页 Computer Applications and Software
关键词 话者确认 深度神经网络 深度学习 Speaker verification Deep neural network ( DNN) Deep learning
  • 相关文献

参考文献17

  • 1Reynolds D A,Quatieri T F,Dunn R B.Speaker verification using adaptedGaussian mixture models [J].Digital signal processing, 2000,10(1):19-41.
  • 2Kenny P,Boulianne G,0uellet P,et al. Joint factor analysis versus eigenchannelsin speaker recognition [J].Audio,Speech,and Language Processing,IEEE Transactions on,2007,15(4):1435-1447.
  • 3Kenny P,Boulianne G,0uellet P,et al. Speaker and session variabilityin GMM-based speaker verification[J].Audio,Speech,and Language Processing,IEEE Transactions on,2007,15(4):1448-1460.
  • 4Kenny P,0uellet P,Dehak N,et al. A study of interspeaker variabilityin speaker verification [J].Audio,Speech,and Language Processing,IEEE Transactions on,2008,16(5):980-988.
  • 5Dehak N,Kenny P,Dehak R,et al.Front-end factor analysis for speakerverification [J].Audio,Speech,and Language Processing, IEEETransactions on,2011,19(4):788-798.
  • 6Hinton G,Deng L,Yu D,et al. Deep neural networks for acoustic modelingin speech recognition:The shared views of four research groups[J].Signal Processing Magazine,IEEE,2012,29(6):82-97.
  • 7Martin A F,Greenberg C S.The NIST 2010 speaker recognition evaluation[C]Interspeech 2010,11th Annual Conference of the InternationalSpeech Communication Association, Makuhari,Chiba,Japan,2010:2726-2729.
  • 8Kenny P.Bayesian Speaker Verification with Heavy-Tailed Priors[C]Proc.Odyssey Speaker and Language Recognition Workshop, Brno,CzechRepublic,2010:14.
  • 9Larcher A,Lee K A,Ma B,et al.Phonetically-constrained PLDA modelingfor text-dependent speaker verification with multiple short utterances[C]Acoustics,Speech and Signal Processing (ICASSP),20131EEE International Conference on.IEEE,2013:7673-7677.
  • 10Yegnanarayana B,Kishore S P.AANN:an alternative to GMM for patternrecognition[J].Neural Networks,2002,15(3):459-469.

同被引文献31

引证文献4

二级引证文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部