
HOCOR and Modified MCE in Speaker Recognition

Speaker Recognition Based on HOCOR and MMCE
Abstract: A new method is proposed for extracting discriminative features from the linear prediction (LP) residual signal; these features are closely related to the glottal excitation of the individual speaker. Instead of the Fourier transform, the HAAR wavelet transform is applied to the residual signal, which is computationally simpler. Applying the HAAR transform to the LP residual yields a new feature, HOCOR. To further improve recognition performance and training speed, a modified minimum classification error (MMCE) criterion is applied in the recognition stage. Experimental results show that the proposed feature combined with MMCE achieves improved recognition results.
Source: Science Technology and Engineering (《科学技术与工程》), 2008, No. 12, pp. 3159-3161, 3174 (4 pages)
Keywords: LP residue; HAAR; HOCOR; MMCE
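
The feature extraction chain named in the abstract (LP analysis, residual signal, HAAR transform) can be illustrated with a short sketch. This is not the authors' implementation: the LP order, the number of decomposition levels, the absence of windowing, and the per-octave norm used to summarize the coefficients are all assumptions made for illustration, and the function names are hypothetical.

```python
import numpy as np

def lp_coefficients(frame, order):
    """Autocorrelation-method LP: solve the normal equations R a = r."""
    n = len(frame)
    # Autocorrelation at lags 0..order
    r = np.array([np.dot(frame[: n - k], frame[k:]) for k in range(order + 1)])
    # Toeplitz matrix with R[i, j] = r[|i - j|]
    idx = np.abs(np.subtract.outer(np.arange(order), np.arange(order)))
    R = r[idx] + 1e-9 * r[0] * np.eye(order)  # tiny diagonal loading for stability
    return np.linalg.solve(R, r[1:])

def lp_residual(frame, a):
    """e[n] = s[n] - sum_k a[k] * s[n-k-1], assuming zeros before the frame."""
    order, n = len(a), len(frame)
    padded = np.concatenate([np.zeros(order), frame])
    pred = np.zeros(n)
    for k in range(order):
        start = order - 1 - k
        pred += a[k] * padded[start : start + n]
    return frame - pred

def haar_dwt(x, levels):
    """Orthonormal Haar DWT; returns [detail_1, ..., detail_L, approx_L]."""
    approx = np.asarray(x, dtype=float)
    out = []
    for _ in range(levels):
        if len(approx) % 2:
            approx = approx[:-1]  # drop a trailing odd sample
        even, odd = approx[0::2], approx[1::2]
        out.append((even - odd) / np.sqrt(2.0))  # detail (high-pass) band
        approx = (even + odd) / np.sqrt(2.0)     # approximation (low-pass) band
    out.append(approx)
    return out

def octave_features(frame, order=12, levels=4):
    """Assumed summary: one norm per Haar detail band of the LP residual."""
    a = lp_coefficients(frame, order)
    e = lp_residual(frame, a)
    bands = haar_dwt(e, levels)
    return np.array([np.linalg.norm(d) for d in bands[:-1]])
```

Each Haar level needs only pairwise sums and differences, which is the sense in which the transform is cheaper than a Fourier analysis of the same residual.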


