期刊文献+

融合LPC和MFCC特征的前馈神经网络短语音识别

下载PDF
导出
摘要 文章针对短语音条件下声纹识别的鲁棒性问题,结合前馈神经网络对短语音特征表示进行了研究,采用LPC和MFCC特征融合对短语音进行识别。实验显示,通过前馈神经网络的训练,分类器在对通过录音设备获取的短语音说话人识别能达到较高的准确率,同时采用融合的LPC和MFCC在前馈神经网络中可以对短语音说话人识别达到87%的准确率。
出处 《长江信息通信》 2023年第11期171-174,共4页 Changjiang Information & Communications
  • 相关文献

参考文献5

二级参考文献53

  • 1陈华伟,靳蕃.基于感知模型的美尔谱失真测度[J].西南交通大学学报,2006,41(6):723-728. 被引量:4
  • 2张军,张德运,傅鹏.一种改进的心理声学语音质量客观评价算法[J].微电子学与计算机,2007,24(3):203-206. 被引量:6
  • 3Telecommunication Standardization Sector of ITU. ITU- T Recommendation P. 830 Subjective performance assessment of telephone-band and wideband digital codecs[ S]. Geneva: International Telecommunication Union, 1996.
  • 4Telecommunication Standardization Sector of ITU. ITU- T Recommendation P. 862 Perceptual evaluation of speech quality (PESQ) : An objective method for end- to-end speech quality assessment of narrow-band telephone networks and speech codecs[ S]. Geneva: International Telecommunication Union, 2001.
  • 5KUBICHEK R. Mel-cepstral distance measure for objective speech quality assessment[ C]//Proceedings of IEEE Pacific Rim Conference on Communications, Computer and Signal Processing. Piscataway: IEEE Press, 1993: 125-128.
  • 6DAVIS S B, MERMELSTEIN P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences[ J ]. IEEE Trans. on Acoustics, Speech and Signal Processing, 1980, 28(4) : 357-366.
  • 7JOHANNESMA P I M. The pre-response stimulus ensemble of neurons in the cochlear nucleus [ C ] //// Proceedings of the Symposium on Hearing Theory. Eindhoven: IPO, 1972: 58-69.
  • 8NASERSHARIF B, AKBARI A. SNR-dependent compression of enhanced Mel subband energies for compensation of noise effects on MFCC features [J]. Pattern Recognition Letters, 2011, 28 (11) : 1320- 1326.
  • 9POVEY D, KINGSBURY B, MANGU L, et al. fMPE: Discriminatively trained features for speech recognition [C]///Proceedings of the International Con- ference on Audio, Speech and Signal Processing. Pis- cataway, NJ, USA: IEEE, 2005: 961-964.
  • 10ZHANG B, MATSOUKAS S, SCHWARTZ R. Re- cent progress on the discriminative region-dependent transform for speech feature extraction [C] // Proceed- ings of the Annual Conference of International Speech Communication Association. Baixs, France: ISCA, 2006: 1495-1498.

共引文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部