期刊文献+

基于神经网络的语音识别研究 被引量:10

Research on Speech Recognition Based on Neural Networks
下载PDF
导出
摘要 由于具有良好的抽象分类特性,神经网络现已应用于语音识别系统的研究和开发,并成为解决识别相关问题的有效工具。为解决一般语音识别系统准确率较低的问题,本文分别给出了由循环神经网络(RNN)和多层感知器(MLP)组成识别模块的两种语音识别系统,并对二者识别的准确性进行了比较。介绍了特征提取模块的主要工作步骤并讨论了组成识别模块的上述两种神经网络结构。其中,特征提取模块利用线性预测编码(LPC)倒谱编码器,把输入语音翻译成LPC倒谱空间中的曲线;而识别模块完成对某个特征空间曲线之间的联系和单词的识别。实验结果表明,MLP方法准确率高于RNN方法,而RNN方法准确率可达85%。 Because of good characteristics of the abstract classification, neural networks have become an effective tool for resolving issues related to recognition, and have been applied to the research and development of speech recognition systems. A speech recognizer system comprises of two blocks, Feature Extractor and Recognizer. For increasing the recognition accuracy, this paper proposes two types of speech recognition system whose recognition block uses the recurrent neural network(RNN) and multi layer pereeptron(MLP) respectively. Furthermore, the main work steps of Feature Extractor(FE) block is introduced and the structure of two types of neural networks mentioned above is discussed. Using a standard LPC Cepstrum, the FE translates the input speech into a trajectory in the LPC Cepstrum feature space. The recognizer block discovers the relationships between the trajectories and recognizes the word. The results show that the MLP's recognition accuracies were better than the RNN's, while the RNN's recognition accuracies achieved 85%.
出处 《重庆师范大学学报(自然科学版)》 CAS 2010年第4期73-76,共4页 Journal of Chongqing Normal University:Natural Science
基金 四川省教育厅重点科研项目(No.08ZA018) 校级科研项目(No.06A002)
关键词 神经网络 语音识别 循环神经网络 多层感知器 线性预测 矢量量化 neural networks speech recognition recurrent neural network multi layer perceptron linear prediction vector quantization
  • 相关文献

参考文献15

  • 1Rabiner L R, Juang B H. Fundamentals of speech recognition [ M ]. Upper Saddle River, NJ : Prentice hall, 1993.
  • 2孟显勇,袁丁.多层BP神经网络用于破译椭圆曲线密码[J].四川师范大学学报(自然科学版),2005,28(3):371-375. 被引量:3
  • 3张彤,肖南峰.基于BP网络的指纹识别系统[J].重庆理工大学学报(自然科学),2010,24(1):47-50. 被引量:8
  • 4高富强,邹恒,秦昌硕,须民健,杨勇.BP和RBF神经网络在字母识别中的比较[J].重庆工学院学报(自然科学版),2009,23(9):77-80. 被引量:5
  • 5宋智,何嘉.面向复杂问题的BP神经网络并行算法[J].西南师范大学学报(自然科学版),2009,34(3):103-106. 被引量:2
  • 6朱鑫森,刘顺承.基于神经网络与改进D-S证据理论的目标识别[J].四川兵工学报,2009,30(7):67-69. 被引量:3
  • 7Gandhiraj R, Sathidevi P S. Auditory-based wavelet packet filterbank for speech recognition using neural network [ A ]//Proc Int Conf Adv Comput Commun, ADCOM [ C ] Institute of Electrical and Electronics Engineers Inc,2007: 666-671.
  • 8Mohammad I, Shah R S, Saad P D. Improving speaker independent speech recognition process using speech recognition engine[A]//Proc Int Conf Artif Intell,ICAI Proc Int Conf Mach Learn ; Models, Technol Appl , MLMTA [ C ]. Las Vegas ,NV ,United states : CSREA Press ,2008:870-875.
  • 9Lee Chin H, Rabiner, Lawrence R. Directions in automatic speech recognition[ J]. NTT Review, 1995,7(2) : 19-29.
  • 10Lalith Kumar T, Kishore Kumar T, Soundar Rajan K. Speech recognition using neural networks [ A ]// Int Conf Signal Process Syst [ C ]. IEEE Computer Society,2009: 248 -252.

二级参考文献61

共引文献22

同被引文献80

引证文献10

二级引证文献66

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部