

On Chip Realization of HMM Speaker-independent Isolated Word Speech Recognizer
Abstract: A speaker-independent isolated-word speech recognition system is implemented on the SEED-DEC5502 DSP embedded development platform. Unlike traditional speaker-dependent systems, it requires no user training and is easy to use. The system uses a modified real-time endpoint detection (voice activity detection, VAD) algorithm based on the rate of change of log-domain speech energy, and performs feature extraction and decoding only on the detected voiced segments, which reduces the number of speech frames to process. The computation of state output probabilities is also analyzed and optimized to further reduce the computational load. Experiments with a 100-word vocabulary show a recognition accuracy of 98.1% at 1.03 times real time.
Source: Telecommunications Science (《电信科学》), PKU Core Journal, 2006, No. 10, pp. 60-63 (4 pages)
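Funding: Hebei Provincial Department of Science and Technology funded project (No. 052135147); Hebei Provincial Department of Science and Technology guidance project (No. 042135105)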
Keywords: speech recognition; embedded system; endpoint detection; state emission probability
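
The abstract mentions endpoint detection driven by the rate of change of log-domain energy but does not give the algorithm itself. The following is only a minimal sketch of that general idea, not the authors' modified method: the function names, the frame length of 160 samples, the rise threshold, and the hangover count are all illustrative assumptions.

```python
import math

def log_energy(frame):
    """Log-domain energy of one frame; the small floor avoids log(0)."""
    return math.log(sum(s * s for s in frame) + 1e-10)

def detect_endpoints(samples, frame_len=160, rise=2.0, hang=5):
    """Return (start_frame, end_frame) of the first voiced segment, or None.

    Assumed rule (hypothetical, for illustration only):
    - onset: log energy jumps by more than `rise` between adjacent frames;
    - offset: log energy stays within `rise` of the pre-onset level
      for `hang` consecutive frames.
    """
    n_frames = len(samples) // frame_len
    energies = [
        log_energy(samples[i * frame_len:(i + 1) * frame_len])
        for i in range(n_frames)
    ]

    start = None   # index of the first voiced frame
    floor = None   # log energy just before the onset
    quiet = 0      # consecutive low-energy frames seen after the onset

    for t in range(1, n_frames):
        if start is None:
            # Frame-to-frame change of log energy signals a possible onset.
            if energies[t] - energies[t - 1] > rise:
                start, floor, quiet = t, energies[t - 1], 0
        elif energies[t] < floor + rise:
            quiet += 1
            if quiet >= hang:
                return (start, t - hang)  # last frame before the quiet run
        else:
            quiet = 0

    if start is None:
        return None
    return (start, n_frames - 1 - quiet)
```

Only the frames between the returned indices would then be passed on to feature extraction and decoding, which is how skipping silent frames reduces the workload.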








