期刊文献+

利用语音的频谱空间特征进行汉语抗噪语音识别的方法

Spatial characteristics of speech spectrum for robust speech recognition
下载PDF
导出
摘要 抗噪连续语音识别是当前汉语连续语音识别的重要研究领域。采用通过度量连续语音帧之间频谱的稳定性,将连续语音切分成份,再将切分结果(无论时间长短)变换为与时间无关的大小固定的频谱空间特征,通过与模板库进行比较实现语音识别。新的频谱空间特征,与语音时长无关,同时表现出较好的抗噪声能力。在特定人连续语音识别测试系统中,取得了不错的识别效果。 The anti-noise continuous speech recognition is an important research topic of current Chinese continuous speech recognition. In this paper, by measuring the stability of the frequency spectrum between the continuous speech frames, speech signal can be segmented, and then these segmentations(regardless of the length of time) are transformed into time-independent and size-fixed spatial characteristics of speech spectrum. By comparing to speech template, the speech recognition results are obtained. The new spatial characteristics of speech spectrum are independent of speech length, and show better immunity to noise. In a specific continuous speech recognition testing system, favorable recognition result is obtained.
出处 《声学技术》 CSCD 北大核心 2015年第1期51-53,共3页 Technical Acoustics
关键词 语音特征 连续语音识别 抗噪语音识别 speech characteristics continuous speech recognition anti-noise speech recognition
  • 相关文献

参考文献3

二级参考文献16

  • 1冯清枝.基于直方图修正的图像增强技术[J].广东公安科技,2004(2):49-51. 被引量:9
  • 2林帆,徐明星.一种改进的基于时域参数的语音切分算法[J].计算机科学,2006,33(4):164-167. 被引量:3
  • 3Rabiner L R. A tutorial on hidden Markov models and selected applications in speech recognition. In: Proc. IEEE, 1989; 77(2): 257--285.
  • 4Povey D. A tutorial-style introduction to subspace gaussian mixture models for speech recognition. Tech. Rep., Tech. Rep., MSR-TR-2009-111, Microsoft Research, 2009.
  • 5Povey D et al. Subspace Gaussian Mixture Models for Speech Recognition. In: Proc. of ICASSP2010, 2010: 4330--4333.
  • 6Xavier L Aubert. An overview of decoding techniques tor large vocabulary continuous speech recognition. Computer Speech and Language, 2002; 16(1): 89--114.
  • 7Demuynck K, Duchateuu J, van Compernolle D, Wambacq P. An efficient search space representation for large vocabulary continuous speech recognition. Speech Commun., 2000: 30(1): 37--53.
  • 8Ney H, Ortmanns S. Progress in dynamic programming search for LVCSR. In: Proc. IEEE, 2000; 88:1224--1240.
  • 9YANG Zhanlei, LIU Wenju. A novel path extension frame- work using steady segment detection for Mandarin speech recognition. In: Proc. of Interspeech 2010, Makuhari, Japan, 2010:226--229.
  • 10Povey D, Chu S M, Varadarajan B. Universal background model based speech recognition. In: Proc. ICASSP, 2008: 4561--4564.

共引文献38

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部