用于抗噪声语音识别的谐振强度特征被引量：1

Harmonic intensity feature for robust speech recognition

导出

摘要基于传统的Mel倒谱系数(MFCC)系列特征的语音识别系统在噪声环境中的识别性能会急剧下降。为了进行噪声环境中的自动语音识别,提出了一种反映语音信号谐振程度的特征:谐振强度,并用之代替传统MFCC特征中的能量维(零维倒谱C0,或者帧能量E)。在展览馆噪声、人群噪声和汽车噪声等情况下的语音识别实验结果表明:基于这种新特征的语音识别系统比基于传统特征的语音识别系统有更高的平均识别率和更好的抗噪声能力。 Automatic speech recognition (ASR) in noisy environments is a challenging problem. The performance of traditional Mel-frequency cepstral coefficient (MFCC) feature based ASR systems is dramatically degraded by additive noise. The harmonic intensity (H) feature was used to develop a robust ASR to replace the zero-order cepstral coefficient (C_0) or frame energy (E) feature in the MFCCs. A C_0-based ASR system, an E-based ASR system, and an H-based ASR system were tested with noise corrupted speech. The results show that the H-based ASR system has higher recognition accuracy and better robustness than the other systems.

作者许超曹志刚

机构地区清华大学电子工程系

出处《清华大学学报（自然科学版）》 EI CAS CSCD 北大核心 2004年第1期22-24,28,共4页 Journal of Tsinghua University(Science and Technology)

基金国家自然科学基金资助项目(60072011)

关键词抗噪声语音识别谐波模型 MEL倒谱系数 speech recognition robustness harmonic model

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献6

1Young S, Evermann G, Kershaw D, et al. The HTK Book [EB/OL]. http://htk.eng.cam.ac.uk/docs/docs.shtml, 2002.
2Mark John Francis Gales. Model-Based Techniques for Noise Robust Speech Recognition [D]. University of Cambridge, Gonville and Caius College, 1995.
3McAulay R J, Quatieri T F. Speech analysis/synthesis based on a sinusoidal representation [J]. IEEE Trans on Acoustics, Speech, and Signal Processing, 1986, 8(4): 744-754.
4Abu-Shikhah N, Deriche M. A Robust technique for harmonic analysis of speech [J]. Proc ICASSP'01 - Proceedings, 2001, (2): 877-880.
5Virtanen T, Klapuri A. Separation of harmonic sounds using linear models for the overtone series [J]. ICASSP'02 - Proceedings, 2002(2): 1757-1760.
6Pearce D, Hirsch H-G. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions [J]. ICSLP'00 - Proceedings, 2000, (4): 29-32.

同被引文献7

1Hermansky H. Perceptual Linear Predictive (PLP) Analysis of Speech. Journal of the Acoustical Society of America, 1990, 87(4): 1738-1752.
2You K H, Wang H C. Robust Features for Noisy Speech Recognition Based on Temporal Trajectory Filtering of Short Time Autocorrelation Sequences. Speech Communication, 1999, 28:13-24.
3Cooke M, Green P, Josifovski L, Vizinho A. Robust Automatic Speech Recognition with Missing and Unreliable Acoustic Data.Speech Communication, 2001, 34:267-285.
4Luo Y, Du L M. Single Gauss Model Set-Based Data Imputation Method for Complex ASR Task. In: Proc of the International Symposium on Circuits and Systems. Bangkok, Thailand,2003, Ⅱ : 564-567.
5Varga A, Steeneken H J M. Assessment for Automatic Speech Recognition: H. NOISEX-92: A Database and an Experiment to Study the Effect of Additive Noise on Speech Recogniiton Systems. Speech Communication, 1993, 12(3), 247-251.
6Young S, etal. The HTK Book (for HTK Version 3.0). Cambridge, UK, Cambridge University Technical Services, 2000.
7蒋文建,林耀荣,韦岗.基于响度特性加权的噪声下语音识别方法[J].模式识别与人工智能,2001,14(2):166-170. 被引量：7

引证文献1

1张军,韦岗,熊燕.基于相对自相关序列MFCC特征的丢失数据带噪语音识别方法[J].模式识别与人工智能,2005,18(1):45-49. 被引量：1

二级引证文献1

1单进,芮贤义.基于压缩感知的稳健性说话人识别[J].电声技术,2011,35(2):61-63. 被引量：2

1由红,陈健.改进的频域基音检测算法[J].上海交通大学学报,2001,35(6):855-858. 被引量：1
2熊燕.抗噪声语音识别技术研究[J].中国科技信息,2006(7):204-205. 被引量：5
3史林,姜敏,黄莉.基于谐波模型的生命探测雷达人体状态识别方法[J].西安电子科技大学学报,2005,32(2):179-183. 被引量：13
4龙潜,孔凡让,刘永斌,刘维来,刘志刚.一种基于MVDR和CCBC的抗噪语音识别方法[J].数据采集与处理,2006,21(3):297-301. 被引量：1
5方国强,滕克难.基于模糊综合评判的弹道中段目标识别技术[J].四川兵工学报,2014,35(3):112-114. 被引量：4
6张亮,房建成.电磁轴承开关功放的谐波模型仿真与实验研究[J].中国电机工程学报,2007,27(21):95-100. 被引量：11
7孙暐,吴镇扬.多带抗噪声语音识别算法研究[J].信号处理,2006,22(4):559-563.
8丁昊,李建忠,安昕,黄勇,关键.实测海杂波数据的多普勒谱特性[J].雷达科学与技术,2012,10(4):400-408. 被引量：9
9万义龙,张天骐,王志朝,金静.基于多频带谱减法的抗噪声语音识别研究[J].电视技术,2013,37(23):183-187. 被引量：5
10韦晓东,朱杰,胡光锐.汽车噪声中自动语音的识别技术[J].上海交通大学学报,1998,32(10):10-13. 被引量：6

清华大学学报（自然科学版）

2004年第1期

浏览历史

内容加载中请稍等...

用于抗噪声语音识别的谐振强度特征被引量：1

参考文献6

同被引文献7

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

用于抗噪声语音识别的谐振强度特征 被引量：1

参考文献6

同被引文献7

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

用于抗噪声语音识别的谐振强度特征被引量：1