Robust Speech Recognition Using a Harmonic Model

Abstract: Automatic speech recognition in noisy environments remains a challenging problem. Traditionally, methods that focus on noise structure, such as spectral subtraction, have been employed to address this problem, so the performance of such methods depends on the accuracy of the noise estimate. In this paper, an alternative method based on a harmonic-based spectral reconstruction algorithm is proposed to improve the robustness of automatic speech recognition. The proposed approach requires neither noise estimation nor noise-model training. A spectral subtraction integrated autocorrelation function is proposed to determine the pitch for the harmonic model. Recognition results show that the harmonic-based spectral reconstruction approach outperforms spectral subtraction in the middle and low signal-to-noise ratio (SNR) ranges. The advantage of the proposed method is more pronounced for non-stationary noise, as the algorithm does not assume that the noise is stationary.
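
The abstract does not give the details of the spectral subtraction integrated autocorrelation function, so they are not reproduced here. As a rough illustration of the underlying idea only, the Python sketch below estimates the pitch of a single frame from a plain autocorrelation function. The function name, the 60 to 400 Hz search range, and the use of NumPy are assumptions made for this example and are not taken from the paper.

```python
import numpy as np


def estimate_pitch_autocorr(frame, sample_rate, f_min=60.0, f_max=400.0):
    """Estimate the fundamental frequency (Hz) of one frame.

    Plain autocorrelation peak picking; NOT the paper's spectral
    subtraction integrated variant. f_min and f_max are assumed
    bounds for the pitch search range.
    Returns None when no estimate can be made.
    """
    frame = np.asarray(frame, dtype=float)
    frame = frame - frame.mean()  # remove DC offset

    # Full autocorrelation; keep only the non-negative lags.
    acf = np.correlate(frame, frame, mode="full")[len(frame) - 1:]

    # Convert the frequency search range into a lag range (in samples).
    lag_min = max(1, int(sample_rate / f_max))
    lag_max = min(int(sample_rate / f_min), len(acf) - 1)
    if acf[0] <= 0.0 or lag_max <= lag_min:
        return None  # silent frame or frame too short for the range

    # The strongest autocorrelation peak in the range gives the pitch period.
    best_lag = lag_min + int(np.argmax(acf[lag_min:lag_max + 1]))
    return sample_rate / best_lag
```

In a harmonic model, an estimate of this kind would supply the fundamental frequency whose integer multiples locate the harmonics; how the paper combines it with spectral subtraction is not stated in the abstract.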

Authors: 许超 (Xu Chao), 曹志刚 (Cao Zhigang)

Affiliation: Department of Electronic Engineering

Source: Tsinghua Science and Technology (SCIE, EI, CAS), 2004, Issue 2, pp. 202-206 (5 pages). Chinese journal title: 清华大学学报（自然科学版）（英文版）

Funding: Supported by the National Natural Science Foundation of China (No. 60072011)

Keywords: robust speech recognition; speech enhancement; pitch extraction; harmonic model

Classification: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]

