基于非线性取值DTW算法的鲁棒性语音识别系统

Robust Speech Recognition System Based on Nonlinear Extraction Dynamic Time Warping

下载PDF

导出

摘要提出了一个在噪声环境下高效的语音识别系统。针对端点检测,提出了基于平滑函数的检测方法,从而提高了利用短时能量算法的检测精度。运行频谱滤波器方法在能量频谱和对数频谱用了两次带通滤波器减少噪声,在对数频谱内用倒谱均值相减的方法去除卷积噪声,从而减少了计算量。对于普通DTW(Dynamic Time Warpin)算法得到某个测试语音与该语音所有的参考语音相似值,应用一个非线性中值滤波器取中间某个值的方法来进行识别,从而提高了DTW算法的识别精度。利用少量参考语音,实现了高于HMM的识别精度同时又减少了训练的花费时间。 In this paper, an efficient robust speech recognition system in noisy environment was proposed. A smooth function is used to short time energy （STE）, which has improved the detection accuracy of STE. The complexity of running spectrum filtering is high, because two band-pass filter are used. Hence, the cepstrum mean subtraction （CMS） was used to reduce the convolution noise in logarithm spectrum, and the calculation is reduced more much. Unlike convemional DTW （Dynamic Time Warping） algorithms, which search for the reference word with minimum distance from the unknown speech waveform, a nonlinear median filter （NMF） was used and the reference word with minimum median distance from the unknown speech waveform was searched for.DTW implementations can be improved substantially. In this approach yields, DTW recognition accuracy is higher than that of the HMM techniques. However, the training is saved.

作者张宇昕丁岩

机构地区长春理工大学计算机科学技术学院

出处《长春理工大学学报（自然科学版）》 2013年第6期144-148,107,共6页 Journal of Changchun University of Science and Technology(Natural Science Edition)

关键词动态时间规划短时能量运行频谱滤波器非线性中值滤波器 DTW short time energy running spectrum filtering nonlinear median filter

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献13

1Rabiner L, Juang B H. Fundamemals of speech rec- ognitionM]. 1st ed USA:Prentice Hall PTR,1993.
2Erdol N, Castelluccia C, Zilouehian A. Recovery of missing speech packets using the short-time energy and zero-crossing measurements[J].IEEE Transac- tions on Speech and Audio Processing, 1993, 1(3): 295-303.
3苏广川.强噪声环境下汉语语音识别的模糊分类算法[J].北京理工大学学报,1997,17(6):686-690. 被引量：4
4Hermansky H, Morgan N. RASTA processing of speech E J]. IEEE Transactions on Speech and Au- dio Processing, 1994,2(4) 578-589.
5Hermansky H, Morgan N, Hirsch H G. Recognition of speech in additive and convolutional noise based on RASTA spectral processing[C]. IEEE Interna- tional Conference on Acoustics, Speech, and Signal Processing (ICASSP), 1993.83-86.
6Fujioka K, Miyanaga Y. A new noise reduction method of speech signal with running spectrum fil- tering [C].Intemational Symposium on Intelligent Signal Processing and Corrlnmnication Systems, 2004:173-176.
7Hayasaka N, Miyanaga Y, Wada N. Running spec- trum filtering in speech recognition[C].Intemational Conference on Soft Computing and Intelligent Sys- tems (SCIS), 2002:154-157.
8Hayasaka N, Khankhavivone K, Miyanaga Y, et al. New robust speech recognition by using nonlinear running spectrum filter[C].Intemational Symposium on Communications and Information Technologies, 2006:133-136.
9Itakura F. Minimum prediction residual principle applied to speech recognition[J]. IEEE Transac- tions on Acoustics, Speech and Signal Processing, 1975,23(1 ) : 67-72.
10Sakoe H, Chiba S. Dynamic programming algo- rithm optimization for spoken word recognition[J]. IEEE Transactions on Acoustics, Speech and Sig- nal Processing, 1978,26(1) :43-49.

二级参考文献1

1苏广州.模糊算法在语音分析和合成中的应用[J].北京理工大学学报,1995,15(2):138-142. 被引量：1

共引文献3

1殷建,殷业.模糊模式识别在语音关键词识别中的应用[J].常州信息职业技术学院学报,2009,8(1):1-3.
2杨亚涛,张元.强噪声背景下确定性谐波信号的模糊滤波[J].微电子学与计算机,2002,19(12):22-24.
3黄石磊,武剑虹,匡镜明.用于语音识别的减谱结合RASTA的抗噪声方法[J].北京理工大学学报,2003,23(5):621-624. 被引量：1

1杨润辉,吴清江.基于步态的身份识别综述[J].电脑开发与应用,2007,20(9):30-32. 被引量：1
2马宁,陈晓冬,李亚楠,尹青云,汪毅,郁道银.内窥镜自动定位语音识别系统[J].计算机工程与应用,2014,50(8):207-210. 被引量：2
3吕雁,苏新主.一种基于背景预测的红外杂波抑制新方法[J].系统工程与电子技术,2007,29(8):1271-1274. 被引量：6
4刘敬伟,程乾生.基于动态时间规划的基因芯片数据识别[J].北京大学学报（自然科学版）,2002,38(5):611-615. 被引量：1
5吴进,张青.一种改进的孤立词语音识别系统设计[J].西安邮电大学学报,2016,21(1):76-80. 被引量：4
6魏星,周萍.改进型蚁群算法的语音动态规划研究[J].计算机仿真,2011,28(5):402-405. 被引量：7
7朱春媚,黎萍.基于支持向量机的咳嗽自动识别[J].计算机与现代化,2016(7):111-114.
8王一梅,贾克斌,庄新月.一种基于动态时间规划的视频特征检索改进算法[J].高技术通讯,2007,17(5):464-469. 被引量：1
9古今,郭立,郑东飞.一种基于感知特性的鲁棒性语音认证算法[J].中国科学院研究生院学报,2009,26(4):474-482.
10李海涛.基于DTW约束的动作行为识别[J].计算机仿真,2014,31(11):227-230. 被引量：4

长春理工大学学报（自然科学版）

2013年第6期

浏览历史

内容加载中请稍等...

基于非线性取值DTW算法的鲁棒性语音识别系统

参考文献13

二级参考文献1

共引文献3

相关作者

相关机构

相关主题

浏览历史