期刊文献+

基于非线性取值DTW算法的鲁棒性语音识别系统

Robust Speech Recognition System Based on Nonlinear Extraction Dynamic Time Warping
下载PDF
导出
摘要 提出了一个在噪声环境下高效的语音识别系统。针对端点检测,提出了基于平滑函数的检测方法,从而提高了利用短时能量算法的检测精度。运行频谱滤波器方法在能量频谱和对数频谱用了两次带通滤波器减少噪声,在对数频谱内用倒谱均值相减的方法去除卷积噪声,从而减少了计算量。对于普通DTW(Dynamic Time Warpin)算法得到某个测试语音与该语音所有的参考语音相似值,应用一个非线性中值滤波器取中间某个值的方法来进行识别,从而提高了DTW算法的识别精度。利用少量参考语音,实现了高于HMM的识别精度同时又减少了训练的花费时间。 In this paper, an efficient robust speech recognition system in noisy environment was proposed. A smooth function is used to short time energy (STE), which has improved the detection accuracy of STE. The complexity of running spectrum filtering is high, because two band-pass filter are used. Hence, the cepstrum mean subtraction (CMS) was used to reduce the convolution noise in logarithm spectrum, and the calculation is reduced more much. Unlike convemional DTW (Dynamic Time Warping) algorithms, which search for the reference word with minimum distance from the unknown speech waveform, a nonlinear median filter (NMF) was used and the reference word with minimum median distance from the unknown speech waveform was searched for.DTW implementations can be improved substantially. In this approach yields, DTW recognition accuracy is higher than that of the HMM techniques. However, the training is saved.
作者 张宇昕 丁岩
出处 《长春理工大学学报(自然科学版)》 2013年第6期144-148,107,共6页 Journal of Changchun University of Science and Technology(Natural Science Edition)
关键词 动态时间规划 短时能量 运行频谱滤波器 非线性中值滤波器 DTW short time energy running spectrum filtering nonlinear median filter
  • 相关文献

参考文献13

  • 1Rabiner L, Juang B H. Fundamemals of speech rec- ognitionM]. 1st ed USA:Prentice Hall PTR,1993.
  • 2Erdol N, Castelluccia C, Zilouehian A. Recovery of missing speech packets using the short-time energy and zero-crossing measurements[J].IEEE Transac- tions on Speech and Audio Processing, 1993, 1(3): 295-303.
  • 3苏广川.强噪声环境下汉语语音识别的模糊分类算法[J].北京理工大学学报,1997,17(6):686-690. 被引量:4
  • 4Hermansky H, Morgan N. RASTA processing of speech E J]. IEEE Transactions on Speech and Au- dio Processing, 1994,2(4) 578-589.
  • 5Hermansky H, Morgan N, Hirsch H G. Recognition of speech in additive and convolutional noise based on RASTA spectral processing[C]. IEEE Interna- tional Conference on Acoustics, Speech, and Signal Processing (ICASSP), 1993.83-86.
  • 6Fujioka K, Miyanaga Y. A new noise reduction method of speech signal with running spectrum fil- tering [C].Intemational Symposium on Intelligent Signal Processing and Corrlnmnication Systems, 2004:173-176.
  • 7Hayasaka N, Miyanaga Y, Wada N. Running spec- trum filtering in speech recognition[C].Intemational Conference on Soft Computing and Intelligent Sys- tems (SCIS), 2002:154-157.
  • 8Hayasaka N, Khankhavivone K, Miyanaga Y, et al. New robust speech recognition by using nonlinear running spectrum filter[C].Intemational Symposium on Communications and Information Technologies, 2006:133-136.
  • 9Itakura F. Minimum prediction residual principle applied to speech recognition[J]. IEEE Transac- tions on Acoustics, Speech and Signal Processing, 1975,23(1 ) : 67-72.
  • 10Sakoe H, Chiba S. Dynamic programming algo- rithm optimization for spoken word recognition[J]. IEEE Transactions on Acoustics, Speech and Sig- nal Processing, 1978,26(1) :43-49.

二级参考文献1

共引文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部