利用语音的频谱空间特征进行汉语抗噪语音识别的方法

Spatial characteristics of speech spectrum for robust speech recognition

下载PDF

导出

摘要抗噪连续语音识别是当前汉语连续语音识别的重要研究领域。采用通过度量连续语音帧之间频谱的稳定性,将连续语音切分成份,再将切分结果(无论时间长短)变换为与时间无关的大小固定的频谱空间特征,通过与模板库进行比较实现语音识别。新的频谱空间特征,与语音时长无关,同时表现出较好的抗噪声能力。在特定人连续语音识别测试系统中,取得了不错的识别效果。 The anti-noise continuous speech recognition is an important research topic of current Chinese continuous speech recognition. In this paper, by measuring the stability of the frequency spectrum between the continuous speech frames, speech signal can be segmented, and then these segmentations（regardless of the length of time） are transformed into time-independent and size-fixed spatial characteristics of speech spectrum. By comparing to speech template, the speech recognition results are obtained. The new spatial characteristics of speech spectrum are independent of speech length, and show better immunity to noise. In a specific continuous speech recognition testing system, favorable recognition result is obtained.

作者张永锋田勇张阳

机构地区大连东软信息学院电子工程系

出处《声学技术》 CSCD 北大核心 2015年第1期51-53,共3页 Technical Acoustics

关键词语音特征连续语音识别抗噪语音识别 speech characteristics continuous speech recognition anti-noise speech recognition

分类号 TB533 [理学—声学]

引文网络
相关文献

参考文献3

1杨占磊,刘文举,晁浩.融合引导概率的语音识别解码算法研究[J].声学学报,2012,37(2):209-217. 被引量：1
2张永锋,杨影,肖莹莹.基于主成分分析的汉语连续语音切分算法[J].应用声学,2011,30(5):366-369. 被引量：3
3任艳斐.直方图均衡化在图像处理中的应用[J].科技信息,2007(4):37-38. 被引量：37

二级参考文献16

1冯清枝.基于直方图修正的图像增强技术[J].广东公安科技,2004(2):49-51. 被引量：9
2林帆,徐明星.一种改进的基于时域参数的语音切分算法[J].计算机科学,2006,33(4):164-167. 被引量：3
3Rabiner L R. A tutorial on hidden Markov models and selected applications in speech recognition. In: Proc. IEEE, 1989; 77(2): 257--285.
4Povey D. A tutorial-style introduction to subspace gaussian mixture models for speech recognition. Tech. Rep., Tech. Rep., MSR-TR-2009-111, Microsoft Research, 2009.
5Povey D et al. Subspace Gaussian Mixture Models for Speech Recognition. In: Proc. of ICASSP2010, 2010: 4330--4333.
6Xavier L Aubert. An overview of decoding techniques tor large vocabulary continuous speech recognition. Computer Speech and Language, 2002; 16(1): 89--114.
7Demuynck K, Duchateuu J, van Compernolle D, Wambacq P. An efficient search space representation for large vocabulary continuous speech recognition. Speech Commun., 2000: 30(1): 37--53.
8Ney H, Ortmanns S. Progress in dynamic programming search for LVCSR. In: Proc. IEEE, 2000; 88:1224--1240.
9YANG Zhanlei, LIU Wenju. A novel path extension frame- work using steady segment detection for Mandarin speech recognition. In: Proc. of Interspeech 2010, Makuhari, Japan, 2010:226--229.
10Povey D, Chu S M, Varadarajan B. Universal background model based speech recognition. In: Proc. ICASSP, 2008: 4561--4564.

共引文献38

1黄靖,杨丰.基于空频结合的图像增强的脑肿瘤分割[J].光子学报,2012,41(7):850-854. 被引量：4
2李星,李向群.透射电子显微镜负染图像增强处理[J].甘肃科技,2009,25(7):20-21. 被引量：2
3欧阳彝华,黄芳,周敏.基于灰度直方图的心脏图像检索[J].计算机技术与发展,2009,19(9):125-127. 被引量：2
4仲岑然.基于Matlab的混纺纱横截面切片图像客观分析法[J].毛纺科技,2010,38(6):59-62. 被引量：1
5方飞.应用数字图像增强技术分析牙本质显微结构[J].临床口腔医学杂志,2011,27(10):591-593.
6邹江,闫树斌.红外图像综合处理算法研究[J].电子测试,2012,23(3):24-29. 被引量：1
7邱璇,黄靖,杨丰,邢栋,涂圣贤.结合图像增强的心血管内超声中-外膜边缘检测[J].中国图象图形学报,2012,17(4):537-545. 被引量：2
8张陈梅,陈芬,吴明昊,严迪群,彭宗举.基于SEED-DTK6437的视频图像增强系统设计[J].微型机与应用,2013,32(5):32-34.
9李玉三.疲劳驾驶中人脸检测问题的研究[J].电脑与电信,2013(6):46-48.
10刘谞承,张玉环,梁明易.输气管道建设项目景观及生态系统影响评价——以广东省天然气管网一期工程为例[J].环境与发展,2013,25(11):118-124. 被引量：1

1PENG Di LIU Gang GUO Jun.Study on Acoustic Modeling in a Mandarin Continuous Speech Recognition[J].Journal of China University of Mining and Technology,2007,17(1):143-146. 被引量：1
2W.T.S.数码相机的逆光拍摄[J].数码,2003(7):101-103.
3新书介绍[J].纺织服装教育,2013,28(5):348-348.
4姜哲,吴卫国.基于时域声辐射模态的结构噪声主动控制研究[J].江苏大学学报（自然科学版）,2004,25(5):453-456. 被引量：4
5袁莉芬,刘辉,程俊.基于独立成分分析技术的语音除噪系统[J].湖南师范大学自然科学学报,2011,34(3):24-26. 被引量：2
6刘青春,钱奇霞.竹材美学要素与设计应用探析[J].包装工程,2012,33(16):72-76. 被引量：8
7秦宪刚,张侃.刺激空间特征和反应位置对线索效应模式的影响[J].人类工效学,2006,12(1):7-10. 被引量：4
8我是设计师——绝非方形[J].照明设计,2009(5):16-18.
9史作义,王学营,刘娜,南照东,李修善.阴离子表面活性剂十二烷基苯磺酸钠为模板制备球霰石[J].曲阜师范大学学报（自然科学版）,2007,33(1):75-79. 被引量：7
10陈秉松.如入无人之境[J].数码世界,2014,0(8):74-75.

声学技术

2015年第1期

浏览历史

内容加载中请稍等...

利用语音的频谱空间特征进行汉语抗噪语音识别的方法

参考文献3

二级参考文献16

共引文献38

相关作者

相关机构

相关主题

浏览历史