利用声调提高中文连续数字串语音识别系统性能被引量：3

Improving the Performance of Continuous Mandarin Digit String Recognition System by Using Tones

下载PDF

导出

摘要采用平均幅度差法、自相关函数法和简单逆滤波器跟踪法相结合的方法计算语音信号的基音频率.根据声调模式的不同,采用基于规则的方法对声调进行识别,对连续数字串识别中一些易混淆的数字对进行区分,从而达到提高数字串识别系统性能的目的. According to the extracted pitch curve, it can discriminate between four Mandarin tones. A composite algorithm for pitch extraction was promoted which integrates AMDF, auto correlation and simple inverse filtering trucking (SIFT) algorithm by using some rules. In the Mandarin continuous digit string recognition system, the tones are used to discriminate some confusing digit pairs, which can improve the system's recognition rate.

作者章文义朱杰徐向华

机构地区上海交通大学电子工程系

出处《上海交通大学学报》 EI CAS CSCD 北大核心 2004年第2期185-188,共4页 Journal of Shanghai Jiaotong University

基金上海市科学技术委员会基础研究基金(01JC14033) 美国贝尔实验室上海分部资助项目

关键词语音识别声调数字识别 Computer simulation Correlation methods

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献7

1[1]Zhang J S, Hirose K. Anchoring hypothesis and its application to tone recognition of Chinese continuous speech acoustics [A]. Proc IEEE Int Conf Acoust,Speech, Signal Processing [C]. Istanbul, Turkey:ICASSP, 2000. 1419-1422.
2[2]u Y, Hemmi K, Inoue K. A tone recognition of polysyllabic Chinese words using an approximation model of four tone pitch patterns[A]. Proc Industrial Electronics, Control and Instrumentation Proceeding[C]. Asilomar, Califormia, USA: IECON,1991. 2115-2119.
3[3]Zhang G L, Zheng F, Wu W H. Tone recognition of Chinese continuous speech[A]. International Symposium on Chinese Spoken Language Processing[C].Beijing: ISCSLP, 2000. 207-210.
4[4]Kobayashi H, Shimamura T. A weighted autocorrelation method for pitch extraction of noisy speech[A]. Proc IEEE Int Conf Acoust, Speech, Signal Processing[C]. Istanbul, Turkey: ICASSP, 2000.1307- 1310.
5[5]Hemandez D H, Huici M E, Lorenzo G J. Combined algorithm for pitch detection of speech signals [J].Electronics Letters, 1995, 31 ( 5 ): 15 - 16.
6[6]Samad S A, Hussain A, Low K F. Pitch detection of speech signals using the cross correlation technique[A]. Intelligent Systems and Technologies for the Next Millenium[C]. Kuala Lumpur Malaysia: TENCON, 2000. 283-286.
7[7]Cherif A. Pitch and formants extraction algorithm for speech processing[A]. Proc IEEE Int Conf Electronics, Circuits and Systems[C]. Kaslik, Lebanon:ICECS, 2000. 595-598.

同被引文献39

1王韫佳.音高和时长在普通话轻声知觉中的作用[J].声学学报,2004,29(5):453-461. 被引量：33
2[英]克里斯特安尼,等(著).李国正,王猛,曾华军(译).支持向量机导论[M].北京:电子工业出版社,2004.
3顾明亮,夏玉果,王劲松.噪声环境下的汉语声调识别[J].计算机技术与发展,2007,17(8):70-72. 被引量：2
4王欢良,钱瑶,F.K.Soong,韩纪庆.基于声调建模的带噪汉语数字串语音识别[J].声学学报,2007,32(5):454-460. 被引量：2
5Huang C H, Side F. Pitch tracking and tone features for mandarin speech recognition. Proceedings of the 25th International Conference on Acoustics, Speech and Signal Processing, Istanbul, Turkey, 2000; 3:1523-1526
6Lei X, S M, Hwang M, Ostendorf M et al. Improved tone modeling for mandarin broadcast news speech recognition. In: Proceedings of Interspeech (ICSLP), Pittsburgh, USA, 2006:1277-1280
7Wang H L, Qian Y, Soong F K, Zhou J L et al. Improved Mandarin Speech Recognition by Lattice Rescoring with Enhanced Tone models. In: Proceedings of International Symposium on Chinese Spoken Language Processing, 2006: 445-443
8Yang W J, Lee J C, Chang Y C et al. Hidden Markov Model for Mandarin lexical tone recognition. IEEE Trans. on Acoustic Speech and Signal Processing, 1988; 36(7): 988-992
9Thubthong N, Kijsirikul B, Tone recognition of continuous Thai speech under tonal assimilation and declination effects using half-tone model. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 2001; 9(6): 815-825
10CAO Yang, ZHANG Shu Wu, HUANG Tai Yi et al. Tone modeling for continuous Mandarin speech recognition. International Journal of Speech Technology, 2004; 7(2-3): 115-128

引证文献3

1黄浩,朱杰.汉语语音识别中基于区分性权重训练的声调集成方法[J].声学学报,2008,33(1):1-8. 被引量：2
2HUANG Hao ZHU Jie.Tone model integration based on discriminative weight training for Putonghua speech recognition[J].Chinese Journal of Acoustics,2008,27(3):193-202.
3沈泉波,韩慧莲.基于支持向量机的汉语声调识别[J].电子世界,2013(4):74-75.

二级引证文献2

1侯丽敏,黄振华,谢娟敏.声门下共鸣的谱规整用于非特定人的语音识别[J].声学学报,2010,35(5):580-586.
2晁浩,杨占磊,刘文举.基于发音特征的汉语声调建模方法及其在汉语语音识别中的应用[J].计算机应用,2013,33(10):2939-2944. 被引量：2

1周利清.非特定人的语音数字识别硬件系统[J].电信科学,1991,7(3):36-40. 被引量：1
2章文义,朱杰.一种综合的基音提取方法[J].计算机应用与软件,2004,21(2):12-13. 被引量：2
3李娟.几种基音周期算法性能比较[J].运城学院学报,2010,28(2):35-37. 被引量：3
4满高华,宁存岱,李婷,周玲.基于Hopfiled离散型数字图像识别[J].硅谷,2011,4(13):193-194.
5曾水玲,徐蔚鸿.基于支持向量机的手写体数字识别[J].计算机与数字工程,2006,34(10):104-106. 被引量：9
6王跟东,林道发.非特定人连续汉语数字语音识别[J].模式识别与人工智能,1993,6(4):347-351. 被引量：1
7陈小利,徐金甫.基于小波变换和时域波形的基音检测算法[J].现代电子技术,2011,34(1):77-79. 被引量：4
8赵转萍,聂开俊.静态背景下移动数字快速捕获与识别方法研究[J].传感器与微系统,2006,25(5):28-30.
9王建华,卢鑫.基于DSP的图像式彩票数字识别终端[J].深圳信息职业技术学院学报,2009,7(2):19-23.
10汤霖,蔡莲红.基于层级策略的连续数字串识别的研究[J].计算机工程与应用,2003,39(21):83-86.

上海交通大学学报

2004年第2期

浏览历史

内容加载中请稍等...

利用声调提高中文连续数字串语音识别系统性能被引量：3

参考文献7

同被引文献39

引证文献3

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

利用声调提高中文连续数字串语音识别系统性能 被引量：3

参考文献7

同被引文献39

引证文献3

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

利用声调提高中文连续数字串语音识别系统性能被引量：3