Research on Tone Recognition of Mandarin Continuous Speech Based on Multi-Space Probability Distribution (基于多空间概率分布的汉语连续语音声调识别研究)

Cited by: 3

Abstract: Mandarin Chinese is a tonal language, and tone information plays a very important role in Mandarin speech recognition. This paper proposes a method for recognizing the tones of Mandarin continuous speech that combines an embedded tone model with an explicit tone model. The method fuses frame-by-frame fundamental frequency (F0) information with F0 information spanning longer durations. Experiments on the "863-Test" and "TestCorpus98" test sets show that the method achieves tone recognition accuracies of 96.12% and 93.78%, respectively.

Authors: Ni Chongjia (倪崇嘉), Liu Wenju (刘文举), Xu Bo (徐波)

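Affiliations: School of Statistics and Mathematics, Shandong University of Finance; National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences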

Source: Computer Science (《计算机科学》), indexed in CSCD and the Peking University Core Journals list, 2011, No. 9, pp. 224-226, 241 (4 pages in total)

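Funding: Supported by the National Natural Science Foundation of China (90820303, 60675026, 90820011), the National High Technology Research and Development Program of China (863 Program) (20060101Z4073, 2006AA01Z194), and the National Basic Research Program of China (973 Program) (2004CB318105)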

Keywords: tone; fundamental frequency; multi-space probability distribution

Classification: TP319 [Automation and Computer Technology: Computer Software and Theory]
