Stress detection in English sentences with limited training data (original title: 训练数据有限的英文语音重音标注研究). Cited by: 1

Abstract: Manually labeling prosody in a large speech corpus costs a great deal of time and labor, so an efficient and reliable automatic prosody labeler is always desirable. This paper studies how to make the best use of a small amount of manually labeled data to train an automatic stress detector for English speech that is as accurate as possible. Four training schemes are described and compared. The detector combines an acoustic classifier and a linguistic classifier, and an AdaBoost combining classifier is applied afterwards to improve accuracy by using more features and the manual labels. In the experiments, the best labeling accuracy was obtained by training the acoustic classifier on machine-labeled data and reserving the limited manual labels for the final combining classifier. The resulting accuracy is 94.0%, which approaches the same annotator's self-agreement ratio of 97.2%, taken as the upper bound on performance.
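As a rough illustration of the combining step mentioned in the abstract, the sketch below boosts decision stumps over two classifier scores. It is a toy under stated assumptions, not the paper's implementation: the abstract does not specify the features, the weak learners, or the data, so the synthetic scores, the stump learner, and all function names here (`make_toy_data`, `train_stump`, `adaboost_fit`, and so on) are hypothetical.

```python
import math
import random


def make_toy_data(n, seed=0):
    """Synthetic syllables: [acoustic_score, linguistic_score] and a +1/-1 stress label."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        label = rng.choice([1, -1])
        # Assumption: each base classifier emits a noisy score correlated with the label.
        X.append([label + rng.gauss(0, 1.0), label + rng.gauss(0, 1.0)])
        y.append(label)
    return X, y


def stump_predict(stump, x):
    feature, threshold, sign = stump
    return sign if x[feature] > threshold else -sign


def train_stump(X, y, w):
    """Return (weighted_error, stump) for the best single-feature threshold rule."""
    best_err, best_stump = float("inf"), None
    for feature in range(len(X[0])):
        values = sorted(set(x[feature] for x in X))
        # Candidate thresholds are midpoints between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            for sign in (1, -1):
                stump = (feature, (lo + hi) / 2, sign)
                err = sum(wi for x, yi, wi in zip(X, y, w)
                          if stump_predict(stump, x) != yi)
                if err < best_err:
                    best_err, best_stump = err, stump
    return best_err, best_stump


def adaboost_fit(X, y, rounds=10):
    """Discrete AdaBoost: reweight samples so later stumps focus on earlier mistakes."""
    n = len(X)
    w = [1.0 / n] * n
    model = []
    for _ in range(rounds):
        err, stump = train_stump(X, y, w)
        err = min(max(err, 1e-10), 1 - 1e-10)  # keep the log finite
        alpha = 0.5 * math.log((1 - err) / err)
        model.append((alpha, stump))
        # Misclassified samples gain weight, correctly classified ones lose weight.
        w = [wi * math.exp(-alpha * yi * stump_predict(stump, x))
             for x, yi, wi in zip(X, y, w)]
        total = sum(w)
        w = [wi / total for wi in w]
    return model


def adaboost_predict(model, x):
    score = sum(alpha * stump_predict(stump, x) for alpha, stump in model)
    return 1 if score >= 0 else -1


def accuracy(model, X, y):
    return sum(adaboost_predict(model, x) == yi for x, yi in zip(X, y)) / len(y)


if __name__ == "__main__":
    X_train, y_train = make_toy_data(200, seed=0)
    X_test, y_test = make_toy_data(200, seed=1)
    model = adaboost_fit(X_train, y_train, rounds=10)
    print("train accuracy:", accuracy(model, X_train, y_train))
    print("test accuracy:", accuracy(model, X_test, y_test))
```

In a setting like the one the abstract describes, the small manually labeled set would play the role of `X_train` and `y_train` for the combiner, while the base classifiers that produce the scores could be trained on other data.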

Authors: 赖珉 (Lai Min), 陈一宁 (Chen Yining), 初敏 (Chu Min), 胡访宇 (Hu Fangyu)

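Affiliations: Department of Electronic Engineering and Information Science, University of Science and Technology of China; Microsoft Research Asia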

Source: Computer Engineering and Applications (《计算机工程与应用》), indexed in CSCD and the Peking University Core Journals list, 2007, Issue 33, pp. 48-50 (3 pages)

Keywords: automatic stress detection; automatic prosody labeler; automatic speech recognition

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]
