期刊文献+

训练数据有限的英文语音重音标注研究 被引量:1

Stress detection in English sentences with limited training data
下载PDF
导出
摘要 大规模语料库的手工韵律标注消耗大量的时间和人力。这篇论文的目的在于研究如何充分利用少量的手工标注数据训练得到尽可能精确的语音重音自动标注器。论文列举并对比了四种训练方法的效果。在训练中结合声学分类器和语言学分类器,同时使用了综合分类器做后期优化。在实验中,使用机器数据训练声学分类器,并将有限的手工数据用于后期综合分类器能得到最佳的标注正确率。最终的正确率达到了94.0%,与手工标注的正确率上限97.2%比较接近。 It is money and labor consuming to label stressed syllables manually,especially when the speech database is very large.An efficient and reliable automatic prosody labeler is always desired.When training data is limited,how to get the best use of it? This paper proposes the optimization in using training data for automatic stress detection in English speech utterances.The detector consists of a linguistic classifier,an acoustic classifier and an AdaBonst classifier that can improve the accuracy by using more features and manual labels.The best resuh we obtained is 94.0%,which is approaching to the self-agreement ratio (97.2%) of the same annotator,or the upper hound of the performancc.
出处 《计算机工程与应用》 CSCD 北大核心 2007年第33期48-50,共3页 Computer Engineering and Applications
关键词 自动重音检测 自动韵律标注 自动语音识别 automatic stress detection automalic prosody labeler automatic speech recognition
  • 相关文献

参考文献13

  • 1Kuhlen E C.An introduction to English prosody [M].[S.l.]:Edward Arnold, 1986.
  • 2Wightman C W,Ostendorf M.Automatic labeling of prosodic patterns[J].IEEE Trans on Speech and Audio Processing,1994,2(4): 469-481.
  • 3Bulyko I,Ostendorf M.A bootstrapping approach to automating prosodic annotation for constrained domain synthesis [C].Proc of the IEEE Workshop on Speech Synthesis,2002:115-118.
  • 4Conkie A,Riccardi G,Rose R C.Prosody recognition from speech utterances using acoustic and linguistic based models of prosodic events[C].Proc of EUROSPEECH, 1999 : 523-526.
  • 5Imoto K,Tsubota Y,Raux A,et al.Modeling and automatic detection of English sentences accent tbr computer-assisted English prosody learning system[C]//Proc of ICSLP,2002:749-752.
  • 6Chen K,Hasegawa-Johnson M.An automatic prosody labeling system using ANN-based syntactic-prosodic model and GMM-based acoustic-prosodic model[C]//Proc of ICASSP,2004:509-512.
  • 7Arnfield S.Prosody and syntax in corpus based analysis of spoken English[D].University of Leeds, 1994-12.
  • 8Bagshaw P C.Criteria for labelling prosodic aspects of English speech [C]//Proc 4th Australian International Conference on Speech Science and Technology, 1992.
  • 9Lai M,Chen Y N,Chu M,et al.A hierarchical approach to detect stress in English sentences[C].ICASSP 2006.
  • 10Werner S.Toward spontaneous speech synthesis utilizing language model information in TTS [J].IEEE Transactions on speech And Audio Processing, 2004,12(4 ) : 436-444.

同被引文献7

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部