期刊文献+

面向语音合成的藏语音素切分算法研究

Study on Tibetan Phoneme Segmentation Algorithms Facing Speech Synthesis
下载PDF
导出
摘要 文章通过采用两种方法对藏语语音合成语料库中的语音进行音素切分:一种是基于单音素HMM模型的自动切分方法,一种是传统的人工切分方法,并通过实验分析了自动切分与人工切分方法的准确率程度.实验结果表明:在构建语料库时,前者有助于缩短建库周期,尤其对于大语料库的建立会有明显的优势.这种方法既节省了切分与标注的大量时间和人力成本,又提高了语音语料库标注信息的精确度和一致性. This paper adopted two methods being used for phoneme segmentation for Tibetan speech synthesis corpus:one was based on single phoneme HMM model automatic segmentation;the other was the traditional manual segmentation way.The accuracy degree between automatic and manual segmentation was analyzed through the experiments.The results of experiment showed that the automatic segmentation is helpful for shortening the cycle duration in building corpus process,especially for the establishment of large corpus.A lot of time for segmentation and labeling was reduced,the accuracy and consistency of speech corpus labeling information has been improved.
出处 《西北民族大学学报(自然科学版)》 2012年第4期27-31,共5页 Journal of Northwest Minzu University(Natural Science)
基金 国家自然基金项目(61262054) 西北民族大学中央高校基本科研业务费专项(ycx12024)
关键词 音素自动切分 藏语 语音合成 语料库 Phoneme automatic segmentation Tibetan Speech synthesis Corpus
  • 相关文献

参考文献6

二级参考文献23

  • 1郑玉玲.藏语方言语音量化分析[J].民族语文,1998(5):42-50. 被引量:4
  • 2孔江平.藏语(拉萨话)声调感知研究[J].民族语文,1995(3):56-64. 被引量:42
  • 3朱亚喆,柴佩琪.语音合成系统中语音库的设计与实现[J].计算机工程,1997,23(S1):45-46. 被引量:2
  • 4Brugnara F, Falavigna D, Omologo M. Automatic Segmentation and Labeling of Speech Based on Hidden Markov Models. Speech Comm,1993,12:357-370
  • 5Donovan R E, Woodland P C. A Hidden Markov Model Based Trainable Speech Synthesiser. Computer Speech and Language, 1999,13(3): 223-242
  • 6Doroteo Torre Toledano, Luis A Hernandez Gomez. Automatic Phonetic Segmentation [J]. IEEE Transactions on speech and audio processing, November 2003,11(6): 617~625.
  • 7Abhinav Sethy, Shrikanth Narayanam. Refined Speech Segmentation for Concatenative Speech Synthesis[C]. Proceeding of ICSLP, Denver, Colorado, USA, September 2002:145~148.
  • 8KI- Seung Lee, Jeong Su Kim. Context- adaptive Phone Boundary Refining for a TTS Database [C]. Proceeding of ICASSP, Hongkong, China, April 2003: 252~255.
  • 9Eun-Young Park, Sang-Hun Kim, Jae-Ho Chung. Automatic Speech Synthesis Unit Generation with MLP based Postprocessor Against Auto-segmented Phoneme Errors[C]. Proceeding of ICASSP, Phoenix, Arizona, March 1999:2985~2990.
  • 10Odell J, Ollason D, Woodland P, et al. The HTK Book for HTK V3.0 [M]. Cambridge University Press, Cambridge,UK, 2001.

共引文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部