
Study on automatic prediction of sentential stress for Chinese Putonghua Text-to-Speech system with natural style (cited by: 2)

Abstract: Stress is an important parameter for prosody processing in speech synthesis. In this paper, we compare the acoustic features of neutral-tone syllables and strongly stressed syllables with those of moderately stressed syllables, including pitch, syllable duration, intensity, and the length of the pause after the syllable. The relations between duration and pitch, and between the Third Tone (T3) and pitch, are also studied. Three ANN-based stress prediction models, namely the acoustic model, the linguistic model, and the mixed model, are presented for predicting Chinese sentential stress. The results show that the mixed model performs better than the other two models. To address the diversity of manual labeling, an evaluation index called the support ratio is proposed.
Source: Chinese Journal of Acoustics (声学学报(英文版)), 2007, No. 1, pp. 49-62 (14 pages)
Funding: This work was supported by the National Natural Science Foundation of China (No. 60085001).
