Trainable prosodic model for standard Chinese Text-to-Speech system 被引量：1

Trainable prosodic model for standard Chinese Text-to-Speech system

导出

摘要 Putonghua prosody is characterized by its hierarchical structure when influenced by linguistic environments. Based on this, a neural network, with specially weighted factors and optimizing outputs, is described and applied to construct the Putonghua prosodic model in Text-to-Speech (TTS) system. Extensive tests show that the structure of the neural network characterizes the Putonghua prosody more exactly than traditional models. Learning rate is speeded up and computational precision is improved, which makes the whole prosodic model more efficient. Furthermore, the paper also stylizes the Putonghua syllable pitch contours with SPiS parameters (Syllable Pitch Stylized Parameters), and analyzes them in adjusting the syllable pitch. It shows that the SPiS parameters effectively characterize the Putonghua syllable pitch contours, and facilitate the establishment of the network model and the prosodic controlling. Putonghua prosody is characterized by its hierarchical structure when influenced by linguistic environments. Based on this, a neural network, with specially weighted factors and optimizing outputs, is described and applied to construct the Putonghua prosodic model in Text-to-Speech (TTS) system. Extensive tests show that the structure of the neural network characterizes the Putonghua prosody more exactly than traditional models. Learning rate is speeded up and computational precision is improved, which makes the whole prosodic model more efficient. Furthermore, the paper also stylizes the Putonghua syllable pitch contours with SPiS parameters (Syllable Pitch Stylized Parameters), and analyzes them in adjusting the syllable pitch. It shows that the SPiS parameters effectively characterize the Putonghua syllable pitch contours, and facilitate the establishment of the network model and the prosodic controlling.

作者 TAO Jianhua, CAI Lianhong, ZHAO Shixia (Department of Computer Science and Technology Tsinghua University Beijing 100084)

出处《Chinese Journal of Acoustics》 2001年第3期257-265,共9页 声学学报（英文版）

基金 This work was supported by the National Natural Science Foundation of China (69875008) and 863National High Technology Project

关键词 Trainable prosodic model for standard Chinese Text-to-Speech system TEXT

分类号 H017 [语言文字—语言学]

引文网络
相关文献

参考文献5

1HUANG Yan,HUANG Taiyi.A neural learning approach for duration parameter generation inPutonghua speech synthesis[].ISCSLP’.1998
2CHEN Sinhorng et al.An RNN-based prosodic information synthesizer for Putonghua text-to-speech[].IEEE Transcations on Speech and Audio Processing.1998
3TAO Jianhua,CAI Lianhong,ZHONG Yuzuo.The context-based method of creating Chineseprosodic model[].ISSPR’.1998
4YANG Shunan.A tonal model for synthesizing polysyllabic words and phrases in standard Chinese[].Essays on Linguistics.1990
5XU Chingx,XU Yi,LUO Lishi.A pitch target approximation model for FO contours in Putonghua[].ICPHS San Francisco.1999

引证文献1

1张皖志,陶建华.基于声韵母基元的嵌入式中文语音合成系统[J].信号处理,2005,21(z1):216-219. 被引量：1

二级引证文献1

1张小燕,宿建军,薛化建,王磊.维吾尔语语音识别语料库中的OOV研究[J].计算机工程与设计,2012,33(2):772-776. 被引量：4

1SHAO Yanqiu HAN Jiqing ZHAO Yongzhen LIU Ting.Study on automatic prediction of sentential stress for Chinese Putonghua Text-to-Speech system with natural style[J].Chinese Journal of Acoustics,2007,26(1):49-62. 被引量：2
2陈虎.自然语言的重音分布及其语义解释——西方研究综述[J].现代外语,2003,26(1):93-103. 被引量：21
3王利,王永生.基于语块的英语文语转换系统的韵律生成方法[J].计算机辅助工程,2007,16(1):44-47.

Chinese Journal of Acoustics

2001年第3期

浏览历史

内容加载中请稍等...

Trainable prosodic model for standard Chinese Text-to-Speech system 被引量：1

参考文献5

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史