Study on automatic prediction of sentential stress for Chinese Putonghua Text-to-Speech system with natural style (cited by: 2)



Abstract: Stress is an important parameter for prosody processing in speech synthesis. In this paper, we compare the acoustic features of neutral tone syllables and strong stress syllables with those of moderate stress syllables, including pitch, syllable duration, intensity, and the length of the pause after the syllable. The relation between duration and pitch, as well as that between the Third Tone (T3) and pitch, is also studied. Three ANN-based stress prediction models, namely the acoustic model, the linguistic model, and the mixed model, are presented for predicting Chinese sentential stress. The results show that the mixed model performs better than the other two models. To address the variability of manual labeling, an evaluation index called the support ratio is proposed.
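The relationship between the three models' inputs can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the four acoustic cues are the ones named in the abstract, while the linguistic cues (`tone`, `position_in_word`, `word_length`), the `Syllable` type, and all function names are hypothetical, since the abstract does not list the linguistic features. The mixed model is assumed here to take the concatenation of both feature sets.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Syllable:
    # Acoustic cues named in the abstract.
    pitch_hz: float
    duration_ms: float
    intensity_db: float
    pause_after_ms: float
    # Hypothetical linguistic cues (assumed; not given in the abstract).
    tone: int              # 1-4 for lexical tones, 0 for neutral tone
    position_in_word: int  # 0-based index of the syllable within its word
    word_length: int       # number of syllables in the word


def acoustic_features(s: Syllable) -> List[float]:
    """Input vector for the acoustic model."""
    return [s.pitch_hz, s.duration_ms, s.intensity_db, s.pause_after_ms]


def linguistic_features(s: Syllable) -> List[float]:
    """Input vector for the linguistic model (assumed feature set)."""
    return [float(s.tone), float(s.position_in_word), float(s.word_length)]


def mixed_features(s: Syllable) -> List[float]:
    """Input vector for the mixed model: both feature sets concatenated."""
    return acoustic_features(s) + linguistic_features(s)
```

Each vector would then be fed to its own neural network classifier; the network architecture and training procedure are not described in the abstract and are left out of this sketch.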

Authors: SHAO Yanqiu, HAN Jiqing, ZHAO Yongzhen, LIU Ting

Affiliation: School of Computer Science and Technology

Source: Chinese Journal of Acoustics (声学学报（英文版）), 2007, No. 1, pp. 49-62 (14 pages)

Funding: This work was supported by the National Natural Science Foundation of China (No. 60085001).

Classification: H102 [Language and Script: Chinese]



