
Improved decision tree based method for English prosodic phrase boundary prediction (cited by: 3)
Abstract: In English speech synthesis systems, the accuracy of prosodic phrase boundary prediction critically affects the naturalness and intelligibility of the synthesized speech. Decision tree based prediction is currently the most widely used method for this task, but it has two weaknesses. Because tree construction is constrained by data balance, it is difficult to model specific keywords; and because prediction uses a locally optimal search, the result is not guaranteed to be globally optimal. To improve prediction performance, this paper refines the decision tree based method by introducing a conditional probability of prosodic phrases and using the Viterbi algorithm to optimize the prosodic phrase boundary probability and the conditional probability simultaneously. It also proposes a method for optimizing the probabilities at decision tree nodes based on the positional distribution of keywords within prosodic phrases. Experiments show that applying the improved method to the baseline system raises the F-Score of phrase boundary prediction from 68.7% to 77.8% and lowers the unacceptable rate from 22.4% to 15.2%.
Source: Application Research of Computers (《计算机应用研究》), indexed in CSCD and the Peking University Core Journals list, 2012, No. 8, pp. 2921-2925 (5 pages)
Keywords: speech synthesis; prosodic phrase; boundary prediction; decision tree; location distribution
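
The contrast the abstract draws between a locally optimal search and a Viterbi search can be illustrated with a small sketch. This is not the paper's implementation. It assumes that each word junction gets a break probability from a decision tree leaf (`leaf_probs`), and it stands in for the prosodic phrase conditional probability with a hypothetical first-order table `trans[prev][cur]`, since the abstract does not define that term. The function names and the `weight` parameter are likewise illustrative.

```python
import math

_EPS = 1e-12  # floor to avoid log(0)


def greedy_boundaries(leaf_probs):
    """Local decision: mark a break wherever the leaf probability exceeds 0.5."""
    return [1 if p > 0.5 else 0 for p in leaf_probs]


def viterbi_boundaries(leaf_probs, trans, weight=1.0):
    """Jointly choose break (1) / no-break (0) labels for all junctions.

    leaf_probs[i]   : P(break at junction i), e.g. from a decision tree leaf.
    trans[prev][cur]: ASSUMED stand-in for the conditional probability,
                      modeled here as P(label cur | previous label prev).
    weight          : hypothetical scale on the transition term.
    """
    n = len(leaf_probs)
    if n == 0:
        return []

    def emit(i, state):
        p = leaf_probs[i] if state == 1 else 1.0 - leaf_probs[i]
        return math.log(max(p, _EPS))

    score = [emit(0, 0), emit(0, 1)]  # best log-score ending in each state
    back = []                         # back[i-1][cur] = best previous state
    for i in range(1, n):
        new_score, ptr = [], []
        for cur in (0, 1):
            cands = [
                score[prev] + weight * math.log(max(trans[prev][cur], _EPS))
                for prev in (0, 1)
            ]
            best_prev = 0 if cands[0] >= cands[1] else 1
            new_score.append(cands[best_prev] + emit(i, cur))
            ptr.append(best_prev)
        score = new_score
        back.append(ptr)

    # Trace back from the best final state.
    state = 0 if score[0] >= score[1] else 1
    path = [state]
    for ptr in reversed(back):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path


if __name__ == "__main__":
    probs = [0.55, 0.6, 0.2]
    # Assumed table: a break directly after a break is unlikely.
    trans = [[0.7, 0.3], [0.9, 0.1]]
    print(greedy_boundaries(probs))
    print(viterbi_boundaries(probs, trans))
```

With these made-up numbers the greedy rule marks two adjacent breaks, `[1, 1, 0]`, while the Viterbi search returns `[1, 0, 0]` because the assumed transition table penalizes back-to-back breaks. With a uniform table the two methods agree, so any difference comes from the conditional term alone.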

