用决策树指导TBL进行多音字消歧被引量：1

Polyphone disambiguation based on tree-guided TBL

下载PDF

导出

摘要多音字消歧是普通话语音合成系统中字音转换模块的核心问题。选择了常见易错的33个多音字和24个多音词作为研究对象,构建了一个平均每个多音字(词)5000句的语料库,并且提出了一种结合决策树和基于转换的错误驱动的学习(Transformation-Basederror-driven Learning,TBL)的混合算法。该方法根据决策树的指导,自动生成TBL算法的模板,避免了手工总结模板这一费时费力的过程。实验结果表明,该方法生成的模板与手工模板性能相当,其平均准确率达90.36%,明显优于决策树。 Polyphone disambiguation is the core issue of the grapheme-to-phoneme conversion in Mandarin Text-To-Speech （＇ITS） system.This paper selects 33 key polyphones and 24 key polyphonic words which are most ambiguous and frequently used as study objects,and builds a polyphone corpus of 5 000 sentences per polyphone on average.Furthermore,a hybrid algorithm called Tree-Guided Transformation-Based Leaming（TGTBL）,which combines decision tree with Transformation-Based error-driven Leaming（TBL）,is proposed to resolve the polyphonic ambiguity.It automatically generates TBL templates,thereby avoiding manually summarizing templates, which is time-consuming and laborious in conventional TBL.Results of comparative experiments show that, for the task of polyphone disambiguation, templates automatically generated by decision tree achieve comparable performance to manually summarized templates,and the average precision of TGTBL reaches 90.36%,siguificantly higher than that of decision tree.

作者刘方舟周游

机构地区湖南师范大学数学与计算机科学学院湖南财政经济学院应用数学系

出处《计算机工程与应用》 CSCD 北大核心 2011年第12期137-140,共4页 Computer Engineering and Applications

基金湖南省科技计划项目(No.2010FJ4131) 湖南省教育厅科研项目(No.10C0955)

关键词多音字消歧字音转换决策树基于转换的错误驱动的学习(TBL) polyphone disambiguation grapheme-to-phoneme decision tree Transformation-Based error-driven Leaming（TBL）

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献8

1Yarowaky D.Homograph disambiguation in speech synthesis[M]//Santen J,Sproat R,Olive J,et al.Progress in speech synthesis.New York:Springer-Verlag,1996:159-175.
2Wang Wern-jun,Hwang Shaw-hwa,Chen Sin-horag.The broad study of homograph disambiguity for mandarin speech synthesis[C]//Proc 4th International Conference on Spoken Language Processing,Philadelphia,1996:1389-1392.
3Zhang Zi-rong,Chu Min.An efficient way to learn rules for grapheme-to-phoneme conversion in Chinese[C]//Proc 3rd International Symposium on Chinese Spokon Language Processing,Taipci,2002:233-236.
4胡国平,陈志刚,王仁华.基于规则及 SVM 权值训练的汉语多音字自动消歧研究[C]//Proc 20th International Conference on Computer Processing of Oriental Languages,Shonyang,2003:599-605.
5Zheng Min,Shi Qin.Grapheme-to-phoneme conversion based on TBL algorithm in Mandarin TTS system[C]//Proc 6th Annual Conference of the International Speech Communication Association,Lisbon,2005:1897-1900.
6Brill E.Tranformation-based error-driven learning and natural language processing:A case study in part of speech tagging[J].Computational Linguistics,1995,21(4):543-565.
7Ramshaw L,Marcus M.Text chunking using transformation-based lesming[M]//Armstrong S,Church K,Isabelle P,et al.Natural language processing using very large corpora.Dordrecht:Kluwer Academic Publishers,1999:82-94.
8Brill E.Learning to parse with transformations[M]//Bunt H,Tomita M.Recent advances in parsing technology.Dordrecht:Kluwer Academic Pubfishers,1996:221-240.

同被引文献1

1张子荣,初敏.解决多音字字-音转换的一种统计学习方法[J].中文信息学报,2002,16(3):39-45. 被引量：10

引证文献1

1高羽,熊一瑾,叶建成.针对长尾问题的二重加权多音字消歧算法[J].中文信息学报,2022,36(11):169-176. 被引量：1

二级引证文献1

1赵立君,张军雁,何倩,庄严,郭锐.医学语言模型研究[J].长江信息通信,2023,36(11):1-7. 被引量：1

1范明,胡国平,王仁华.汉语字音转换中的多层面多音字读音消歧[J].计算机工程与应用,2006,42(2):167-170. 被引量：1
2王天航,史树敏,龙从军,黄河燕,李琳.基于错误驱动学习策略的藏语句法功能组块边界识别[J].中文信息学报,2014,28(5):170-175. 被引量：7
3王旗,马建芬.基于TBL的手写字体分段技术[J].电脑开发与应用,2011,24(6):53-55.
4郝东亮,杨鸿武,张策,张帅,郭立钊,杨静波.面向汉语统计参数语音合成的标注生成方法[J].计算机工程与应用,2016,52(19):146-153. 被引量：1
5杨云.基于句法结构的评价对象抽取方法在不同模板上的性能分析[J].长春教育学院学报,2017,33(4):38-41.
6张书洋.普通话语音机房设计及建议[J].资治文摘,2016,0(1):122-122.
7赵永贞,刘挺,王志伟,陈惠鹏,邵艳秋.汉语文语转换系统中停顿指数的自动标注[J].中文信息学报,2004,18(5):48-55. 被引量：6
8王洁,宋柔.字音转换策略介绍及性能代价评估[J].计算机工程与应用,2007,43(16):26-29.
9田卫东,李亚娟.基于CRF和错误驱动的中心词识别[J].计算机应用研究,2013,30(8):2345-2348. 被引量：3
10高璐,陈琪,李永宏,于洪志.藏语语音合成中文本分析的若干问题研究[J].西北民族大学学报（自然科学版）,2010,31(2):27-32. 被引量：5

计算机工程与应用

2011年第12期

浏览历史

内容加载中请稍等...

用决策树指导TBL进行多音字消歧被引量：1

参考文献8

同被引文献1

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

用决策树指导TBL进行多音字消歧 被引量：1

参考文献8

同被引文献1

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

用决策树指导TBL进行多音字消歧被引量：1