期刊文献+

傣语语音合成中的文本归一化方法

An Approach to Normalization of Dai Text for Speech Synthesis
下载PDF
导出
摘要 本文以开发傣语语音合成系统为目的,重点研究傣语文本中的数字归一化和特殊字符归一化问题。数字和特殊字符都属于傣语文本中的非标准词,文本归一化的主要目的是用标准词表示非标准词的发音。归一化处理过程包括:非标准词识别、歧义判断、消歧处理和非标准词转换为标准词4个步骤。本文采用基于规则和上下文关键词相结合的方法识别非标准词,利用正则表达式判断其歧义类型,根据转换规则对非标准词进行消歧并确定其正确的傣文读音。实验结果表明,本文提出的文本归一化方法的正确率达到了94.6%,可以完全满足傣语文语转换系统前端文本分析的需求,并具有良好的自然语言处理应用价值。 With the purpose of developing a Dai speech synthesis system, this paper focuses on the study of Dai numbers and special characters normalization. Both numbers and special characters are the non-standard words in Dai text. The main purpose of the text normalization is to represent the pronunciation of non-standard words with standard words. The normalization process includes non-standard words recognition, ambiguity judgment, disambiguation and non-standard transla-tion. Firstly, the non-standard words are recognized and the ambiguous types of these non-stan- dard words are determined using a method based on rule-based and context-keyword, in this paper. Then, the types of ambiguity are judged on regular expression. Lastly, the correct pronunciation of no-standard words is determined according to the transformation rules. Experimental results show that the correct rate of this normalization is more than 94.6%. This purposed method can fully satisfy the front-end text analysis in Dai text to speech conversion system, and has a good natural language processing application value.
出处 《计算机科学与应用》 2016年第7期415-422,共8页 Computer Science and Application
基金 国家自然科学基金(61262068).
  • 相关文献

参考文献2

二级参考文献24

  • 1爱德华·萨丕尔.《语言论》[M].商务印书馆,1985年版.第195页.
  • 2高立士:《西双版纳傣族的历史和文化》,云南民族出版社,1992年.
  • 3郭锡良:《汉字古音手册》,北京大学出版社,1986年.
  • 4王均等编著:《壮侗语族语言简志》,民族出版社,1984年.
  • 5西双版纳傣族自治州人民政府编:《傣汉字典》,云南民族出版社,2002年.
  • 6张公瑾:《傣族文化研究》,云南民族出版社,1988年.
  • 7--:《文化语言学发凡》,云南大学出版社,1998年.
  • 8周耀文、罗美珍:《傣语方言研究》,民族出版社,2001年.
  • 9GNU grep[EB/OL]. [2015- 04- 30]. ftp://reality.sgiweb.org/ freeware/relnotes/fw-5.3/fw_gnugrep/gnugrep.html.
  • 10Crochemore M, Czumaj A, Gasieniec L, et al. Speeding up two strings matching algorithms[J]. Algorithmica, 1994, 12 (4/5): 247-267.

共引文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部