
Text Normalization in Chinese Text-to-Speech System (original title: 中文语音合成中的文本正则化研究) Cited by: 12
Abstract Chinese text normalization is the process of converting non-Chinese-character strings into Chinese character strings so that their pronunciation can be determined. The task is difficult for two reasons: the strings to be normalized take many varied forms that are hard to generalize, and many of them are ambiguous and require disambiguation. This paper introduces the concept of non-standard words (NSWs) to classify non-Chinese-character strings effectively, and proposes a three-layer normalization model consisting of NSW detection, NSW disambiguation, and standard word generation. A machine learning method is used in the disambiguation layer, which avoids the need to write complex rules. Experiments show that the approach performs well and adapts well to new domains, reaching an accuracy of 98.64% in the open test.
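The three layers named in the abstract can be illustrated with a minimal Python sketch. This is not the paper's implementation. All function names are hypothetical, the detector only handles digit strings, and the disambiguation layer uses a single hand-written context rule as a stand-in for the machine learning classifier the paper describes (the keywords mention a maximum entropy model). Cardinal reading is limited to integers below 10000.

```python
import re

DIGITS = "零一二三四五六七八九"
NSW_PATTERN = re.compile(r"\d+")  # assumption: only digit strings count as NSWs here


def read_digits(token):
    """Read a digit string one digit at a time (used for years in this sketch)."""
    return "".join(DIGITS[int(c)] for c in token)


def read_cardinal(token):
    """Read an integer below 10000 as a cardinal number."""
    n = int(token)
    if n >= 10000:
        raise ValueError("this sketch only handles integers below 10000")
    if n == 0:
        return "零"
    units = ["", "十", "百", "千"]
    digits = str(n)
    out = []
    pending_zero = False
    for i, ch in enumerate(digits):
        d = int(ch)
        pos = len(digits) - 1 - i
        if d == 0:
            pending_zero = True  # emit a single 零 only if a nonzero digit follows
            continue
        if pending_zero and out:
            out.append("零")
        pending_zero = False
        out.append(DIGITS[d] + units[pos])
    text = "".join(out)
    # 10 to 19 are read 十X rather than 一十X
    if text.startswith("一十"):
        text = text[1:]
    return text


def detect(text):
    """Layer 1: locate NSW candidates in the input text."""
    return list(NSW_PATTERN.finditer(text))


def disambiguate(text, match):
    """Layer 2: pick a category for one NSW.

    Stand-in rule, not the paper's classifier: a four-digit string
    followed by 年 is treated as a year, anything else as a cardinal.
    """
    following = text[match.end():match.end() + 1]
    if len(match.group()) == 4 and following == "年":
        return "year"
    return "cardinal"


def generate(token, category):
    """Layer 3: produce the Chinese-character reading for the chosen category."""
    return read_digits(token) if category == "year" else read_cardinal(token)


def normalize(text):
    """Run the three layers and splice the readings back into the text."""
    out = []
    last = 0
    for m in detect(text):
        out.append(text[last:m.start()])
        out.append(generate(m.group(), disambiguate(text, m)))
        last = m.end()
    out.append(text[last:])
    return "".join(out)
```

The same digit string receives different readings depending on the category chosen in the second layer, which is the ambiguity the abstract refers to: `normalize("2008年")` reads the digits one by one, while `normalize("共2008页")` reads them as a cardinal number.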
Source Journal of Chinese Information Processing (《中文信息学报》), CSCD, Peking University Core Journal, 2008, Issue 5, pp. 45-50, 55 (7 pages)
Funding Supported by the National 973 Program (2004CB318102)
Keywords computer application; Chinese information processing; text normalization; text-to-speech; maximum entropy model











