期刊文献+

基于扩张矩阵理论的汉语韵律短语分析 被引量:2

Chinese Prosodic Phrasing Based on Extension Matrix Theory
下载PDF
导出
摘要 提出了一种新的、基于扩张矩阵理论的归纳学习算法 :分组覆盖算法 ,并将其应用于汉语文语转换系统中的韵律短语分析问题 .算法以扩张矩阵为基础 ,在反例样本集背景下 ,将正例样本集划分为若干个一致的组 ,每一组对应于一个一致的规则 ,它覆盖了这组正例而且不覆盖任何的反例 ;建造了一个用于韵律短语分析的语料库 ,并提出了一组与韵律有关的特征 ;将数据分为训练集和测试集对算法进行了验证 .实验结果表明 ,新算法在正确率、规则个数和可懂性等各方面均优于传统的决策树方法 ,并接近于手工制定的规则 . This paper presents a new inductive learning algorithm based on the extension matrix theory, and uses it to solve the prosodic phrasing problem for Chinese Text-to-Speech systems. Authors propose a novel definition of the consistency of a rule and of a set of positive examples, and reveal their relationship using a theorem: By dividing the positive examples of a specific class in a given example set into consistent groups and adopting a simple strategy to find a conjunctive rule for each group which covers all the group's positive examples and none of the negative examples, the algorithm finds a set of consistent rules in the form of variable-valued logic. Authors collect 937 sentences of different genres (about 78 minutes length) from CCTV news program and built a large speech corpus. A group of features for modeling prosody are also proposed, and their effectiveness is measured by the interpretation of the resulting rules. Lastly, a serial of experiments are conducted. The data is divided into two parts: training set and test set, and the experimental results show that authors' method achieves higher accuracy, better interpretation and less rules than other algorithms. And the generated rules are quite similar to hand-crafted ones, which may help us better understand the relationship between Chinese syntax and prosody.
出处 《计算机学报》 EI CSCD 北大核心 2003年第1期26-31,共6页 Chinese Journal of Computers
基金 国家自然科学基金重点项目 ( 60 13 5 0 10 ) 国家"九七三"重点基础研究发展规划项目 (G19980 3 0 5 0 9)资助
关键词 扩张矩阵理论 汉语韵律短语分析 汉字信息处理 归纳学习算法 汉语文语转换系统 Knowledge based systems Learning algorithms Learning systems Linguistics Matrix algebra Speech processing Text processing
  • 相关文献

参考文献10

  • 1[1]Bachenko J, Fitzpatrick E A. Computational grammar of discourse-neutral prosodic phrasing in English. Computational Linguistics, 1990, 16(3):155~170
  • 2[2]Quene H, Kager R. The derivation of prosody for text tospeech from prosodic sentence structure. Computer Speech and Language, 1992, 6(1): 77~98
  • 3[3]Taylor P, Black A W. Assigning phrase breaks from part-ofspeech sequences. Computer Speech and Language, 1998, 12(2):99~117
  • 4[4]Muller A F, Zimmermann H G, Neuneier R. Robust generation of symbolic prosody by a neural classifier based on autoas sociators. In: Proceedings of ICASSP, Istanbul, Turkey,2000. 1285~ 1288
  • 5[5]Wang M, Hirschberg J. Automatic classification of intonational phrase boundaries. Computer Speech and Language, 1992, 6(2): 175~196
  • 6[6]Hirschberg J, Prieto P. Training intonational phrasing rules automatically for English and Spanish text to speech. Speech Communication, 1996, 18(3): 281~290
  • 7[7]Lee S, Oh Y H. Tree hased modeling of prosodic phrasing and segmental duration for Korean TTS systems. Speech Commu nication, 1999, 28(4): 283~300
  • 8[8]Fordyce C, Ostendorf M. Prosody prediction for speech synthesis using transformational rule-based learning. In: Proceedings of ICSLP, Sydney, Australia, 1998. 843~846
  • 9[9]Chen W J, Lin F Z, Li J M, Zhang B. Training prosodic phrasing rules for Chinese TTS systems. In: Proceedings of Eurospeech, Scandinavia, 2001. 1159~1162
  • 10[10].Hong J R. AE1: An extension matrix approximate method for the general covering problem. International Journal of Comput er and Information Science, 1985, 14(6): 421~437

同被引文献15

  • 1黄衍.试论英语主位和述位[J].外国语,1985,8(5):34-38. 被引量:366
  • 2徐盛桓.主位和述位[J].外语教学与研究,1982,14(1):1-9. 被引量:392
  • 3朱永生.主位推进模式与语篇分析[J].外语教学与研究,1995,27(3):6-12. 被引量:426
  • 4Hong J R.AE1 :Extension matrix approximate method for the general covering problem[J].International Journal of Computer & Information Science, 1985; 14(6) :421~437
  • 5Xindong Wu. Inductive learning:algorithms and frontiers[J].Artificial Intelligence Review,1993; (7) :93~108
  • 6Quinlan J R.C4.5:Programs for machine learning[M].San Mateo,USA: Morgan Kaufmann, 1993
  • 7Utgoff P E,Berkman N C,Clouse J A.Decision tree induction based on efficient tree restructuring[J].Machine Learning, 1997; 29 ( 1 ): 5~44
  • 8Guido Cervone,Liviu Panait,Ryszard S Michalski.The Development of the AQ20 Learning System and Initial Experiments[J].Intelligent Information Systems,2001:13~29
  • 9Wu XD.Rule induction with extension matrices[J].Journal of the American Society for Information Science, 1998; 49 (5) :435~454
  • 10Kargupta H,Park B H.A Fourier spectrum-based approach to represent decision trees for mining data streams in mobile environments[J].IEEE Transactions on Knowledge and Data Engineering,2004;16(2) :216~229

引证文献2

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部