Predicting Chinese Prosodic Words Based on Transformation-Based Error-Driven Learning (TBL)
Abstract: A new approach to predicting Chinese prosodic words is proposed. The relationship between lexical words and prosodic words is analyzed on a manually annotated corpus. The analysis shows that 24% of prosodic words are formed by combining two or more lexical words, and that the length of a lexical word is the main feature for determining prosodic word boundaries. Based on this analysis, a prosodic word prediction method is implemented using the transformation-based error-driven learning (TBL) algorithm with lexical features. Experiments show that the proposed method reaches a prediction precision of 97.5% on the test set and outperforms the other methods compared.
Source: Journal of Northwest Normal University (Natural Science) (《西北师范大学学报(自然科学版)》), CAS-indexed, 2008, No. 1, pp. 47-51 (5 pages)
Funding: Northwest Normal University Key Research Personnel Cultivation Project (NWNU-KJCXGC-03-42)
Keywords: prosodic word; lexical word; TBL algorithm (transformation-based error-driven learning); text-to-speech
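To make the idea in the abstract concrete, here is a minimal sketch of transformation-based error-driven learning applied to boundary labels between adjacent lexical words. It is an illustration under stated assumptions, not the paper's implementation: the single rule template (length of the left word, length of the right word), the labels `B` (prosodic word boundary) and `N` (no boundary), the initial "every gap is a boundary" guess, the function names `train_tbl` and `predict`, and the toy training data are all hypothetical. The record above does not give the actual templates or corpus.

```python
# Minimal TBL sketch. Each sample is ((left_len, right_len), gold_label),
# describing one gap between two adjacent lexical words.
# All names and data here are illustrative assumptions, not from the paper.

def apply_rule(rule, feats, labels):
    """Relabel every gap whose features match the rule and whose label is src."""
    left, right, src, dst = rule
    return [dst if (f == (left, right) and lab == src) else lab
            for f, lab in zip(feats, labels)]

def count_errors(golds, labels):
    return sum(g != lab for g, lab in zip(golds, labels))

def train_tbl(samples, initial="B", min_gain=1):
    """Greedily learn an ordered rule list; stop when no rule reduces errors."""
    feats = [f for f, _ in samples]
    golds = [g for _, g in samples]
    labels = [initial] * len(samples)   # initial-state annotation
    learned = []
    while True:
        base = count_errors(golds, labels)
        # Error-driven: candidate rules are proposed only from current mistakes.
        candidates = sorted({(f[0], f[1], lab, g)
                             for f, g, lab in zip(feats, golds, labels)
                             if lab != g})
        best, best_gain = None, 0
        for rule in candidates:
            gain = base - count_errors(golds, apply_rule(rule, feats, labels))
            if gain > best_gain:
                best, best_gain = rule, gain
        if best is None or best_gain < min_gain:
            return learned
        labels = apply_rule(best, feats, labels)
        learned.append(best)

def predict(words, rules, initial="B"):
    """Label each gap between adjacent words by replaying the rules in order."""
    feats = [(len(a), len(b)) for a, b in zip(words, words[1:])]
    labels = [initial] * len(feats)
    for rule in rules:
        labels = apply_rule(rule, feats, labels)
    return labels

# Invented toy data: gaps keyed by (left word length, right word length).
TOY_SAMPLES = [
    ((1, 1), "N"), ((1, 1), "N"), ((1, 1), "N"), ((1, 1), "B"),
    ((2, 1), "N"), ((2, 1), "N"),
    ((2, 2), "B"), ((2, 2), "B"), ((2, 2), "B"),
    ((1, 2), "N"), ((1, 2), "B"), ((1, 2), "B"),
]

rules = train_tbl(TOY_SAMPLES)
print(rules)
print(predict(["我", "的", "老师", "很", "高兴"], rules))
```

On this toy data the learner keeps only rules whose net effect removes errors: a candidate that would fix one gap but break two others is rejected. A practical system would use richer templates, such as part of speech and wider context, and a held-out test set.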