期刊文献+

基于汉英平行语料库的英文显式篇章关系识别 被引量:2

English explicit discourse relation recognition on Chinese-English parallel corpus
下载PDF
导出
摘要 汉英篇章平行语料库有助于基于篇章的双语研究.构建了汉英平行语料库,对语料中的汉语及其英语对译中的连接词分别进行了标注和关系分类.其中英文连接词比单语语料上的英文连接词定义广泛,更为复杂.在此语料上,抽取词法、句法和位置信息等特征在英文文本上进行显式篇章关系识别.实验采用最大熵分类方法,获得连接词识别正确率92.5%;抽取英文和对应中文连接词作为特征,获得给定连接词关系分类正确率85.6%.研究结果可为今后的中英篇章关系对比识别提供参考. Chinese-English discourse parallel corpus contributes to bilingual discourse research.The Chinese-English discourse Parallel Corpus,which annotates conjunctions and relation classification in Chinese corpus and English corpus was constructed in this paper.In this corpus English conjunction definition is wider than traditional conjunction’s,and is more complicate.On this corpus,the paper extracts lexical,syntactic features and location information to identify and classify the explicit discourse relation in the English text.Experiment adopts with maximum entropy classification method to obtain conjunction recognition accuracy of 92.5%;and extracts English and Chinese conjunction as features to obtain given conjunction classification accuracy of 85.6%.The research provides a reference for contrast recognition of Chinese-English discourse relation for future.
作者 冯洪玉 李艳翠 冯文贺 FENG Hongyu;LI Yancui;FENG Wenhe(School of Information Engineering,Henan Institute of Science and Technology,Xinxiang 453003,China;Guangdong University of Foreign Studies,Guangzhou 510420,China)
出处 《河南科技学院学报(自然科学版)》 2019年第5期55-62,共8页 Journal of Henan Institute of Science and Technology(Natural Science Edition)
基金 国家自然科学基金(61502149) 河南省科技计划项目(182102210048)
关键词 显式篇章关系 连接词识别 分类 explicit discourse relation conjunction recognition classification
  • 相关文献

参考文献6

二级参考文献40

  • 1邢福义.往前又是O起点──《汉语复句研究》自序[J].汉语学习,2000(4):12-13. 被引量:3
  • 2周强.汉语句法树库标注体系[J].中文信息学报,2004,18(4):1-8. 被引量:90
  • 3邢福义.汉语语法学[M].长春:东北师范大学出版社,2000.75-78.
  • 4邹嘉彦,连兴隆,高维君,等.中文篇章中的关联词语及其引导的句子关系的自动标注--面向话语分析的中文篇章语料库的开发[C]//1998中文信息处理国际会议论文集,1998:288-297.
  • 5李晋霞,刘云.面向计算机的二重复句层次划分研究[C]∥语言计算与基于内容的文本处理,2003:147-153.
  • 6柏晓静 常宝宝 詹卫东.构建大规模的汉英双语平行语料库[C]..见:2002全国机器翻译研讨会文集:机器翻译研究进展[C].,2002.147-154.
  • 7邢福义.汉语复句研究[M].北京:商务印书馆.2002.
  • 8Z Lin, H T Ng, M Y Kan. Automatically Evaluating Text Coherence Using Discourse Relations[C]//Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011, Volume 1: 997-1006.
  • 9D Marcu, A Echihabi. An Unsupervised Approach to Recognizing Discourse Relations [C]//Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, 2002: 368-375.
  • 10M Saito, K Yamamoto, S Sekine. Using Phrasal Patterns to Identify Discourse Relations[C]//Proceedings of the Human Language Technology Conference of the NAACL, 2006, Companion Volume:133-136.

共引文献47

同被引文献15

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部