期刊文献+

基于小句复合体的中文机器阅读理解研究

Machine Reading Comprehension Based on Clause Complex
下载PDF
导出
摘要 机器阅读理解任务要求机器根据篇章文本回答相关问题。该文以抽取式机器阅读理解为例,重点考察当问题的线索要素与答案在篇章文本中跨越多个标点句时的阅读理解问题。该文将小句复合体结构自动分析任务与机器阅读理解任务融合,利用小句复合体中跨标点句话头-话体共享关系,来降低机器阅读理解任务的难度;并设计与实现了基于小句复合体的机器阅读理解模型。实验结果表明,在问题线索要素与答案跨越多个标点句时,答案抽取的精确匹配率(EM)相对于基准模型提升了3.49%,模型整体的精确匹配率提升了3.26%。 The machine reading comprehension task requires the machine to answer relevant questions according to the context.Focused on extractive machine reading comprehension,this paper proposes a clause complex based machine reading comprehension method.The naming structure relationship of clause complex is introduced to alleviate the difficult cases with clue elements or answer elements spreading across multiple punctuation sentences.The experimental results show that the proposed method improves the EM of the whole model by 3.26%,and that for the difficult cases by 3.49%.
作者 王瑞琦 罗智勇 刘祥 韩瑞昉 李舒馨 WANG Ruiqi;LUO Zhiyong;LIU Xiang;HAN Ruifang;LI Shuxin(School of Information Science,Beijing Language and Culture University,Beijing 100083,China)
出处 《中文信息学报》 CSCD 北大核心 2024年第3期130-140,共11页 Journal of Chinese Information Processing
基金 国家自然科学基金(62076037)。
关键词 机器阅读理解 跨标点句问答 小句复合体 machine reading comprehension cross-punctuation questions and answers clause complex
  • 相关文献

参考文献3

二级参考文献11

  • 1黄健传,宋柔.标点句标注研究[C]//第九届全国计算语言学学术会议论文集.北京:清华大学出版社,2007:350-355.
  • 2SONG R,JIANG Y,WANG J.On generalized-topic-based Chinese discourse structure[C]//S1GHAN 2010:Proceedings of CIPS-SIGHAN Joint Conference on Chinese Language Processing.Beijing:Tsinghua University Press,2010:23-33.
  • 3宋柔.汉语篇章广义话题结构研究[R].北京:北京语言大学,2012.
  • 4GILLELAND M.Levenshtein distance,in three flavors[EB/OL].[2013-02-04].http://people.cs.pitt.edu/-kirk/csl501/Pruhs/Spring2006/assiguments/editdistance/Levenshtein%20Distance.htm.
  • 5胡乔木.中国大百科全书:图文数据光盘[M/CD].北京:中国大百科全书出版社,1999.
  • 6KOHAVI R.A study of cross-validation and bootstrap for accuracy estimation and model selection[C]//IJCAI'95:Proceedings of the 14th International Joint Conference on Artificial Intelligence.San Francisco:Morgan Kaufmann,1995,2:1137-1143.
  • 7JIANG Y,SONG R.Topic structure identification of PClause sequence based on generalized topic theory[C]//Proceedings of the 2012 1st CCF Conference on Natural Language Processing and Chinese Computing.Berlin:Springer-Verlag,2012:85-96.
  • 8宋柔.现代汉语跨标点句句法关系的性质研究[J].世界汉语教学,2008,22(2):26-44. 被引量:27
  • 9黄娴,张克亮.汉语零形回指研究综述[J].中文信息学报,2009,23(4):10-15. 被引量:9
  • 10蒋玉茹,宋柔.基于广义话题理论的话题句识别[J].中文信息学报,2012,26(5):114-119. 被引量:13

共引文献44

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部