一种基于关键词扩展的答案块提取模型

Answer Chunk Extraction Model Based on Key Words’ Extension

下载PDF

导出

摘要针对问答式机器阅读理解中非定长答案的提取问题,本文提出了一种基于关键词扩展的答案块提取模型.该模型首先确定答案所在区块的中心词,即将文本与问题进行联合处理后计算问题关于联合向量的注意力值并按列输入softmax函数,将此概率分布矩阵逐列相加后遍历全文,检索出答案所在区块的中心词.然后,以该词为中心进行答案块扩展,并在每次扩展后计算答案块与问题向量之间的相似程度,相似度开始减小时停止扩展以优化候选答案块的质量.相较于以往的答案块提取模型,该模型一方面不再依赖于词性标注,另一方面大大提高了答案块的生成效率,在简化模型的同时提高了机器阅读理解的准确性.实验结果表明,该模型在SQuAD测试数据集上的EM(Exact Match)和F1值均表现优异,分别获得了65. 7%和74. 3%的准确度. For answering a question who has non-fixed length in machine reading comprehension,this paper proposes an answer chunk extraction model based on keywords’ extension. Firstly,the model determines the central word of the chunk where the answer exists. It calculates the joint vector of the passage and question,and applies a column-wise softmax function to get probability distributions in each column,where each column is an individual joint vector-level attention when considering a single query word. After adding this probability distribution matrix column by column,it traverses the full passage and retrieves the central word. Then,the model extends the answer chunk around the center word. Afterwards,it calculates the similarity between the answer chunk and the question,and stops extension when similarity starts to decrease. The purpose of this step is to optimize the quality of candidate answer chunks. Compared with the previous models which extract answer chunk,our approach no longer depends on part-of-speech tagging,but greatly improves the generation efficiency of answer chunks. Our approach improves the accuracy of machine reading comprehension while simplifying the model. Experiments on the SQuAD dataset show that our model achieves an excellent performance in both the EM( exact match) and F1 values,and the accuracy reaches 65. 7% and 74. 3%,respectively.

作者霍欢薛瑶环周澄睿邹依婷金轩城黄君扬 HUO Huan;XUE Yao-huan;ZHOU Cheng-rui;ZOU Yi-ting;JIN Xuan-cheng;HUANG Jun-yang(School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China;Shanghai Key Laboratory of Data Science,Fudan University,Shanghai 201203 ,China)

机构地区上海理工大学光电信息与计算机工程学院复旦大学上海市数据科学重点实验室

出处《小型微型计算机系统》 CSCD 北大核心 2019年第4期749-754,共6页 Journal of Chinese Computer Systems

基金国家自然科学基金项目(61003031)资助上海重点科技攻关项目(14511107902)资助上海市工程中心建设项目(GCZX14014)资助上海市一流学科建设项目(XTKX2012)资助上海市数据科学重点实验室开放课题资助课题项目(201609060003)资助沪江基金研究基地专项项目(C14001)资助

关键词机器阅读理解非定长答案关键词扩展块提取 machine reading comprehension non-fixed length answer keyword extension chunk extraction

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1霍欢,张薇,刘亮,李洋.一种针对句法树的混合神经网络模型[J].中文信息学报,2017,31(6):58-66. 被引量：4

共引文献3

1霍欢,薛瑶环,黄君扬,金轩城,邹依婷.一种针对成分树的混合神经网络模型[J].中文信息学报,2019,33(3):8-16. 被引量：2
2霍欢,刘亮.一种在矩阵空间中识别文本蕴涵的动态交互网络[J].计算机应用研究,2019,36(10):2965-2970.
3霍欢,邹依婷,周澄睿,薛瑶环,黄君扬,金轩城.一种应用于填空型阅读理解的句式注意力网络[J].小型微型计算机系统,2019,40(3):482-487. 被引量：3

1张晶.论通信领域关键词扩展[J].科学与信息化,2018,0(30):188-188.
2范振.基于LSTM模型的分词及词性标注一体化设计[J].科学与信息化,2018,0(6):194-195.
3唐铿.串串红[J].小学时代,2019,0(4):30-30.
4张琴.一道例题的有效变式练习——min、max函数问题研究[J].数理化解题研究,2018,0(32):35-36.
5于波,杨红立,冷淼.基于用户兴趣模型的推荐算法[J].计算机系统应用,2018,27(9):182-187. 被引量：7
6凌燕群.浅谈语文课堂教学的简约美[J].小学生（多元智能大王）,2019(3):76-76.
7李伟康,洪宇,陈鑫,邹博伟,张民.基于密度优先策略的答案源搜索方法研究[J].山西大学学报（自然科学版）,2019,42(1):12-22.
8文静.MAX函数在工资薪金所得税计算中应用的探索研究[J].智能计算机与应用,2019,9(2):221-223.
9刘慧婷,程雷,郭孝雪,赵鹏.实时个性化微博推荐系统[J].计算机科学,2018,45(9):253-259. 被引量：1
10王逸凡,李国平.基于语义相似度及命名实体识别的主观题自动评分方法[J].电子测量技术,2019,42(2):84-87. 被引量：6

小型微型计算机系统

2019年第4期

浏览历史

内容加载中请稍等...

一种基于关键词扩展的答案块提取模型

参考文献1

共引文献3

相关作者

相关机构

相关主题

浏览历史