基于度量学习和层级推理网络的抽取式摘要方法

Metric learning based hierarchical inference network for extractive text summarization

下载PDF

导出

摘要当前大部分的抽取式摘要方法主要关注对摘要句的表示和抽取,容易忽略对文本特征表示的充分性。为了解决这一问题,提出一种基于度量学习和层级推理网络的抽取式摘要方法。首先,在抽取式任务基础上提出基于度量学习和层级推理的抽取式摘要模型(MLHIN);其次,在CNN/DailyMail数据集上进行模型评估,并在英文摘要数据集CNN/DailyMail上进行测试;最后,对测试结果进行验证。结果显示,所提方法模型在Rouge-1,Rouge-2,Rouge-L上的得分明显优于其他模型,比Lead-3模型分别高出0.84%,1.29%和2.43%;通过将提出的度量损失metric和层级推理模型中的句子编码器替换掉,可以看出模型性能均有不同程度的下降,证明了提出的层级推理网络和度量损失的有效性。新算法能够提高模型捕捉长距离依赖的能力,增强模型对摘要句与非摘要句的分辨力,有效改善了抽取式摘要方法的性能。 Most of the current extractive summarization methods mainly focus on the representation and extraction of summary sentences, and tend to ignore the adequacy of text feature representation.In order to solve this problem, an extractive summarization method was proposed.Firstly, on the basis of abstract tasks, an extractive summarization model(MLHIN) based on metric learning and hierarchical inference was proposed.Secondly, the model was evaluated and tested on the English CNN/DailyMail dataset.Finally, the test results of the model on the dataset are verified.The results show that the proposed model has significantly higher scores than other models on Rouge-1,Rouge-2 and Rouge-L,which are 0.84%,1.29% and 2.43% higher than the Lead-3 model respectively.After replacing the metric loss metric and the sentence encoder with other modules, it can be seen that the performance of the model has declined to varying degrees, which proves the effectiveness of the proposed hierarchical inference network and metric loss.The algorithm can improve the ability of model to capture long-distance dependency, enhance the ability of model to distinguish summary sentences from non-summary sentences, and effectively improve the performance of the extractive summarization methods.

作者成悦赵康勾智楠高凯 CHENG Yue;ZHAO Kang;GOU Zhinan;GAO Kai(School of Information Science and Engineering,Hebei University of Science and Technology,Shijiazhuang,Hebei 050018,China;School of Information Technology,Hebei University of Economics and Business,Shijiazhuang,Hebei 050061,China)

机构地区河北科技大学信息科学与工程学院河北经贸大学信息技术学院

出处《河北科技大学学报》 CAS 北大核心 2022年第6期594-601,共8页 Journal of Hebei University of Science and Technology

基金河北省自然科学基金(F2022208006) 河北省高等学校科学技术研究项目(QN2020198)。

关键词自然语言处理句子编码器文档编码器度量学习层级推理抽取式文本摘要 natural language processing sentence encoder document encoder metric learning hierarchical inference extractive text summarization

分类号 TN958.98 [电子电信—信号与信息处理]

引文网络
相关文献

1许文军,郑虹,郑肇谦.基于ALBERT预训练模型生成式文本摘要[J].长春工业大学学报,2022,43(6):719-725. 被引量：1
2张文华,吴媛,施雅梅,汪鹏,叶芮辰,徐可,谢文,徐敦明,伊雄海.超高效合相色谱法测定鱼肉中氟苯尼考对映体及其代谢产物残留量[J].食品科学,2022,43(20):321-327. 被引量：1
3朱广丽,许鑫,张顺香,吴厚月,黄菊.PosNet:基于位置的因果关系抽取网络[J].计算机科学,2022,49(12):305-311.
4耿圆,谭红臣,李敬华,王立春.基于视觉信息积累的行人重识别网络[J].图学学报,2022,43(6):1193-1200. 被引量：4
5刘春磊,陈天恩,王聪,姜舒文,陈栋.小样本目标检测研究综述[J].计算机科学与探索,2023,17(1):53-73. 被引量：13
6庞伊琼,许华,蒋磊,史蕴豪,彭翔.基于混合注意力原型网络的调制识别算法[J].西北工业大学学报,2022,40(6):1375-1384. 被引量：1
7张云佐,郭亚宁,蔡昭权,张嘉煜.顾及方向信息的时空联合监控视频摘要方法[J].光电子．激光,2022,33(9):992-1000.
8刘春子,赖冬丽,米宏图,曹晶.牙周基础治疗对牙周炎合并2型糖尿病患者相关指标的影响[J].中外医学研究,2022,20(26):48-50. 被引量：2
9章文显,桑劲鹏,高立,陆春宇,钱政平.基于多种检测方式的盘形件超声检测研究[J].轨道交通装备与技术,2022(6):24-27. 被引量：1
10谢珺,王雨竹,陈波,张泽华,刘琴.基于双指导注意力网络的属性情感分析模型[J].计算机研究与发展,2022,59(12):2831-2843. 被引量：3

河北科技大学学报

2022年第6期

浏览历史

内容加载中请稍等...

基于度量学习和层级推理网络的抽取式摘要方法

相关作者

相关机构

相关主题

浏览历史