结合预训练的多文档摘要:研究

Study on Pre-training Tasks for Multi-document Summarization

下载PDF

导出

摘要新闻文本摘要任务旨在从庞大复杂的新闻文本中快速准确地提炼出简明扼要的摘要。基于预训练语言模型对多文档摘要进行研究,重点研究结合预训练任务的具体模型训练方式对模型效果提升的作用,强化多文档之间的信息交流,以生成更全面、更简练的摘要。对于结合预训练任务,提出对基线模型、预训练任务内容、预训练任务数量、预训练任务顺序的对比实验,探索标记了行之有效的预训练任务,总结归纳了强化多文档之间的信息交流的具体方法,精炼提出了简明高效的预训练流程。在公开新闻多文档数据集上进行训练和测试,实验结果表明预训练任务的内容、数量、顺序对ROUGE值都有一定提升,并且整合三者结论提出的特定预训练组合对ROUGE值有明显提升。 News summarization aims to quickly and accurately extract a concise summary from the complex news text.This paper studies the multi-document summary based on the pre-training language model,focusing on the effect of model training methods combined with pre-training tasks on improving model performance,and strengthening information exchange between multiple documents to generate more comprehensive and brief summaries.For combined pre-training tasks,this paper conducts comparative experiments on the baseline model,pre-training task content,pre-training task quantity,and pre-training task order,explores and marks effective pre-training tasks,summarizes the specific methods to strengthen the information exchange between documents,and refines and proposes a concise and efficient pre-training process.Through training and testing on the public news multi-document dataset,experimental results show that the content,quantity,and order of the pre-training tasks have a certain improvement on the ROUGE value,and the specific pre-training combination proposed by integrating the conclusions of the three has a significant increase in the ROUGE value.

作者丁一王中卿 DING Yi;WANG Zhongqing(School of Computer Science and Technology,Soochow University,Suzhou,Jiangsu 215006,China)

机构地区苏州大学计算机科学与技术学院

出处《计算机科学》 CSCD 北大核心 2024年第S01期174-181,共8页 Computer Science

关键词新闻摘要: 预训练多文档信息交流 News Summarization Pre-training Multi-document Information exchange

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1谭亮亮,廖文强.黔东南非遗反排木鼓舞步伐训练组合短句分析[J].艺术时尚,2023(17):58-60.
2罗俊如,丁言瑞,徐明华,胡超,刘炳官,孔维军,马强维,石林.基于深度AUC最大化算法的井漏风险预测[J].常州大学学报（自然科学版）,2024,36(3):34-44.
3李小涛,王子牛,张剑锋,王惠娟,高原.高强度间歇训练在航天失重防护中的应用[J].航天医学与医学工程,2024,35(1):66-72.

计算机科学

2024年第S01期

浏览历史

内容加载中请稍等...

结合预训练的多文档摘要:研究

相关作者

相关机构

相关主题

浏览历史