期刊文献+

基于深度学习的生成式文本摘要技术综述 被引量:14

Survey on Abstractive Text Summarization Technologies Based on Deep Learning
下载PDF
导出
摘要 在互联网数据急剧扩张和深度学习技术高速发展的背景下,自动文本摘要任务作为自然语言处理领域的主要研究方向之一,其相关技术及应用被广泛研究。基于摘要任务深化研究需求,以研究过程中存在的关键问题为导向,介绍现有基于深度学习的生成式文本摘要模型,简述定义及来源、数据预处理及基本框架、常用数据集及评价标准等,指出发展优势和关键问题,并针对关键问题阐述对应的可行性解决方案。对比常用的深度预训练模型和创新方法融合模型,分析各模型的创新性和局限性,提出对部分局限性问题的解决思路。进一步地,对该技术领域的未来发展方向进行展望总结。 Boosted by the rapid expansion of Internet data and the development of deep learning technologies,automatic text summarization is now one of the main research directions in the field of natural language processing.Its related technologies and applications have been widely studied.To assist further studies required by summarization tasks,and to help solve the key problems in the earlier studies,this paper introduces the existing abstractive text summarization models based on deep learning by briefly describing their definition and source,data preprocessing and basic framework,common data sets,and evaluation standards.Additionally,the paper gives the development advantages and key problems of the models,and elaborates on the corresponding feasible solutions.Then the paper compares the commonly used deep pre-trained models and innovative methods,analyzes the innovations and limits of each model,and gives corresponding solutions.Finally,the paper discusses the future development directions in this field.
作者 朱永清 赵鹏 赵菲菲 慕晓冬 白坤 尤轩昂 ZHU Yongqing;ZHAO Peng;ZHAO Feifei;MU Xiaodong;BAI Kun;YOU Xuanang(College of Operational Support,Rocket Force University of Engineering,Xi’an 710025,China;Army Academy of Border and Coastal Defence,Xi’an 710025,China)
出处 《计算机工程》 CAS CSCD 北大核心 2021年第11期11-21,28,共12页 Computer Engineering
基金 国家部委基金。
关键词 深度学习 生成式文本摘要 未登录词 生成重复 长程依赖 评价标准 deep learning abstractive text summarization Out of Vocabulary(OOV) generative repetition long-term dependence evaluation criteria
  • 相关文献

参考文献2

二级参考文献37

  • 1刘德荣 ,王永成 ,刘传汉 .基于主题概念的多文档自动摘要研究[J].情报学报,2005,24(1):69-74. 被引量:7
  • 2耿焕同,蔡庆生,赵鹏,于琨.一种基于词共现图的文档自动摘要研究[J].情报学报,2005,24(6):651-656. 被引量:15
  • 3马慧芳,祁云平,杨小东.一种基于文本关系图的多文档自动摘要技术[J].情报杂志,2007,26(3):67-69. 被引量:7
  • 4Kupiec J,Pedersen J,Chen F.A Trainable Document Summarizer[C].ACM SIGIR.New York,USA,1995.
  • 5Conroy J M,O'leary D P.Text Summarization Via Hidden Markov Models[C].ACM SIGIR,New Orleans,Louisiana,USA,2001.
  • 6Luhn H P.The Automatic Creation of Literature Abstracts[J].IBM Journal of Research Development,1958,2(2):159.
  • 7Text Summarization[EB/OL].http://www.summarization.com/sigirtutorial2001.ppt.
  • 8Baxendale P.Machine-made Index for Technical Literature-an Experiment[J].IBM Journal of Research Development,1958,2(4):354.
  • 9Edmundson,H P.New Methods in Automatic Extracting[J].Journal of the ACM,1969,16(2):264.
  • 10C Aone M E.Okurowaki,J Gorlinsky,and B.Larsen.A Trainable Summarizer With Knowledge Acquired from Robust NLP Techniques//I.Mani and M.Maybury(eds.).Advances in Automated Text Summarization.adl:MIT Press,1999:71.

共引文献38

同被引文献73

引证文献14

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部