Survey on Automatic Text Summarization (cited by: 44)
Abstract: In recent years, the rapid development of Internet technology has greatly facilitated daily life, but it has also brought explosive growth in online information, making it essential to obtain the required information quickly and effectively. Automatic text summarization can effectively alleviate this problem. As an important research topic in natural language processing and artificial intelligence, it uses computers to automatically distill, from a long text or a collection of texts, a concise and coherent short text that accurately reflects the central content of the source. This paper discusses the scope of the automatic text summarization task, reviews and analyzes the development of the technique, and introduces in detail the work on the two main forms of summary generation (extractive and abstractive), covering feature scoring, classification algorithms, linear programming, submodular functions, graph ranking, sequence labeling, heuristic algorithms, and deep learning. It also analyzes the datasets and evaluation metrics commonly used in automatic text summarization, and finally discusses the challenges the field faces and the future trends in research and application.
Authors: Li Jinpeng; Zhang Chuang; Chen Xiaojun; Hu Yue; Liao Pengcheng (Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093; School of Cyber Security, University of Chinese Academy of Sciences, Beijing 100040)
Source: Journal of Computer Research and Development (《计算机研究与发展》), indexed in EI, CSCD, and the Peking University Core Journals list; 2021, No. 1, pp. 1-21 (21 pages)
Funding: National Natural Science Foundation of China (61602474).
Keywords: automatic text summarization; extractive method; abstractive method; deep learning; ROUGE metric
