期刊文献+

TP-AS:一种面向长文本的两阶段自动摘要方法 被引量:12

TP-AS:A Two-phase Approach to Long Text Automatic Summarization
下载PDF
导出
摘要 随着互联网上信息的爆炸式增长,如何有效提高知识获取效率变得尤为重要。文本自动摘要技术通过对信息的压缩和精炼,为知识的快速获取提供了很好的辅助手段。现有的文本自动摘要方法在处理长文本的过程中,存在准确率低的问题,无法达到令用户满意的性能效果。为此,该文提出一种新的两阶段的长文本自动摘要方法TP-AS,首先利用基于图模型的混合文本相似度计算方法进行关键句抽取,然后结合指针机制和注意力机制构建一种基于循环神经网络的编码器—解码器模型进行摘要生成。通过基于真实大规模金融领域长文本数据上的实验,验证了TP-AS方法的有效性,其自动摘要的准确性在ROUGE-1的指标下分别达到了36.6%(词)和33.9%(字符),明显优于现有其他方法。 With the explosive growth of information on the Internet,it becomes more important to improve the effi- ciency of knowledge acquisition. Automatic text summarization techniques provide a good means for fast knowledge acquisition by compressing and refining information. Existing automatic text summarization methods,when dealing with long text,exhibit poor accuracy,and fail to meet users' need for performance. In this paper,we propose a two- phase automatic summarization method for long text, namely,TP-AS. Firstly,it employs a hybrid semantic similarity computation method based on a graph model to extract key sentences. Then,it constructs a recurrent neural network encoder-decoder model with attention and pointer mechanisms to generate summaries. Through experiments on real large-scale long-text corpora in financial area, the effectiveness of TP-AS is verified, and its accuracy for automatic summarization notably outperforms other existing methods.
作者 王帅 赵翔 李博 葛斌 汤大权 WANG Shuai;ZHAO Xiang;LI Bo;GE Bin;TANG Daquan(Science and Technology on Information System Engineering Laboratory,National University of Defense Technology,Changsha,Hunan 410073,China;Collaborative Innovation Center of Geospatial Technology,Wuhan,Hubei 430079,China)
出处 《中文信息学报》 CSCD 北大核心 2018年第6期71-79,共9页 Journal of Chinese Information Processing
基金 国家自然科学基金(61402494 61402498 71690233) 湖南省自然科学基金(2015JJ4009)
关键词 自动文本摘要 自然语言处理 抽取和生成 循环神经网络 automatic text summarization natural language processing extraction and abstraction RNN
  • 相关文献

参考文献3

二级参考文献31

  • 1莫燕,王永成.中文文献摘要的自动编制[J].现代图书情报技术,1993(3):10-12. 被引量:15
  • 2C Brew,H S Thompson.Automatic evaluation of computer generated text:A progress report on the TextEval project.Human Language Technology Workshop,Arpa,Isto,1994
  • 3Shiwen Yu.Automatic evaluation of quality for machine translation systems.Machine Translation,1993,8(1/2):117~126
  • 4A Guessoum,R Zantout.Semi-automatic evaluation of the grammatical coverage of machine translation systems.MT Summit Conf,Santiago de Compostela,2001
  • 5Sungryong Koh,Jinee Maeng,et al.A test suite for evaluation of English-to-Korean machine translation systems.MT Summit Conf,Santiago de Compostela,2001
  • 6Shoichi Yokoyama,Hideki Kashioka,et al.An automatic evaluation method for machine translation using two-way MT.MT Summit Conf,Santiago de Compostela,2001
  • 7Keiji Yasuda,Fumiaki Sugaya,et al.An automatic evaluation method of translation quality using translation answer candidates queried from a parallel corpus.MT Summit Conf,Santiago de Compostela,2001
  • 8Yasuhiro Akiba,Kenji Imamura,Eiichiro Sumita.Using multiple edit distances to automatically rank machine translation output.MT Summit Conf,Santiago de Compostela,2001
  • 9Jianmin Yao,Ming Zhou,et al.An automatic evaluation method for localization oriented lexicalised EBMT system.The 19th Int'l Conf on Computational Linguistics (COLING-2002),Taipei,2002
  • 10V I Levenshtein.Binary codes capable of correcting deletions,insertions and reversals.Doklady Akademii Nauk SSSR,1965,163(4):845~848

共引文献41

同被引文献117

引证文献12

二级引证文献132

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部