
DAPC: Dual Attention and Pointer-Coverage Network Based Summarization Model

Cited by: 7
Abstract: Attention-based sequence-to-sequence models are widely used for abstractive summarization and perform well. However, summaries generated by existing models commonly suffer from three problems: semantic irrelevance, within-sentence repetition, and out-of-vocabulary (OOV) words. To address these problems, this paper proposes DAPC (Dual Attention and Pointer-Coverage based model), an abstractive summarization method that builds on the typical attention-based sequence-to-sequence model and combines dual attention with pointer and coverage mechanisms. First, local attention is combined with a convolutional neural network to extract higher-level n-gram language features from the input text. Second, a pointer-generator network is introduced, which copies words from the source text through a pointing mechanism and thereby addresses the OOV problem. Finally, a coverage mechanism is used to reduce repetition in the generated summaries. Experiments on the non-anonymized CNN/Daily Mail dataset show that the model performs well, producing summaries with high semantic relevance to the source text and less repetition.
Authors: 张敏 (ZHANG Min); 曾碧卿 (ZENG Biqing); 韩旭丽 (HAN Xuli); 徐如阳 (XU Ruyang) (School of Computer, South China Normal University, Guangzhou 510631, China; School of Software, South China Normal University, Foshan, Guangdong 528225, China)
Source: Computer Engineering and Applications (《计算机工程与应用》), CSCD, Peking University Core Journal (北大核心), 2020, No. 8, pp. 149-157 (9 pages)
Funding: National Natural Science Foundation of China (No. 61772211, No. 61503143).
Keywords: abstractive summarization; local attention; sequence-to-sequence framework; coverage mechanism
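
The abstract names a pointer-generator network and a coverage mechanism without giving formulas. As a rough illustration only, the sketch below follows the commonly used formulation of these two ideas (mixing a generation distribution with a copy distribution over an extended vocabulary, and penalizing attention that overlaps with accumulated past attention). It is an assumption that DAPC uses this exact form; the function names `final_distribution` and `coverage_loss` and all parameter names are hypothetical and do not come from the paper.

```python
def final_distribution(p_gen, vocab_dist, attention, source_ids, extended_size):
    """Mix generation and copy probabilities over an extended vocabulary.

    p_gen         -- probability of generating from the fixed vocabulary
    vocab_dist    -- probabilities over the fixed vocabulary (sums to 1)
    attention     -- attention weights over source positions (sums to 1)
    source_ids    -- extended-vocabulary id of the word at each source position;
                     OOV source words get ids >= len(vocab_dist)
    extended_size -- fixed vocabulary size plus the number of source OOV words
    """
    final = [0.0] * extended_size
    # Generation part: scale the vocabulary distribution by p_gen.
    for word_id, prob in enumerate(vocab_dist):
        final[word_id] = p_gen * prob
    # Copy part: add attention mass to the id of each source word.
    for attn, word_id in zip(attention, source_ids):
        final[word_id] += (1.0 - p_gen) * attn
    return final


def coverage_loss(attention_steps):
    """Sum over decoder steps of sum_i min(a_i, c_i).

    The coverage vector c at a step is the sum of the attention
    distributions of all earlier decoder steps.
    """
    coverage = [0.0] * len(attention_steps[0])
    total = 0.0
    for attn in attention_steps:
        total += sum(min(a, c) for a, c in zip(attn, coverage))
        # Accumulate this step's attention into the coverage vector.
        coverage = [c + a for c, a in zip(coverage, attn)]
    return total


# Toy example: fixed vocabulary of 3 words; the second source word is OOV (id 3).
dist = final_distribution(0.5, [0.2, 0.3, 0.5], [0.4, 0.6], [1, 3], 4)
print(dist)

# Attending to the same source position twice is penalized; moving on is not.
print(coverage_loss([[1.0, 0.0], [1.0, 0.0]]))
print(coverage_loss([[1.0, 0.0], [0.0, 1.0]]))
```

In this toy setting the OOV source word receives nonzero probability only through the copy term, which is how a pointing mechanism lets a model emit words outside its fixed vocabulary, and the coverage penalty grows only when attention revisits positions that were already attended to.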