
Exploiting Multi-layer Interactive Attention for Abstractive Text Summarization
Abstract: Attention-based encoder-decoder models have been widely applied to sequence-to-sequence tasks such as text summarization and machine translation. In deep learning frameworks, deep neural networks extract different feature representations of the input at each layer, so conventional encoder-decoder models typically stack multiple decoder layers to improve performance. However, existing models attend only to the output of the encoder's last layer during decoding, ignoring the features of the remaining encoder layers. To address this, this paper proposes an abstractive summarization model based on a multi-layer recurrent neural network and a multi-layer interactive attention mechanism, which extracts feature information from different encoder layers to guide summary generation. To handle the information redundancy introduced by incorporating features from multiple layers, a variational information bottleneck is adopted to compress noise in the data. Experiments on the Gigaword and DUC2004 summarization datasets show that the proposed method achieves state-of-the-art performance.
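The abstract names two components: attention computed over every encoder layer rather than only the last, and a variational information bottleneck that compresses the resulting context. The sketch below illustrates both ideas in PyTorch; the class names, the per-layer bilinear scoring, and the linear fusion are illustrative assumptions, since the paper's exact formulation is not given here.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiLayerInteractiveAttention(nn.Module):
    """Attends over the outputs of every encoder layer, not just the last.

    Hypothetical sketch: each decoder step computes one attention
    distribution per encoder layer and fuses the per-layer context
    vectors with a learned linear layer (an assumption, not the
    paper's exact design).
    """
    def __init__(self, hidden_size, num_layers):
        super().__init__()
        # One bilinear score projection per encoder layer (assumption).
        self.score = nn.ModuleList(
            nn.Linear(hidden_size, hidden_size, bias=False)
            for _ in range(num_layers)
        )
        # Fuse the per-layer contexts into a single vector.
        self.fuse = nn.Linear(num_layers * hidden_size, hidden_size)

    def forward(self, decoder_state, encoder_layers):
        # decoder_state: (batch, hidden)
        # encoder_layers: list of (batch, src_len, hidden), one per layer
        contexts = []
        for layer_out, proj in zip(encoder_layers, self.score):
            # Bilinear attention scores against this encoder layer.
            scores = torch.bmm(layer_out,
                               proj(decoder_state).unsqueeze(2)).squeeze(2)
            weights = F.softmax(scores, dim=-1)          # (batch, src_len)
            contexts.append(
                torch.bmm(weights.unsqueeze(1), layer_out).squeeze(1))
        return torch.tanh(self.fuse(torch.cat(contexts, dim=-1)))

class VariationalInformationBottleneck(nn.Module):
    """Compresses the fused context into a stochastic code z ~ N(mu, sigma^2),
    penalized by KL(q(z|x) || N(0, I)) to squeeze out redundant information."""
    def __init__(self, hidden_size, bottleneck_size):
        super().__init__()
        self.mu = nn.Linear(hidden_size, bottleneck_size)
        self.logvar = nn.Linear(hidden_size, bottleneck_size)

    def forward(self, context):
        mu, logvar = self.mu(context), self.logvar(context)
        # Reparameterization trick keeps sampling differentiable.
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)
        kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp(), dim=-1)
        return z, kl.mean()
```

In training, the KL term returned by the bottleneck would be added to the summarization cross-entropy loss with a weighting coefficient; how the compressed code is consumed by the decoder is not specified in the abstract.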
Authors: HUANG Yuxin, YU Zhengtao, XIANG Yan, GAO Shengxiang, GUO Junjun (Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China; Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming 650500, China)
Source: Journal of Frontiers of Computer Science and Technology (CSCD; Peking University Core Journal), 2020, Issue 10, pp. 1681-1692 (12 pages)
Funding: National Key Research and Development Program of China (Nos. 2018YFC0830105, 2018YFC0830101, 2018YFC0830100); National Natural Science Foundation of China (Nos. 61972186, 61762056, 61472168)
Keywords: text summarization; encoder-decoder model; multi-layer interactive attention; variational information bottleneck