期刊文献+

融合多重注意力机制的卷积神经网络文本分类设计与实现 被引量:12

Design and Implementation of Text Classification Based on Convolutional Neural Network with Multiple Attention Mechanisms
下载PDF
导出
摘要 针对单一的卷积神经网络文本分类模型忽视词语在上下文的语义变化,未对影响文本分类效果的关键特征赋予更高权值的问题,提出了一种融合多重注意力机制的卷积神经网络文本分类模型.该模型将注意力机制分别嵌入卷积神经网络的卷积层前后,对影响文本分类效果的高维特征和低维特征进行权值的重新分配,优化特征提取过程,实现特征向量的精确分类.在池化层采用平均池化和最大池化相结合的方法,从而减少特征图的尺寸,避免过拟合现象的发生,最后使用softmax函数进行分类.本文在三个不同的中英文数据集上进行实验,同时设计注意力机制重要性对比实验,分析自注意力机制与CNN结合对文本分类效果提升的重要性,结果表明该分类模型有效地提高了分类的准确性. Aiming at the problem that the single convolutional neural network text classification model ignores the semantic changes of words in context and does not assign higher weights to the important features that affect the accuracy of the model,a novel text classification model based on convolutional neural network with multiple attention Mechanism is proposed.In this model,the attention mechanism is embedded into the convolutional layer of the convolutional neural network respectively,and the weight of high-dimensional features and low-dimensional features that affect the text classification effect is redistributed,the feature extraction process is optimized,and the precise classification of feature vectors is realized.In the pooling layer,the method of combining average pooling and maximum pooling was adopted,so as to reduce the size of feature map and avoid the occurrence of overfitting.Finally,softmax function was used for classification.In this paper,experiments were carried out on three different Chinese and English data sets.Meanwhile,comparative experiments on the importance of attention mechanism were designed to analyze the importance of the combination of self-attention mechanism and CNN to the improvement of text classification effect.The results show that this classification model can effectively improve the accuracy of classification.
作者 闫跃 霍其润 李天昊 毛煜 YAN Yue;HUO Qi-run;LI Tian-hao;MAO Yu(College of Information Engineering,Capital Normal University,Beijing 100048,China;School of Computer Science and Technology,Beijing Institute of Technology,Beijing 100081,China)
出处 《小型微型计算机系统》 CSCD 北大核心 2021年第2期362-367,共6页 Journal of Chinese Computer Systems
基金 国家自然科学基金项目(62077002)资助 北京市教委科研计划项目(KM201810028016)资助 首都师范大学交叉科学研究院资助.
关键词 自注意力机制 卷积神经网络 特征提取 文本分类 self-attention mechanism convolutional neural network feature extraction text classification
  • 相关文献

参考文献7

二级参考文献34

  • 1王细薇,樊兴华,赵军.一种基于特征扩展的中文短文本分类方法[J].计算机应用,2009,29(3):843-845. 被引量:36
  • 2樊兴华,孙茂松.一种高性能的两类中文文本分类方法[J].计算机学报,2006,29(1):124-131. 被引量:70
  • 3刘磊,曹存根,王海涛,陈威.一种基于“是一个”模式的下位概念获取方法[J].计算机科学,2006,33(9):146-151. 被引量:18
  • 4李峰,李芳.中文词语语义相似度计算——基于《知网》2000[J].中文信息学报,2007,21(3):99-105. 被引量:105
  • 5SebastianiI F. Machine Learning in Automated Text Categorization Consiglio Nazionale delle Rieerche[J]. Italy. ACM Computing Surveys,2002,34(1) : 1-47
  • 6Zelikovitz S,Transductive M F. Learning for Short-Text Classification Problem using Latent Semantic Indexing International [J]. Journal of Pattern Recognition and Artificial Intelligence, 2005,19(2) : 143-163
  • 7Pu Qiang, Yang Guo Wei. Short-Text Classification Based on ICA and LSA[J]//Proceedings of International Symposium on Neural Networks, 2006 (ISNN 2) : 265-270
  • 8马后锋 樊兴华.一种改进的增量贝叶斯分类算法[J].仪器仪表学报,2007,28(8Ⅲ):312-316.
  • 9Chen Enhong,Wu Gaofeng. An Ontology Learning Method Enhanced by Frame Semantics [J]//Proceedings of the Seventh IEEE International Symposium on Multimedia. 2005:374-382
  • 10郑德权,赵铁军,李生,等.基于内容的词义本体知识自动获取[A]∥全国第八届计算语言学联合学术会议(JSCL-2005)论文集[C].2005.

共引文献147

同被引文献83

引证文献12

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部