期刊文献+

融合实体信息的图卷积神经网络的短文本分类模型 被引量:7

Short text classification based on graph convolutional neural networks with entity information
下载PDF
导出
摘要 提出一种融合实体信息的图卷积神经网络模型(ETGCN),用于短文本分类.首先,使用实体链接工具抽取短文本中的实体;然后,利用图卷积神经网络对文档、实体和单词进行建模,丰富文本的潜在语义特征;将学习到的单词节点表示与BERT词嵌入进行拼接,通过双向长短期记忆网络,进一步挖掘文本上下文语义特征,再与图神经网络模型得到的文本特征进行融合,用于分类.实验结果表明,该模型在数据集AGNews、R52和MR上的分类准确率分别为88.38%、93.87%和82.87%,优于大部分主流的基线方法. An entity information fused graph convolutional neural network model(ETGCN)is proposed for short text classificaion.Firstly,the entities in the short text are extracted by the entity link tool.And then,the text,entities and words are modeled by the graph convolutional neural network to enrich the latent semantic features of the text.After that,the learned node representation of the word and the BERT word embedding are spliced and fed to the bidirectional long and short-term memory network to further mine the semantic features of the text context.The semantic features of the text context are merged with the text features obtained by the graph neural network model.The fusion features are used for the classification of the short text.The experimental results show that the classification accuracy of the model on data sets of AGNews,R52 and MR is 88.38%,93.87%and 82.87%,respectively,which is better than other mainstream baseline methods.
作者 王佳宇 李楹 马春梅 吴东昊 姜丽芬 WANG Jiayu;LI Ying;MA Chunmei;WU Donghao;JIANG Lifen(College of Computer and Information Engineering,Tianjin Normal University,Tianjin 300387,China)
出处 《天津师范大学学报(自然科学版)》 CAS 北大核心 2023年第1期67-72,共6页 Journal of Tianjin Normal University:Natural Science Edition
基金 国家自然科学基金资助项目(61902282) 天津市自然科学基金重点项目(18JCYBJC8900,18JCQNJC70200,20JCZDJC000) 天津市教委科研计划资助项目(2018KJ155) 天津市科技发展基金资助项目(JW1702) 广东省科技计划资助项目(2017KQNCX194)。
关键词 短文本分类 实体信息 图卷积神经网络 short text classification entity information graph convolutional neural networks
  • 相关文献

参考文献2

二级参考文献17

  • 1邹娟,周经野,邓成.一种基于语义分析的中文特征值提取方法[J].计算机工程与应用,2005,41(36):164-166. 被引量:6
  • 2谈文蓉,符红光,刘莉,杨宪泽.一种基于贝叶斯分类与机读词典的多义词排歧方法[J].计算机应用,2006,26(6):1389-1391. 被引量:5
  • 3Rennie J D M,Shih L,Teevan J,et al.Tackling the poor assumptions of Naive Bayes text classifiers [C]//Proceedings of the Twentieth International Conference on Machine Learning,2003,2:616-623.
  • 4Chiang J H,Chen Y C.Hierarchical fuzzy-KNN networks for news documents categorization[C]//lOth IEEE International Conference on Fuzzy Systems,2001(2) :720-723.
  • 5Sebastiani F,Nazionale C,Valdambrini N.An improved boosting algorithm and its application to text categorization[C]//Proceedings of the Ninth International Conference on Information and Knowledge Management, 2000: 78-85.
  • 6Zhang Hao,Berg A C,Maire M,et al.SVM-KNN:Discriminative nearest neighbor classification for visual category recognition[C]// IEEE Computer Society Conference on HHComputer Vision and Pattern Recognition, 2006 : 2126-2136.
  • 7Yang Y.An evaluaton of statistical approaches to text categorization[J].Information Retrieval, 1999,1 ( 1 ) : 76-78.
  • 8Komarek P,Moore A.Fast robust logistic regression for large sparse datasets with binary outputs[C]//Proceedings of the Ninth International Workshop on Artifical Intelligence and Statistics,2003:197-204.
  • 9Keerth S S,Duan K B,Shevade S K,et al.A fast dual algorithm for kernel logistic regression[J].Machine Learning,2005,61( 1 ) : 151-165.
  • 10Lin C J,Weng R C,Sathiya Keerthi S.Trust region Newton methods for large-scale logistic regression[C]//Proceedings of the 24th International Conference on Machine Learning,2007,3 : 561-568.

共引文献11

同被引文献61

引证文献7

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部