期刊文献+

基于改进卷积神经网络的短文本分类模型 被引量:18

Short Text Classification Model Based on Improved Convolutional Neural Network
下载PDF
导出
摘要 基于卷积神经网络,提出一种基于改进卷积神经网络的短文本分类模型.首先,采用不同编码方式将短文本映射到不同空间下的分布式表示,提取不同粒度的数字特征作为短文本分类模型的多通道输入,并根据标准知识库提取概念特征作为先验知识,提高短文本的语义表征能力;其次,在全连接层增加自编码学习策略,在近似恒等的基础上进一步组合数字特征,模拟数据内部的关联性;最后,利用相对熵原理为模型增加稀疏性限制,降低模型复杂度的同时提高模型的泛化能力.通过对开源数据集进行短文本分类实验,验证了模型的有效性. We proposed a short text classification model based on improved convolutional neural network.Firstly,different coding methods were used to map short text to distributed representation in different spaces,and digital features of different granularities were extracted as multi-channel inputs of short text classification model.Extracting concept features from standard knowledge base as prior knowledge to improve the semantic representation ability of short text.Secondly,the self-coding learning strategy was added to the full connection layer,on the basis of approximate identity,the digital features were further combined to simulate the relevance within the data.Finally,the principle of relative entropy were used to increase the sparsity limit of the model,reduce the complexity and improve the generalization ability of the model.The effectiveness of the proposed model was verified by short text classification experiments on the open source dataset.
作者 高云龙 吴川 朱明 GAO Yunlong;WU Chuan;ZHU Ming(Changchun Institute of Optics,Fine Mechanics and Physics,Chinese Academy of Science,Changchun 130033,China;Key Laboratory of Airborne Optical Imaging and Measurement,Chinese Academy of Sciences,Changchun 130033,China)
出处 《吉林大学学报(理学版)》 CAS 北大核心 2020年第4期923-930,共8页 Journal of Jilin University:Science Edition
基金 国家自然科学基金(批准号:61401425) 吉林省科技发展计划项目(批准号:20200571505JH).
关键词 卷积神经网络 短文本 概念分布式表示 稀疏 自编码 convolutional neural network short text concept distributed representation sparsity self-coding
  • 相关文献

同被引文献151

引证文献18

二级引证文献92

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部