摘要
针对因中文食品安全文本特征表达困难,而造成语义信息缺失进而导致分类器准确率低下的问题,提出一种基于深度神经网络的跨文本粒度情感分类模型。以食品安全新闻报道为目标语料,采用无监督的浅层神经网络初始化文本的词语级词向量。引入递归神经网络,将预训练好的词向量作为下层递归神经网络(Recursive Neural Network)的输入层,计算得到具备词语间语义关联性的句子特征向量及句子级的情感倾向输出,同时动态反馈调节词向量特征,使其更加接近食品安全特定领域内真实的语义表达。然后,将递归神经网络输出的句子向量以时序逻辑作为上层循环神经网络(Recurrent Neural Network)的输入,进一步捕获句子结构的上下文语义关联信息,实现篇章级的情感倾向性分析任务。实验结果表明,联合深度模型在食品安全新闻报道的情感分类任务中具有良好的效果,其分类准确率和F1值分别达到了86.7%和85.9%,较基于词袋思想的SVM模型有显著的提升。
Facing the difficulties in feature expression of Chinese food safety text information, and the loss of semantic information with low classification accuracy, a sentimental text classification model based on joint deep neural network was presented. The proposed model utilizes the corpora of food safety document captured from the internet, and word vector from word embedding method as the input for the neural network to get the pre-trained word vector. The pretrained word vector is further trained dynamically to get the word features and the sentimental classification of the sentence result, which better express the phrase-level sentimental relations for each sentence and the real semantic meaning in the food safety domain. Then the word feature of the sentence is inputted to the recurrent neural network (RNN) to catch the semantic information of the sentence structure further, realizing the sentimental classification of the text. The experiments show that our joint deep neural network model achieves better results in sentiment analysis on food safety information, compared with the bag-of-words based SVM model. The classification accuracy and F1 value reach 86.7% and 85. 9 % respectively.
出处
《计算机科学》
CSCD
北大核心
2016年第12期277-280,共4页
Computer Science
基金
国家自然科学基金(61303214)资助
关键词
联合神经网络模型
多粒度文本特征
词向量
食品安全
情感倾向性分析
Joint deep neural networks model, Multi-dimensional textual features, Word-embedding, Food safety, Sentiment analysis