期刊文献+

基于深度学习的中文影评情感分析 被引量:14

Sentiment analysis of Chinese movie reviews based on deep learning
下载PDF
导出
摘要 随着社交网络的兴起,更多人选择在网络上发表自己对影视作品的观点,这为影视投资人了解观众对电影的反馈提供了更方便的途径.例如,豆瓣影评中包含了海量用户或积极或消极的情感观点,而分析豆瓣影评的情感倾向能够辅助投资人进行决策,提升作品质量.大量数据分析必须借助计算机技术手段完成,其中情感分析是自然语言处理(natural language processing, NLP)的一个方向,常用来分析判断文本描述的情绪类型,因此也被称为情感倾向分析.为了提高影评情感分类的准确率,设置了多组对比实验来选择最优参数,比较了当以中文字符向量和词向量为输入矩阵时,双向长短期记忆(bidirectional long short-term memory,Bi-LSTM)模型和卷积神经网络(convolutional neural network, CNN)模型对分类准确率的影响.提出了一种以CNN模型为弱分类器的Bagging算法,训练了多个CNN模型,并采用投票法决定最终的分类结果.这种集成的方法减少了单个模型造成的分类偏差,比单一的Bi-LSTM模型的分类准确率提高了5.10%,比单一的CNN模型的分类准确率提高了1.34%. With the rise of social networks,more people choose to express their opinions on the internet,which allows film and television investors to collect the audience’s feedback more easily.The watercress movie review is just one such platform through which investors are able to know the viewers’taste and preference,and thereby to make better decision in investing the television and film industry.A large amount of data analysis must be done by means of computer technology.Sentiment analysis is a direction of natural language processing(NLP).Sentiment analysis,also known as emotional tendency analysis,is one aiming to analyze the positive or negative aspects of text description.In order to improve the accuracy of the film’s sentiment classification,multiple sets of contrast experiments are set to select the optimal parameters,and the Chinese character vectors and the word vectors are compared as the input matrix,in the bidirectional long short-term memory(Bi-LSTM)model and the convolutional neural network(CNN).A Bagging algorithm with CNN model as weak classifier is proposed.Multiple CNN models are trained to determine the final classification results by voting method.The integrated method reduces the deviation caused by a single model.The accuracy of a single Bi-LSTM model has increased by 5.10%,which is 1.34%higher than that of a single CNN model.
作者 周敬一 郭燕 丁友东 ZHOU Jingyi;GUO Yan;DING Youdong(School of Software Engineering,Suzhou Institute for Advanced Study,University of Science and Technology of China,Suzhou 215123,Jiangsu,China;Shanghai Film Academy,Shanghai University,Shanghai 200072,China)
出处 《上海大学学报(自然科学版)》 CAS CSCD 北大核心 2018年第5期703-712,共10页 Journal of Shanghai University:Natural Science Edition
关键词 双向长短期记忆模型 卷积神经网络模型 BAGGING算法 词嵌入向量 影评情感分析 bidirectional long short-term memory(Bi-LSTM)model convolutional neural network(CNN)model Bagging algorithm word embedding vector sentiment analysis of movie reviews
  • 相关文献

参考文献2

二级参考文献42

  • 1M.Q. Hu, B. Liu. Mining and Summarizing Custom- er Reviews[C]//ACM SIGKDD 2004.. 168-177.
  • 2Bo Pang, Lillian Lee. Opinion mining and sentiment a- nalysis[C]//Foundations and Trends in Information Retrieval, 2(1-2):1-135.
  • 3M.Q. Hu, B. Liu. Opinion Extraction and Summari- zation on the Web[C]//AAAI06, Boston: 1621-1624.
  • 4H. Yu, V. Hatzivassiloglou. Towards Answering O- pinion Question: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences[C]// EMNLP'03 : 129-136.
  • 5Bo Pang, Lillian Lee, Shivakumar Vaithyanathan. Thumbs up? sentiment classification using machine learning techniques[C]//ACL'02: 79-86.
  • 6Bo Pang, Lillian Lee. A sentimental education: Senti- ment analysis using subjectivity summarization based on minimum cuts[C]//ACL'04: 271-278.
  • 7E. Riloff, J. Wiebe. 2003. Learning extraction pat-terns for subjective expressions[C]//EMNLP'03: 105- 112.
  • 8Glance, N. , M. Hurst, K. Nigam, et al. 2005. Deri- ving marketing intelligence from online discussion [C]//SIGKDD'05 : 419-428.
  • 9Wilson, T. , J. Wiebe, P. Hoffmann. 2005. Recog- nizing contextual polarity in phrase-level sentiment a- nalysis[C]//HLT-EMNLP'05 .. 347-354.
  • 10Luciano Barbosa, Junlan Feng. 2010. Robust Senti- ment Detection on Twitter from Biased and Noisy Da- ta[C]//Coling 2010 (poster paper) : 36-44.

共引文献327

同被引文献102

引证文献14

二级引证文献78

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部