期刊文献+

基于并行化递归神经网络的中文短文本情感分类 被引量:11

SENTIMENT CLASSIFICATION OF CHINESE SHORT TEXT BASED ON PARALLELIZED RECURSIVE NEURAL NETWORK
下载PDF
导出
摘要 情感分析的一个重要应用是判断用户对于产品评论的情感倾向,这些用户评论一般都是字数较少的短文本。传统方法多利用词袋模型获取单词的浅层特征来进行情感分析,利用这些简单特征训练的模型在短文本,尤其是在复杂语法问题上效果并不理想。通过利用深度递归神经网络算法来捕获句子语义信息,并引入中文"情感训练树库"作为训练数据来发现词语情感信息,在短文本情感五分类的问题上取得了较高的准确率。针对复杂模型在海量数据训练上的时间效率问题,通过在Spark并行框架下实现了模型的并行化处理,使得模型的可扩展性和时间效率得到提升。 A significant application of sentiment analysis is to determine the user's semannc orjentauon in product reviews which are generally short texts. Traditional methods often acquire the shallow characteristics of words for sentiment analysis through bag-of-words model. However, the model trained through these simple characteristics doesn' t have a good performance in short text, especially complex syntax context. Through using deep recursive neural network to capture the semantic information and introducing a Chinese sentiment training treebank as the training set to find the sentiment information, a relatively higher accuracy on five-class short text sentiment analysis is achieved. Aiming at the problem of training time efficiency in large scale data, the parallelization is implemented through Spark, which can enhance the scalability and time efficiency of the model.
作者 谢铁 郑啸 张雷 王修君 Xie Tie Zheng Xiao Zhang Lei Wang Xiujun(School of Computer Science and Technology,Anhui University of Technology, Maanshan 243002, Anhui, Chin)
出处 《计算机应用与软件》 2017年第3期205-211,232,共8页 Computer Applications and Software
基金 国家自然科学基金项目(61402008 61402009) 安徽省科技重大专项(16030901060) 安徽高校省级自然科学研究重大项目(KJ2014ZD 05) 安徽省高校优秀青年人才支持计划
关键词 深度学习 情感分析 文本分类 SPARK Deep learning Sentiment analysis Text classification Spark
  • 相关文献

参考文献6

二级参考文献123

  • 1朱嫣岚,闵锦,周雅倩,黄萱菁,吴立德.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20. 被引量:325
  • 2刘远超,王晓龙,徐志明,关毅.文档聚类综述[J].中文信息学报,2006,20(3):55-62. 被引量:65
  • 3徐琳宏,林鸿飞,杨志豪.基于语义理解的文本倾向性识别机制[J].中文信息学报,2007,21(1):96-100. 被引量:119
  • 4赵世奇,刘挺,李生.一种基于主题的文本聚类方法[J].中文信息学报,2007,21(2):58-62. 被引量:23
  • 5R.Vilalta and Y.Drissi.A perspective view and survey of meta-learning[J].Artificial Intelligence Review,2002,18(2):77-95.
  • 6Saso Dzeroski and Bernard Zenko:Is combining classifiers with stacking better than selecting the best one?[J].Machine Learning.2004,54(3):255-273.
  • 7Rie Ando and Tong Zhang.A framework for learning predictive structures from multiple tasks and unlabeled data[J].Journal of Machine Learning Research,2005,6:1817-1853.
  • 8B.Pang,L.Lee,and S.Vaithyanathan.Thumbs up? Sentiment classification using machine learning techniques[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing(EMNLP-02).2002.
  • 9B.Pang and L.Lee.A sentimental education:Sentiment analysis using subjectivity summarization based on minimum cuts[C]//Proceedings of the 42nd Meeting of the Association for Computational Linguistics(ACL-04).2004.
  • 10E.Riloff,S.Patwardhan,and J.Wiebe.Feature subsumption for opinion analysis[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing(EMNLP-06).2006.

共引文献962

同被引文献121

引证文献11

二级引证文献155

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部