期刊文献+

An Empirical Study of Unsupervised Sentiment Classification of Chinese Reviews 被引量:1

An Empirical Study of Unsupervised Sentiment Classification of Chinese Reviews
原文传递
导出
摘要 This paper is an empirical study of unsupervised sentiment classification of Chinese reviews. The focus is on exploring the ways to improve the performance of the unsupervised sentiment classification based on limited existing sentiment resources in Chinese. On the one hand, all available Chinese sentiment lexicons - individual and combined - are evaluated under our proposed framework. On the other hand, the domain dependent sentiment noise words are identified and removed using unlabeled data, to improve the classification performance. To the best of our knowledge, this is the first such attempt. Experiments have been conducted on three open datasets in two domains, and the results show that the proposed algorithm for sentiment noise words removal can improve the classification performance significantly. This paper is an empirical study of unsupervised sentiment classification of Chinese reviews. The focus is on exploring the ways to improve the performance of the unsupervised sentiment classification based on limited existing sentiment resources in Chinese. On the one hand, all available Chinese sentiment lexicons - individual and combined - are evaluated under our proposed framework. On the other hand, the domain dependent sentiment noise words are identified and removed using unlabeled data, to improve the classification performance. To the best of our knowledge, this is the first such attempt. Experiments have been conducted on three open datasets in two domains, and the results show that the proposed algorithm for sentiment noise words removal can improve the classification performance significantly.
出处 《Tsinghua Science and Technology》 SCIE EI CAS 2010年第6期702-708,共7页 清华大学学报(自然科学版(英文版)
基金 Supported by the National Natural Science Foundation of China(Nos.60405011,60575057,and 60875073)
关键词 sentiment noise words unsupervised sentiment classification domain dependent sentiment noise words unsupervised sentiment classification domain dependent
  • 相关文献

参考文献19

  • 1Liu B. Sentiment analysis and subjectivity. In: Indurkhya N, Damerau F J, eds. Handbook of Natural Language Processing, 2010.
  • 2Pang B, Lee L. Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, 2008, 2(1-2): 1-135.
  • 3Pang B, Lee L. A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of ACL. Barcelona, Spain, 2004: 271-278.
  • 4Pang B, Lee L, Vaithyanathan S. Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of EMNLR Philadelphia, USA, 2002: 79-86.
  • 5Zhai Z, Xu H, Li J, et al. Sentiment classification for Chinese reviews based on key substring features. In: Proceedings of IEEE NLP-KE. Dalian, China, 2009: 452-459.
  • 6Zhai Z, Xu H, Li J, et al. Feature subsumption for sentiment classification in multiple languages. In: Proceedings of PAKDD. Hyderabad, India, 2010:261-271,.
  • 7Liu B. Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data. Springer, 2006.
  • 8Devitt A, Ahmad K. Sentiment analysis in financial news: A cohesion-based approach. In: Proceedings of ACL. Prague, Czech Republic, 2007: 984-991.
  • 9Ding X, Liu B, Yu P S. A holistic lexicon-based approach to opinion mining. In: Proceedings of WSDM. Palo Alto, California, 2008:231-240.
  • 10Hu M, Liu B. Mining opinion features in customer reviews. In: Proceedings of AAAI. San Jose, California, 2004: 755-760.

同被引文献16

  • 1俞鸿魁,张华平,刘群,吕学强,施水才.基于层叠隐马尔可夫模型的中文命名实体识别[J].通信学报,2006,27(2):87-94. 被引量:160
  • 2胡熠,陆汝占,李学宁,段建勇,陈玉泉.基于语言建模的文本情感分类研究[J].计算机研究与发展,2007,44(9):1469-1475. 被引量:23
  • 3知网[EB/OL].[2007-09-03].http://www.keenage.com.
  • 4WANG Su-ge, LI De-yu, SONG Xiao-lei, et al. A feature selection method based on improved fisher' s discriminant ratio for text senti- ment classification[ J ]. Expert Systems with Applications, 2011, 38(7) :8696-8702.
  • 5PANG B, LEE L. Opinion mining and sentiment analysis[ J]. Foun- dations and Trends in Information Retrieval, 2008,2 ( 1 ) : 1-135.
  • 6ZHAI Zhong-wu, XU Hua, KANG Ba-da, et al. Exploiting effective features for Chinese sentiment classification [ J ]. Expert Systems with Applications, 2011,38(8) :9139-9146.
  • 7HALL M, FRANK E, HOLMES G, et al. The WEKA data mining software: an update[J]. ACM SIGKDD Explorations Newsletter, 2009,11 (1) :10-18.
  • 8BISHOP C M. Neural networks for pattern recognition [ M ]. New York : Oxford University Press, 1995.
  • 9谭松波.中文情感挖掘语料ChenSentiCorp[EB/OL].(2010-06-29)[2011-04-22].http:/)'www.searchforum.org.en/tansongbo/corpussenti.htm.
  • 10BREIMAN L. Bagging predictor[ J]. Machine Learning, 1996, 24 (2) :123-140.

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部