期刊文献+

基于情感关键句抽取的情感分类研究 被引量:27

Sentiment Classification Analysis Based on Extraction of Sentiment Key Sentence
下载PDF
导出
摘要 情感分析需要解决的一个重要问题是判断一篇文档的极性是正面的还是负面的.情感分类的正确率很难达到普通文本分类的水平,因为情感分类更难更复杂.在判断文档的情感极性时,不同的句子具有不同的情感贡献度,所以,对整篇文档的关键句和细节句进行区分将有助于提高情感分类的性能.关键句通常简短且具有判别性,而细节描述句通常复杂多样且容易引入歧义.在关键句抽取算法中,考虑3类属性:情感属性、位置属性和关键词属性.为了更好地利用关键句和细节句之间的差异性和互补性,将抽取的关键句分别用于有监督的和半监督的情感分类.在有监督情感分类中,采用的是分类器融合的方法;在半监督情感分类中,采用的是Co-training算法.在8个领域上进行实验,结果表明所提方法性能明显优于Baseline,从而证明情感关键句抽取算法是有效的. A key problem of sentiment analysis is to determine the polarity of a review is positive (thumbs up) or negative (thumbs down). Unlike topic-based text classification, where a high accuracy can be achieved, the sentiment classification is a hard and complicated task. One of the main challenges for document-level sentiment classification is that not every part of the document is equally informative for inferring the polarity of the whole document. Thus, makinga distinction between key sentences and trivial sentences will be helpful to improve the sentiment classification performance. Wc divide a document into key sentences and detailed sentences. Key sentence is usually brief but discriminative while detailed sentences are diverse and ambiguous. For key sentence extraction, our approach takes three attributes into account: sentiment attribute, position attribute and special words attribute. To make use of the discrepancy and complementarity of key sentences and detailed sentences, we incorporate key sentences and detailed sentences in supervised and semi supervised learning. In supervised sentiment classification, a classifier combination approach is adopted because the original document is divided into two different and complementary parts; in semi-supervised sentiment classification, a co-training algorithm is proposed to incorporate unlabeled data for sentiment classification better than the baseline Experimental results across eight domains show that our method and the key sentence extraction is effective.
出处 《计算机研究与发展》 EI CSCD 北大核心 2012年第11期2376-2382,共7页 Journal of Computer Research and Development
基金 国家自然科学基金重点项目(60933005) 国家自然科学基金项目(60803085) 国家"八六三"高技术研究发展计划基金项目(2010AA012500)
关键词 情感分类 关键句 分类器融合 联合训练 有监督学习 半监督学习 sentiment classification key sentence classifier combination co waining supervisedlearning semi-supervised learning
  • 相关文献

参考文献16

  • 1杜伟夫,谭松波,云晓春,程学旗.一种新的情感词汇语义倾向计算方法[J].计算机研究与发展,2009,46(10):1713-1720. 被引量:21
  • 2吴琼,谭松波,许洪波,段洣毅,程学旗.基于随机游走模型的跨领域倾向性分析研究[J].计算机研究与发展,2010,47(12):2123-2131. 被引量:11
  • 3胡熠,陆汝占,李学宁,段建勇,陈玉泉.基于语言建模的文本情感分类研究[J].计算机研究与发展,2007,44(9):1469-1475. 被引量:23
  • 4Pang B, Lee 1., VaithyanaIhan S. Thumbs up? Sentiment classification using machine learning techniques [C]//Proc of EMNI.P. New York: ACM, 2002:79 8G.
  • 5Turney P. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews [C] //Proc of ACI. New York: ACM, 2002: 417-424.
  • 6Dasgupta S, Ng V. Mine the easy and classify the hard: A semi supervised approach to automatic sentiment classification [C] //Proc of ACL. New York. ACM, 200.9: 701-709.
  • 7Yesscnalina A, Yuc Y, Cardic C. Multi level structured models for document level sentiment classification [C] //Proc of EMNLP. New York: ACM, 2010:1046-1056.
  • 8Gamon M. Sentiment classification on customer feedback data: Noisy data, large feature vectors, and the role of linguistic analysis [C] //Proc of the Int Conf on Computational Linguistics. New York: ACM, 2004.
  • 9Melville P, Gryc W, Lawrence R. Sentiment analysis of blogs by combining lexical knowledge with text classification [C] //Procof SIGKDD. New York: ACM, 2009.
  • 10l.i Shoushan, Huang Churen, Zhou Guodong, et al. Employing personal/impersonal views in supervised and semi supervised sentiment classification [C] //Proc of ACL. Ncw York: ACM, 2010:414-423.

二级参考文献48

  • 1朱嫣岚,闵锦,周雅倩,黄萱菁,吴立德.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20. 被引量:325
  • 2徐琳宏,林鸿飞,杨志豪.基于语义理解的文本倾向性识别机制[J].中文信息学报,2007,21(1):96-100. 被引量:119
  • 3刘群 李素建.基于《知网》的词汇语义相似度的计算.中文计算语言学,2002,17(2):59-76.
  • 4Tumey P. Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews [C] //Proc of the 40th Annual Meeting of the Association for Computational Linguistics. New,York: ACM, 2002: 417- 424.
  • 5Pang B, Lee L, Shivakumar V. Thumbs up? sentiment classification using machine learning techniques [C]//Proc of the 2002 Conf on Empirical Methods in Natural Language Processing. Stroudsburg, PA, USA: ACL, 2002:79-86.
  • 6Wiebe J M. Learning subjective adjectives from corpora [C] //Proc of the 17th National Conf on Artificial Intelligence. Menlo Park: AAAI Press, 2000:735-740.
  • 7Hatzivassiloglou V, McKeown K R. Predicting the semantic orientation of adjectives [C]//Proc of the 35th Annual Meeting of the Association for Computational Linguistics and the 8th Conf of the European Chapter of the Association for Computational Linguistics. Stroudsburg. PA, USA: ACL, 1997:174-181.
  • 8Turney P, Littman M. Measuring praise and criticism: inference of semantic orientation from association [J]. ACM Trans on Information Systems, 2003, 21(4): 315-346.
  • 9Pang B, Lee L. A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts [C] //Proc of the 42nd Annual Meeting on Association for Computational Linguistics. Srroudsburg, PA, USA: ACL. 2004:271-278.
  • 10Takamura H, Inui T, Okumura M. Extracting semantic orientations of words using Spin Model [C]//Proc of the 43rd Annual Meeting of the ACL. Stroudsburg, PA, USA: ACL, 2005:133-140.

共引文献48

同被引文献259

引证文献27

二级引证文献134

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部