
A Method for Building a Chinese Sentiment Lexicon for Text Sentiment Analysis (面向文本情感分析的中文情感词典构建方法) Cited by: 41
Abstract: A method is proposed for building a Chinese sentiment lexicon based on HowNet and SentiWordNet. Each word is automatically decomposed into multiple sememes (semantic units), from which its sentiment intensity is computed, and a lexicon proofreading technique is then used to optimize that intensity value. The resulting lexicon is applied to a text sentiment analysis task in which a support vector machine is used to build the sentiment classifier. Experimental results show that the lexicon outperforms a general polarity sentiment lexicon and provides an effective lexical resource for sentiment analysis research.
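Source: Journal of Shandong University (Engineering Science) (《山东大学学报(工学版)》), indexed in CAS and the Peking University Core Journals list (北大核心), 2013, No. 6, pp. 27-33 (7 pages)
Funding: National Social Science Fund of China (12BYY045); Ministry of Education Humanities and Social Sciences Research Youth Project (10YJCZH247); Guangdong Provincial Science and Technology Plan Project (2010B031000014)
Keywords: sentiment lexicon; sentiment intensity; support vector machine; sentiment analysis; Chinese text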
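
The abstract describes computing a word's sentiment intensity from the sememes it decomposes into, but it does not state the formula. The following is a minimal Python sketch under stated assumptions: the sememe scores, the word-to-sememe mapping, and the use of a plain average are all hypothetical and stand in for values that the paper derives from HowNet and SentiWordNet. The function names `word_intensity` and `document_features` are illustrative and do not come from the paper.

```python
from statistics import mean

# Hypothetical sememe-level scores in [-1, 1]. In the paper these would be
# derived from HowNet and SentiWordNet; the numbers here are made up.
SEMEME_SCORES = {"good": 0.75, "happy": 0.5, "bad": -0.75, "damage": -0.5}

# Hypothetical decomposition of Chinese words into sememes.
WORD_SEMEMES = {
    "优秀": ["good"],
    "快乐": ["happy", "good"],
    "破坏": ["damage", "bad"],
}


def word_intensity(word):
    """Average the scores of the word's known sememes (assumed rule).

    Returns 0.0 when the word or all of its sememes are unknown.
    """
    scores = [
        SEMEME_SCORES[s]
        for s in WORD_SEMEMES.get(word, [])
        if s in SEMEME_SCORES
    ]
    return mean(scores) if scores else 0.0


def document_features(tokens):
    """Return (sum of positive intensities, sum of negative intensities)."""
    values = [word_intensity(t) for t in tokens]
    pos = sum(v for v in values if v > 0)
    neg = sum(v for v in values if v < 0)
    return pos, neg


if __name__ == "__main__":
    print(word_intensity("快乐"))
    print(document_features(["优秀", "破坏", "的"]))
```

Feature pairs of this kind could be passed to a support vector machine classifier, as the abstract mentions. The actual features and classifier settings used in the paper are not given on this page.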

References (23)

  • 1 ZHAO Yanyan, QIN Bing, LIU Ting. Text sentiment analysis[J]. Journal of Software (软件学报), 2010, 21(8): 1834-1848. Cited by: 541
  • 2 HATZIVASSILOGLOU V, MCKEOWN K R. Predicting the semantic orientation of adjectives[C]//Proceedings of the Eighth Conference on European Chapter of the Association for Computational Linguistics. Madrid: Association for Computational Linguistics, 1997: 174-181.
  • 3 WIEBE J M. Learning subjective adjectives from corpora[C]//Proceedings of the National Conference on Artificial Intelligence. Austin: AAAI Press, 2000: 735-740.
  • 4 RILOFF E, WIEBE J M, WILSON T. Learning subjective nouns using extraction pattern bootstrapping[C]//Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003. Edmonton, Canada: Association for Computational Linguistics, 2003: 25-32.
  • 5 BARONI M, VEGNADUZZO S. Identifying subjective adjectives through Web-based mutual information[C]//Proceedings of KONVENS 2004. Vienna: University of Vienna, 2004: 17-24.
  • 6 MOILANEN K, PULMAN S. The good, the bad, and the unknown: morphosyllabic sentiment tagging of unseen words[C]//Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers. Stroudsburg: Association for Computational Linguistics, 2008: 109-112.
  • 7 TURNEY P, LITTMAN M L. Measuring praise and criticism: inference of semantic orientation from association[J]. ACM Trans Information Systems, 2003, 21(4): 315-346.
  • 8 YANG A M, LIN J H, ZHOU Y M, et al. Research on building a Chinese sentiment lexicon based on SO-PMI[J]. Applied Mechanics and Materials, 2013, 263: 1688-1693.
  • 9 READ J. Recognising affect in text using pointwise-mutual information[D]. Brighton: University of Sussex, UK, 2004.
  • 10 MILLER G A, BECKWITH R, FELLBAUM C, et al. Introduction to WordNet: an on-line lexical database[J]. International Journal of Lexicography, 1990, 3(4): 235-244.

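Second-level references (36)

  • 1 ZHU Yanlan, MIN Jin, ZHOU Yaqian, HUANG Xuanjing, WU Lide. Semantic orientation computing based on HowNet[J]. Journal of Chinese Information Processing (中文信息学报), 2006, 20(1): 14-20. Cited by: 326
  • 2 LIN Chuanding. Emotion in socialist psychology: report at the founding meeting of the Chinese Society of Social Psychology (abstract)[J]. Science of Social Psychology (社会心理科学), 2006, 21(1): 37. Cited by: 15
  • 3 KU L-W, LO Y-S, CHEN H-H. Using polarity scores of words for sentence-level opinion extraction[C]//Proceedings of the 6th NTCIR-6 Workshop Meeting. Tokyo, Japan: [s.n.], 2007: 316-322.
  • 4 WANG Bingqing, ZHANG Shu, ZHANG Qi. Chinese sentiment word identification[C]//NCIRCS2008: The 4th National Conference on Information Retrieval and Content Security. Beijing: [publisher unknown], 2008: 63-69.
  • 5 LIU Qun, LI Sujian. Word semantic similarity computation based on HowNet[J]. Computational Linguistics and Chinese Language Processing (中文计算语言学), 2002, 17(2): 59-76.
  • 6 WANG Ke, ZHANG Chunliang, ZHU Muhua, et al. Subjectivity analysis of Chinese text based on a sentiment word lexicon[C]//NCIRCS2008: The 4th National Conference on Information Retrieval and Content Security. Beijing, 2008: 56-62.
  • 7 HowNet (知网)[EB/OL]. [2009-03-12]. http://www.keenage.com.
  • 8 TURNEY P D. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews[C]//Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics. Morristown, NJ, USA: Association for Computational Linguistics, 2002: 417-424.
  • 9 TAN Songbo. Chinese sentiment mining corpus: ChnSentiCorp[EB/OL]. (2008-12-19)[2009-03-12]. http://www.searchforum.org.cn/tansongbo/corpus-senti.htm.
  • 10 KAJI N, KITSUREGAWA M. Building lexicon for sentiment analysis from massive collection of HTML documents[C/OL]//EMNLP-CoNLL 2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. 2007: 1075-1083 [2009-03-08]. http://www.aclweb.org/anthology/D/D07/D07-1115.pdf.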
