期刊文献+

基于双重选择策略的跨领域情感倾向性分析 被引量:5

Cross-domain Sentiment Analysis Based on Double Selection Strategy
下载PDF
导出
摘要 情感倾向性分析旨在识别评论中隐含的情感信息,在产品声誉分析、舆情监控、个性推荐等方面具有广阔的应用前景。在评测消费者对新发布产品的态度时,本产品领域中可供参考的已分类评论数据往往较少,而其他相关领域可能存在大量的已分类的评论数据,利用其他产品已标注的评论数据对新产品进行情感倾向性分析,属于跨领域的情感分类问题。针对这一问题,本文引入迁移学习机制,将经典迁移学习TrAdaBoost算法的样本迁移机制应用于情感倾向性分析,并针对积极类和消极类分类精度不均衡问题提出了改进策略,首先根据评论样本权重进行第一次选择,其次结合分类置信度对评论样本进行第二次选择。实验结果表明,在整体分类精度有所提高的前提下,改进算法的优势在于均衡了积极类和消极类的分类精度,使得分类结果更具实际参考价值。 Sentiment analysis aims to identify the emotional information contained in the comments. It has wide application prospects in reputation analysis, public opinion analysis and personalized recommendation. When we want to get the consumers' sentiments about a new designed or published product, there may be lack of the labeled comments in this domain, while we have a large number of labeled comments in other certain related domains. Analyzing the sentiments of a new product using the labeled data of other products is the task of cross-domain sentiment analysis. This paper applies TrAdaBoost, a state-of-the-art transfer learning algorithm, to the sentiment transfer and provides an improvement strategy for reducing the imbalance between the positive and negative classifications. First, according to the sample weight, we select the samples. Then we combine the classification confidence for second selection. Experimental results show that, the overall classification accuracy increased, and the modified algorithm balances the classification accuracy of negative class and positive class, which has more practical values in the real world applications.
出处 《情报学报》 CSSCI 北大核心 2012年第11期1202-1209,共8页 Journal of the China Society for Scientific and Technical Information
基金 国家自然科学基金资助项目(71001016,71031002)
关键词 情感倾向性 跨领域 迁移学习 双重选择 sentiment analysis, cross-domain, transfer learning, double selection strategy
  • 相关文献

参考文献16

二级参考文献61

  • 1陈欣.模糊层次分析法在方案优选方面的应用[J].计算机工程与设计,2004,25(10):1847-1849. 被引量:108
  • 2顾益军,樊孝忠,王建华,汪涛,黄维金.中文停用词表的自动选取[J].北京理工大学学报,2005,25(4):337-340. 被引量:34
  • 3娄德成,姚天昉.汉语句子语义极性分析和观点抽取方法的研究[J].计算机应用,2006,26(11):2622-2625. 被引量:64
  • 4徐琳宏,林鸿飞,杨志豪.基于语义理解的文本倾向性识别机制[J].中文信息学报,2007,21(1):96-100. 被引量:119
  • 5谭松波.中文情感挖掘语料--chnsenticorp[EB/OL].[2010-05-01].http://www.searchforum.org.cn/tansongbo/corpus-senti.htm.
  • 6A-M Popescu, O Etzioni. Extracting Product Features and Opinions from Reviews [ A]. Proceedings of HLT-EMNLP 2005 [ C], Vancouver Canada,2005:339-346.
  • 7Yuen Raymond W M, Chart Terence Y W, Lai Tom B Yet al. Morpheme-Based Derivation of Bipolar Semantic Orientation of Chinese Words [ A ]. Proceedings of the 20th International Conference on Computational Linguistics ( COLING- 2004 ), 2004 : 1008-1014.
  • 8Peter D Tumey, Michael L Littman. Measuring Praise and Criticism : Inference of Semantic Orientation from Association [J]. ACM Transactions on Information Systems, 2003, 21 ( 4 ) : 315 - 346.
  • 9WangBo. Extracting Morphemes to Improve Sentiment Analysis [A]. Proceedings of IEEE YC-ICT 2010 [ C ], Beijing, November, 2010.
  • 10PChaovalit, L Zhou. Movie Review Mining: A Comparison Between Supervised and Unsupervised Classification Approaches [ A]. IEEE. Proceedings of The 38th Annual Hawaii International Conference on System Sciences, USA, 2005 : 112-121.

共引文献50

同被引文献40

  • 1Pang B, Lee L. Opinion Mining and Sentiment Analysis [ J ]. Foundations and Trends in Information Retrieval, 2008, 2( 1/2 ) : 1 - 135.
  • 2Blitzer J, Dredze M, Pereira F. Biographies, Bollywood, Boom - boxes and Blenders: Domain Adaptation for Sentiment Classifica- tion [ C ]. In: Proceedings of the 45tit Annual Meeting of the Associ- ation for Computational Linguistics. 2007:440 - 447.
  • 3Tan S B, Cheng X Q, Wang Y F, et al. Adapting Naive Bayes toomain Adaptation for Sentiment Analysis [ C ]. in: Proceedings of the 31st European Conference on IR Research on Advances in Infor- mation Retrieval. Berlin, Heidelberg: Springer - Verlag, 2009 : 337 - 349.
  • 4Pan S J, Ni X C, Sun J T, et al. Cross - domain Sentiment Classi- fication via Spectral Feature Alignment [ C ]. In: Proceedings of the 19th International Conference on World Wide Web. New York, NY, USA: ACM,2010:751 -760.
  • 5Tan S B, Wu G W, Tang H F, et al. A Novel Scheme for Domain -transfer Problem in the Context of Sentiment Analysis[ C]. In: Proceedings of the 16th ACM Conference on Information and Knowl- edge Management. New York, NY, USA: ACM, 2007:979 - 982.
  • 6Chung F R K. Spectral Graph Theory [ M ]. American Mathemati- cal Society, 1997.
  • 7徐军,丁宇新,王晓龙.使用机器学习方法进行新闻的情感自动分类[J].中文信息学报,2007,21(6):95-100. 被引量:107
  • 8姚天昉,程希文,徐飞玉,汉思·乌思克尔特,王睿.文本意见挖掘综述[J].中文信息学报,2008,22(3):71-80. 被引量:106
  • 9熊德兰,程菊明,田胜利.基于HowNet的句子褒贬倾向性研究[J].计算机工程与应用,2008,44(22):143-145. 被引量:31
  • 10李新福,赵蕾蕾,何海斌,李芳.使用Logistic回归模型进行中文文本分类[J].计算机工程与应用,2009,45(14):152-154. 被引量:10

引证文献5

二级引证文献52

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部