期刊文献+

基于评价对象类别的跨领域情感分类方法研究 被引量:3

Cross-domain Sentiment Classification with Opinion Target Categorization
下载PDF
导出
摘要 情感分类任务具有领域相关性,即使用某一个领域的标注样本训练出的分类模型在对其他领域样本进行分类时性能表现往往会非常差。情感分类的跨领域学习旨在减少跨领域的性能损失。提出一种基于评价对象类别的跨领域学习方法。首先,将评价对象分为4大类:整体、硬件、软件和服务;然后,人工标注源领域中属于以上4类评价对象的句子,并构建评价对象类别分类器;最后,将不同的评价对象类别当作不同的视图,进而使用协同学习(Co-trai-ning)进行跨领域情感分类。实验结果表明,提出的方法有效地改进了跨领域学习性能。 The task of sentiment classification is domain-specific,i.e.,a classifier learning from the annotated data from a domain often performs dramatically badly on the data from a different domain.We presented a novel approach for cross-domain sentiment classification.Specifically,we first generalized four general categories of the opinion targets:overall,software,hardware,and service and classified all sentences into these categories.Then,some sentences with the category information were annotated in the source domain and a classifier for opinion target categorization was developed with the annotated data to classify all the sentences in both the source and target domain.Third,the four categories of opinion targets were considered as four different views which are employed in a standard co-training algorithm to perform cross-domain sentiment classification.Experimental results across several domains of Chinese reviews demonstrate the effectiveness of the proposed approach.
出处 《计算机科学》 CSCD 北大核心 2013年第1期229-232,250,共5页 Computer Science
基金 国家自然科学基金(60970056 61070123 61003155) 高等学校博士学科点专项科研基金(20093201110006) 模式识别国家重点实验室开发课题基金 江苏省自然科学基金(BK2011282) 江苏省高校自然科学基金重大研究项目(11KIJ520003) 教育部科技发展中心网络时代的科技论文快速共享专项研究资助
关键词 评价对象 协同训练 最大熵 跨领域情感分类 Opinion target Co-training Maximum entropy Cross-domain sentiment classification
  • 相关文献

参考文献12

  • 1Aue A,Gamon M. Customizing Sentiment Classifiers to New Domains:A Case Study[A].2005.
  • 2吴琼,谭松波,张刚,段洣毅,程学旗.跨领域倾向性分析相关技术研究[J].中文信息学报,2010,24(1):77-83. 被引量:10
  • 3Blitzer J,Dredze M,Pereira F. Biographies,Bollywood,Boomboxes and Blenders:Domain Adaptation for Sentiment Classification[A].2007.432-439.
  • 4Pan S J,Ni X C,Sun J T. Cross-domain Sentiment Classification via Spectral Feature Alignment[A].2010.751-760.
  • 5Li S,Huang C,Zong C. Multi-domain Sentiment Classification with Classifier Combination[J].Joumal of Computer Science and Technology (JCST),2011,(01):25-33.
  • 6侯锋,王传廷,李国辉.网络意见挖掘、摘要与检索研究综述[J].计算机科学,2009,36(7):15-19. 被引量:3
  • 7Hu M,Liu B. Mining and Summarizing Customer Reviews[A].2004.168-177.
  • 8Zhuang L,Jing F,Zhu X. Movie Review Mining and Surnmarization[A].2006.43-50.
  • 9Jakob N,Gurevych I. Extracting Opinion Targets in a Single and Cross-Domain Setting with Conditional Random Fields[A].2010.1035-1045.
  • 10宗成庆.统计自然语言处理[M]北京:清华大学出版社,2008341.

二级参考文献25

  • 1朱嫣岚,闵锦,周雅倩,黄萱菁,吴立德.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20. 被引量:326
  • 2赵军,许洪波,黄萱菁,谭松波,刘康,张奇.中文倾向性分析评测技术报告[C]//第一届中文倾向性分析评测会议(The First Chinese Opinion Analysis Evaluation).COAE,2008.
  • 3Weifu Du, Songbo Tan. An Iterative Reinforcement Approach for Fine-Grained Opinion Mining[C]//Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Boulder, Colorado, 2009: 486-493.
  • 4Huifeng Tang, Songbo Tan and Xueqi Cheng. A Survey on Sentiment Detection of Reviews. Expert Systems With Applications[J]. Elsevier. 2009, 36 (7) : 10760-10773.
  • 5Chang CC, Lin CJ. LIBSVM: a library for supportvector machines. 2001. Software available at http:// www. csie. ntu. edu. tw/-cjlin/libsvm.
  • 6Songbo Tan, Xueqi Cheng, Moustafa M. Ghanem, Bin Wang, Hongbo Xu. A Novel Refinement Approach for Text Categorization[C]//Proceedings of the 14^th ACM international conference on Information and knowledge management. Bremen, Germany, 2005: 469-476.
  • 7Songbo Tan. An Effective Refinement Strategy for KNN Text Classifier. Expert Systems With Applications[J]. Elsevier. 2006, 30(2): 290-298.
  • 8Tan S. B. Neighbor-weighted K nearest neighbor for unbalanced text corpus[J]. Expert Systems with Applications. 2005, 28: 667-671.
  • 9John Blitzer, Mark Dredze, Fernando Pereira. Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classifieation [C]// Proceedings of the 45^th Annual Meeting of the Associ ation of Computational Linguistics. Prague. 2007: 440-447.
  • 10Songbo Tan, Xueqi Cheng, Yuefen Wang and Hongbo Xu. Adapting Naive Bayes to Domain Adaptation for Sentiment Analysis [C]//31^st European Conference on Information Retrieval. Springer Berlin: Heidelberg, 2009: 337-349.

共引文献11

同被引文献25

  • 1Pang B, Lee L. Opinion Mining and Sentiment Analysis [ J ]. Foundations and Trends in Information Retrieval, 2008, 2( 1/2 ) : 1 - 135.
  • 2Blitzer J, Dredze M, Pereira F. Biographies, Bollywood, Boom - boxes and Blenders: Domain Adaptation for Sentiment Classifica- tion [ C ]. In: Proceedings of the 45tit Annual Meeting of the Associ- ation for Computational Linguistics. 2007:440 - 447.
  • 3Tan S B, Cheng X Q, Wang Y F, et al. Adapting Naive Bayes toomain Adaptation for Sentiment Analysis [ C ]. in: Proceedings of the 31st European Conference on IR Research on Advances in Infor- mation Retrieval. Berlin, Heidelberg: Springer - Verlag, 2009 : 337 - 349.
  • 4Pan S J, Ni X C, Sun J T, et al. Cross - domain Sentiment Classi- fication via Spectral Feature Alignment [ C ]. In: Proceedings of the 19th International Conference on World Wide Web. New York, NY, USA: ACM,2010:751 -760.
  • 5Tan S B, Wu G W, Tang H F, et al. A Novel Scheme for Domain -transfer Problem in the Context of Sentiment Analysis[ C]. In: Proceedings of the 16th ACM Conference on Information and Knowl- edge Management. New York, NY, USA: ACM, 2007:979 - 982.
  • 6Chung F R K. Spectral Graph Theory [ M ]. American Mathemati- cal Society, 1997.
  • 7李新福,赵蕾蕾,何海斌,李芳.使用Logistic回归模型进行中文文本分类[J].计算机工程与应用,2009,45(14):152-154. 被引量:10
  • 8赵妍妍,秦兵,刘挺.文本情感分析[J].软件学报,2010,21(8):1834-1848. 被引量:537
  • 9黄贤立.基于典型相关分析的多视图跨领域情感分类[J].计算机工程,2010,36(24):186-188. 被引量:6
  • 10吴琼,谭松波,许洪波,段洣毅,程学旗.基于随机游走模型的跨领域倾向性分析研究[J].计算机研究与发展,2010,47(12):2123-2131. 被引量:11

引证文献3

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部