期刊文献+

最小费用最大流跨领域情感分类框架 被引量:1

Minimum-cost and Maximum-flow Framework for Cross-domain Sentiment Classification
下载PDF
导出
摘要 在对文本的情感极性进行分类的过程中,针对标注训练数据与待判断情感极性的数据来自不同领域,特征分布差异较大,导致传统监督学习方法分类准确率大幅下降的问题,提出基于最小费用最大流框架的跨领域情感分类方法.该方法将跨领域情感分类问题转化为最大化领域间传播信息量和最小化跨领域特征拟合损失量的双重最优化问题,并建立连接源领域和目标领域的费用容量网络,将信息量和损失量分别看作网络中的容量和费用权值,通过最小费用最大流框架解决上述双重最优化问题,调配训练数据特征权值.以更为直观的模型描述领域间的映射关系.实验发现,方法能有效提高跨领域情感分类的准确率. There are some test data,and we need to determine their sentiment polarity,in case that the training data and the test data don't belong to the same domain,because of the huge difference of feature distribution,the performance of sentiment classification de- creases sharply when transferred from one domain to another by traditional supervised classification methods. A new cross-domain sen- timent classification method based on framework of Minimum-Cost and Maximum-Flow is proposed. We redefine the problem of cross-domain sentiment classification as a double optimization problem, the goal is maximizing the information transferred from source-domain to target-domain meanwhile minimum the loss of cross-domain feature fitting, then we build the cost-volume network to link the source-domain and the target-domain, and define the volume and cost with information and loss amount, so we can solve the double optimization problem with Minimum-Cost and Maximum-Flow framework as above-mentioned. Finally, we adjust the feature weights of train data and describe the mapping relationship between the domains more intuitively. The experiment results indicate that the proposed method can improve the performance of cross-domain sentiment classification effectively.
出处 《小型微型计算机系统》 CSCD 北大核心 2017年第1期49-55,共7页 Journal of Chinese Computer Systems
基金 国家自然科学基金项目(81360230 81560296)资助
关键词 跨领域 情感分类 双重最优化问题 费用流量网络 最小费用最大流 领域映射关系 cross domain sentiment classification double optimization problem cost-volume network minimum-cost and maximum-flow mapping relationship between the domains
  • 相关文献

参考文献5

二级参考文献277

  • 1徐琳宏,林鸿飞,杨志豪.基于语义理解的文本倾向性识别机制[J].中文信息学报,2007,21(1):96-100. 被引量:119
  • 2Pang B,Lee L,Vaithyanathan S.Thumbs up? Sentiment classification using machine learning techniques[C]//Proc of EMNLP 2002.Morristown,NJ,USA:ACL,2002:79-86.
  • 3Ku L,Liang Y,Chen H.Opinion extraction,summarization and tracking in news and blog corpora[C]//Proc of AAAI 2006.Boston,Massachusetts:ACL,2006.
  • 4赵军,许洪波,黄萱菁,等.中文倾向性分析评测技术报告[R].北京:中文信息学会,2008.
  • 5Aue A,Gamon M.Customizing sentiment classifiers to new domains:A case study[C]//Proc of RANLP 2005.Borovets,Bulgaria:RANLP,2005.
  • 6Blitzer J,Dredze M,Pereira F.Biographies,bollywood,boom-boxes and blenders:Domain adaptation for sentiment classification[C]//Proc of ACL 2007.Prague,Czech Republic:ACL,2007:440-447.
  • 7Tan S,Wang Y,Wu G,et al.Using unlabeled data to handle domain-transfer problem of semantic detection[C]//Proc of SAC 2008.New York:ACM,2008:896-903.
  • 8Tan S,Cheng X,Wang Y,et al.Adapting nave Bayes to domain adaptation for sentiment analysis[C]//Proc of ECIR 2009.Berlin:Springer,2009:337-349.
  • 9Gamon M,Aue A.Automatic identification of sentiment vocabulary:Exploiting low association with known sentiment terms[C]//Proc of the ACL Workshop on Feature Engineering for Machine Learning in NLP.Ann Arbor,Michigan:ACL,2005:57-64.
  • 10Brin S,Page L,Motwami R,et al.The PageRank citation ranking:Bringing order to the Web[R].Stanford,CA:Computer Science Department,Stanford University,1999.

共引文献530

同被引文献8

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部