一种基于基准词的跨领域文本倾向性计算方法

A kind of domain transfer sentiment analysis based on standard word list

导出

摘要常用的基于特征表达的跨领域文本倾向性分析的基本思想是通过统计的方法对源领域和目标领域的数据进行特征对齐,再根据特征间关联构建目标领域的分类器。从词汇倾向性计算入手,提出了一种基于领域基准词表的跨领域倾向性计算的方法。与传统的词汇倾向性计算方法不同的是,该方法在构建基准词表时,同时考虑词性和领域信息,在计算倾向性时,根据词汇当前的词性和领域信息采用相应的领域基准词表进行计算。实验结果表明:与传统的跨领域倾向性分析算法相比,虽然该方法在准确率上的优势不明显,但可以不依赖源领域和目标领域文本数据;与传统的基于基准词表的倾向性计算方法相比,该方法能够大幅提高倾向性分析的准确率。 M ost of the traditional domain transfer sentiment classification which based on feature express are based on feature alignment,which extract from source domain and aim domain. The classification for the aim domain are established base on these features relation. In this paper,based on word sentiment calculated,an approach for cross-domain sentiment classification based on domain standard word list is proposed. Different from other standard word list based classification algorithm,this standard word list considered not only the word part of speech,but also the domain information. The word sentiment is calculated,by different domain and part of speech standard word list,according to its domain and part of speech. The experiment shows: 1,Compared with the traditional transfer domain algorithm,the source and aim domain texts are not need,although there is no obvious advantage on accuracy; 2,Compared with the sentiment classification based on standard word list,the accuracy is clearly better.

作者沙芸李晓磊张世博

机构地区北京石油化工学院信息工程学院

出处《山东大学学报（理学版）》 CAS CSCD 北大核心 2016年第7期59-65,共7页 Journal of Shandong University(Natural Science)

基金北京市青年拔尖人才资助项目(13031821005) 北京市教育委员会科技计划面上项目(KM20121001700613031821005)

关键词中文信息处理跨领域倾向性分析词汇倾向性计算基准词表 Chinese information processing cross-domain sentiment classification word orientation standard word list

分类号 TP391.1 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1Vasileios Hatzivassiloglou,Kathleen R McKcown.Predicting the semantic orientation of adjectives[].Proceedings of the Thirty-Fifth Annual Meeting of the Association for Computational Linguistics and the Eighth Conference of the European Chapter of the Association for Computational Linguistics.1997

1任小燕.中文情感分析综述[J].科技信息,2011(31):202-203. 被引量：2
2邓箴.一种基于本体的词汇语义倾向计算[J].中小企业管理与科技,2012(13):284-285.
3白海燕,朱礼军.关联数据的自动关联构建研究[J].现代图书情报技术,2010(2):44-49. 被引量：36
4郭峰.基于松弛标记法的股票市场词汇极性研究[J].计算机应用与软件,2010,27(10):162-164. 被引量：1
5李根,李文辉.基于混合蛙跳算法的长时间跨度人脸识别[J].东北大学学报（自然科学版）,2014,35(7):955-959.
6宋乐,何婷婷,王倩,闻彬.极性相似度计算在词汇倾向性识别中的应用[J].中文信息学报,2010,24(4):63-67. 被引量：5
7魏晓聪,林鸿飞.面向迁移学习的文本特征对齐算法[J].计算机工程,2017,34(2):215-219. 被引量：7
8赵玉聪,钟志农,景宁,吴烨.多维实体关联信息综合处理平台[J].计算机应用,2016,36(A01):213-216. 被引量：2
9汤毓,李尚平,李冰.基于大型件特征对齐的误差分析及模型重构[J].制造业自动化,2014,36(2):99-101.
10周全,魏昕,陈建新,郑宝玉.一种基于稠密SIFT特征对齐的稀疏表达人脸识别算法[J].电子与信息学报,2015,37(8):1913-1919. 被引量：10

山东大学学报（理学版）

2016年第7期

浏览历史

内容加载中请稍等...

一种基于基准词的跨领域文本倾向性计算方法

参考文献1

相关作者

相关机构

相关主题

浏览历史