
Feature Selection in Imbalanced Sentiment Classification: A Method Using Lasso-Lars (cited by: 2)
Abstract: Features in text sentiment analysis are usually high-dimensional and sparse, and Lasso offers a simple and efficient means of feature selection. This paper introduces Lasso regression into imbalanced sentiment analysis. Sentiment analysis of e-commerce reviews helps improve product quality and service, and it has attracted considerable research attention. In e-commerce data, positive reviews generally outnumber negative ones. If features are selected poorly, the negative reviews are easily overlooked, even though they are the key to diagnosing problems. The proposed method combines Lasso regression with a support vector machine (SVM) classifier: Lasso regression, which performs variable selection, first filters out some unimportant features, and the SVM classifier then performs sentiment extraction. In an experiment on review data for a cosmetics brand, a basic sentiment dictionary and a domain sentiment lexicon are used to construct the high-dimensional candidate feature set. Comparing G-means, accuracy, and recall before and after feature selection shows a marked improvement on each measure.
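The abstract names G-means as an evaluation measure but does not define it. The sketch below assumes the definition commonly used for imbalanced binary classification, the geometric mean of the recall on each class (sensitivity and specificity); the paper's exact formula may differ. The function name `g_mean` and the toy labels are hypothetical and are not taken from the paper's data.

```python
import math


def g_mean(y_true, y_pred, positive=1):
    """Geometric mean of sensitivity and specificity for a binary task.

    Assumed definition: sqrt(recall on positive class * recall on negative class).
    """
    tp = fn = tn = fp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1
            else:
                fn += 1
        else:
            if pred == positive:
                fp += 1
            else:
                tn += 1
    # Guard against a class that is absent from y_true.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return math.sqrt(sensitivity * specificity)


def accuracy(y_true, y_pred):
    """Fraction of labels predicted correctly."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


# Hypothetical toy data: 8 positive reviews (1) and 2 negative reviews (0).
y_true = [1] * 8 + [0] * 2

# A classifier that ignores the minority class entirely.
always_positive = [1] * 10

# A classifier that misses one positive and one negative review.
mixed = [1] * 7 + [0] + [0, 1]

print(accuracy(y_true, always_positive), g_mean(y_true, always_positive))
print(accuracy(y_true, mixed), g_mean(y_true, mixed))
```

Both toy classifiers reach the same accuracy, yet the one that never predicts a negative review has a G-mean of zero. This is the behaviour that makes the measure a natural complement to accuracy and recall when negative reviews are the minority class.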
Authors: WAN Hui-fang (万会芳), MIN Lan (闵兰), SHU Chang (舒畅) (College of Management Science, Chengdu University of Technology, Chengdu 610059, China)
Source: Journal of Southwest China Normal University (Natural Science Edition) (《西南师范大学学报(自然科学版)》), indexed in CAS and the Peking University Core Journals list, 2018, No. 9, pp. 74-78 (5 pages)
Keywords: imbalanced sentiment classification; feature selection; Lasso
