摘要
基于Lasso回归和支持向量机分类器,首先利用Lasso回归具有变量筛选的特点,过滤部分不重要的特征,然后利用支持向量机分类器做情感提取.在某化妆品品牌的评论数据实验中,利用基础情感词典和领域情感词典构建待选择高维特征集,通过对比特征选择前后的G-means,精确度和召回率等,均取得显著效果.
The characteristics of textual emotion analysis are usually of high dimension and sparseness.Lasso has a simple and efficient trait in feature selection.This paper introduces the Lasso regression into the unbalanced emotion analysis and achieves remarkable results.Applying emotional analysis in e-commerce plays an important role in improving product quality and improving service,which attracts many researchers and has high research value.In fact,the number of positive comments on e-commerce data generally exceeds the number of bad reviews.If the feature selection is not reasonable,it is easy to ignore the bad reviews,and the bad reviews are the key to analyzing the problems.Based on the Lasso regression and SVM classifier,this paper first uses Lasso regression to filter the features that have variable screening,filters some unimportant features,and then makes use of SVM classifier to extract the emotion.In a cosmetic brand's reviewing data experiment,the basic emotion dictionary and domain sentiment lexicon are used to construct the high-dimensional feature set to be selected,and the significant effects are achieved by comparing G-means before and after feature selection,accuracy and recall.
作者
万会芳
闵兰
舒畅
WAN Hui-fang;MIN Lan;SHU Chang(College of Management Science,Chengdu University of Technology,Chengdu 610059,China)
出处
《西南师范大学学报(自然科学版)》
CAS
北大核心
2018年第9期74-78,共5页
Journal of Southwest China Normal University(Natural Science Edition)
关键词
不平衡情感分类
特征选择
Lasso
imbalanced sentiments classification
feature selection
Lasso