期刊文献+

情感分类中基于词性嵌入的特征权重计算方法 被引量:5

Feature weighting method based on part of speech embedding for sentiment classification
下载PDF
导出
摘要 在文本情感分类中,传统的特征表达通常忽略了语言知识的重要性。提出了一种基于词性嵌入的特征权重计算方法,通过构造一种特征嵌入模式将名词、动词、形容词、副词四种词性对情感分类的贡献度嵌入到传统的TFIDF(Term Frequency-Inverse Document Frequency)权值中。其中,词性的情感贡献度通过粒子群优化算法获得。实验采用支持向量机完成分类,并对比了不同知识的嵌入情况,包括词性、情感词及词性和情感词的组合。结果表明基于词性嵌入的方法分类性能最优,可以显著提高中文文本情感分类的准确率。 The importance of language knowledge is always neglected in traditional feature representation for text sentiment classification. This paper proposes a novel feature weighting approach based on part of speech embedding, in which a feature embedding schema is constructed such that the contribution of noun, verb, adjective and adverb can be embedded into the traditional TF-IDF(Term Frequency-Inverse Document Frequency)weighting, where the best contribution value is obtained by particle swarm optimization algorithm. The support vector machine classifier is used for the Chinese text sentiment classification. In the experiment, the performance of different knowledge is also compared, such as part of speech, sentiment words and their combination. The experimental results show that the proposed method achieves the best classification performance.
出处 《计算机工程与应用》 CSCD 北大核心 2017年第22期121-125,共5页 Computer Engineering and Applications
基金 国家自然科学基金(No.61272315 No.11391240180) 浙江省自然科学基金(No.LY14F020041 No.LY15A020003)
关键词 词性嵌入 特征权重 情感分类 粒子群优化 partofspeechembedding featureweighting sentimentclassification particleswarmoptimization
  • 相关文献

参考文献8

二级参考文献106

  • 1陈欣.模糊层次分析法在方案优选方面的应用[J].计算机工程与设计,2004,25(10):1847-1849. 被引量:108
  • 2胡旺,李志蜀.一种更简化而高效的粒子群优化算法[J].软件学报,2007,18(4):861-868. 被引量:331
  • 3Bruce R, Wiebe J. Recognizing subjectivity: a case study in manual tagging [ J ]. Natural Language Enginneering, 1999,5 ( 2 ) : 1-16.
  • 4Tumey P D, Littman M L. Measuring praise and criticism: inference semantic orientation from association[J]. ACM Transaction on Information Systems, 2003, 21 ( 4 ) : 315-346.
  • 5Kennedy A, Inkpen D. Sentiment classification of movie reviews using contextual valence shifters [ J ]. Computational Intelligence, 2006,22 ( 2 ) : 110-125.
  • 6申杰.WEB舆情观点挖掘关键技术研究[D].四川:电子科技大学,2009.
  • 7James Auen.Natural Language Understandin[M].The Benjamin/Cummings Publishing Company, 1991-05.
  • 8Apte C,Damerau F J,Weiss S M.Automated Learning of Decision Rules for Text Categorization[J].ACM Trans On Inform Syst,12(3): 233-251.
  • 9Salton G,Buckley B.Term-weighting Approaches in Automatic Text Retrieval[J].Information Processing and Management, 1998 ; 24(5 ) :513 -523.
  • 10Larkey L S.A Patent Search and Classification System[C].In:proceedings of DL-99,4th ACM Conference on Digital Libraries Berkeley,CA,1999:179-187.

共引文献374

同被引文献48

引证文献5

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部