期刊文献+

基于多特征融合的评论文本情感分析 被引量:7

Comment Text Sentiment Analysis Based on Multi-feature Fusion
下载PDF
导出
摘要 评论文本情感分析现已成为自然语言处理的重要研究领域。针对评论文本语法不规则、特征稀疏的问题,设计了一种针对评论文本的多特征融合的情感分类算法。首先提出一种改进的情感规则方法;然后从规则方法中提取出有效信息,将每一个情感信息量扩展为多维向量,再融合一元词特征、句法特征以及依存词语搭配特征构成向量空间,形成更有效的融合特征模板;最后利用信息增益理论进行特征选择,作为支持向量机的输入对评论文本进行识别和分类,实现了机器学习方法与规则方法相融合。以中文酒店评论数据集作为语料进行实验,结果表明该方法能让机器学习算法更加充分地利用规则特征,相比单纯地使用规则方法或机器学习方法,能够达到更好的分类性能,进一步提高分类精度。 The analysis on text emotional inclination has received much attention from natural language processing filed in recent years.In order to solve the problem of grammatical irregularity and feature sparsity,we design an emotional classification approach based on multi-feature fusion for text sentiment.At first,an improved method based on emotional rules is proposed.Then the effective information extracted from the ruled-based method is extended to a multidimensional vector and an effective integration feature set is obtained by adding various rule-based features to the basic feature set after expanding and converting them.Finally,the information gain theory is used to select features as the input of SVM.Thus,a method via a combination of rule-based and machine learning method is realized.We use the Chinese hotel reviews data set as the corpus for the experiment which shows that this method can make machine learning algorithm more full use of the rule features and it works better than simply using rule-based method or machine learning method.
作者 龚安 费凡 GONG An;FEI Fan(School of Computer & Communication Engineering,China University of Petroleum,Qingdao 266580,Chin)
出处 《计算机技术与发展》 2018年第8期91-95,共5页 Computer Technology and Development
基金 国家油气重大专项(2017ZX05013-001)
关键词 文本情感分析 多特征融合 机器学习 情感规则 text sentiment analysis multi-feature fusion machine learning emotional rules
  • 相关文献

参考文献3

二级参考文献42

  • 1朱嫣岚,闵锦,周雅倩,黄萱菁,吴立德.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20. 被引量:326
  • 2H Y Tan. Chinese place automatic recognition research. In: C N Huang, Z D Dong, eds. Proc of Computational Language.Beijing: Tsinghua University Press, 1999
  • 3Zhang Huaping, Liu Qun, Zhang Hao, et al. Automatic recognition of Chinese unknown words recognition. First SIGHAN Workshop Attached with the 19th COLING, Taipei, 2002
  • 4S R Ye, T S Chua, J M Liu. An agent-based approach to Chinese named entity recognition. The 19th Int'l Conf on Computational Linguistics, Taipei, 2002
  • 5J Sun, J F Gao, L Zhang, et al. Chinese named entity identification using class-based language model. The 19th Int'l Conf on Computational Linguistics, Taipei, 2002
  • 6Lawrence R Rabiner. A tutorial on hidden Markov models and selected applications in speech recognition. Proc of IEEE, 1989,77(2): 257~286
  • 7Shai Fine, Yoram Singer, Naftali Tishby. The hierarchical hidden Markov model: Analysis and applications. Machine Learning,1998, 32(1): 41~62
  • 8Richard Sproat, Thomas Emerson. The first international Chinese word segmentation bakeoff. The First SIGHAN Workshop Attached with the ACL2003, Sapporo, Japan, 2003. 133~143
  • 9J Hockenmaier, C Brew. Error-driven learning of Chinese word segmentation. In: J Guo, K T Lua, J Xu, eds. The 12th Pacific Conf on Language and Information, Singapore, 1998
  • 10Andi Wu, Zixin Jiang. Word segmentation in sentence analysis.1998 Int'l Conf on Chinese Information Processing, Beijing, 1998

共引文献738

同被引文献94

引证文献7

二级引证文献56

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部