摘要
随着WEB2.0的迅猛发展,汉语情感倾向分类在许多不同的领域取得了广泛的应用。同时,文本情感倾向分类也是当前学术界的热门课题之一。本文旨在探究一种汉语情感倾向分类方法,通过构造一种自动分类系统,对商品评价信息进行正类、负类和中立的三分类。本文采用一个两级分类系统实现三分类,首先第一级将文本分为极性和中立两部分,然后第二级再将极性文本分为正类和负类。在文本分类方法方面,采用了基于情感词、基于规则和TSVM等不同的方法。本文最后组织了分类实验对系统效果加以验证,并对实验结果进行了分析。
Along with the rapid development of WEB2.0, automatic sentiment orientation classification gets globally used in various areas. Also, it is one of the most popular topics in the academia. The goal of this research is to construct an automatic classifier and identify the sentimental orientation of merchandise comments, dividing these texts into three categories: positive, negative and neutral. This paper applies a two-level classification system to accomplish the goal. First, the level-one system divides the corpus into polarity part, which consists of positive and negative texts, and neutral part. Then, the level-two system divides the polarity corpus into positive part and negative part. As for the method to texts classification, different methods as sentimental word based, rule based and TSVM are used. At the end of this paper, several experiments are conducted to verify the effect of the system, followed by the analysis of the experiment results.
出处
《软件》
2013年第11期73-76,共4页
Software
关键词
汉语情感倾向自动分类
基于情感词
基于规则
TSVM
Chinese emotional tendencies automatic classification
based on emotion word
based on rules
TSVM