摘要
针对句子的倾向性进行判断,采用SentiWordNet构建中文倾向性词表,通过剔除停用词等降低句子向量的维数,以此来提高句子向量化速度,然后利用支持向量机分类器进行句子倾向性判断,最后提出两种新的置信度计量方法对倾向性句子进行排序.实验结果表明,构建的识别系统在一定程度上能有效识别倾向性句子.
To judge the sentence' s opinion, we adopt SentiWordNet to build Chinese opinioned vocabulary, eliminate stop words to reduce the dimension of the sentence vector, use this method to enhance the speed of structuring sentence' s vector, apply support vector machine (SVM) classifier to determine whether the sentence having opinion, propose two new methods of measuring confidence and sort the opinioned sentences by the confidence score. The experimental result shows that our system can effectively detect opinion sentences in some extent.
出处
《福州大学学报(自然科学版)》
CAS
CSCD
北大核心
2010年第4期504-508,共5页
Journal of Fuzhou University(Natural Science Edition)
基金
福建省科技创新平台资助项目(2009J1007)
福建省教育厅科研资助项目(JA04161)
福建省发展改革委员会资助项目(SX2004-29)
关键词
倾向性
中文句子
识别
置信度
opinion
Chinese sentence
detection
confidence