期刊文献+

不规范文本的无监督观点句抽取 被引量:8

Unsupervised Subjective Sentence Extraction for Non-Standard Texts
下载PDF
导出
摘要 观点往往承载着文本的重要信息,观点句抽取技术旨在抽取文本中包含作者主观观点的句子,其应用越来越广泛。针对网络语言不规范的现象,文章提出了一种对不规范文本的无监督观点句抽取方法,该方法先对语料及其分词结果进行规范化处理,再通过基于词典和基于规则的方法自动构造训练样例,对SVM分类器进行训练,再使用分类器抽取观点句。使用该方法在人工标注的语料以及COAE2011电子产品语料上进行实验,取得了较好的效果。 Opinions often carry very important information of the texts. Subjective sentence extraction technology is designed to extract the sentences which contain the author's opinions. Nowadays its application is more and more broad. For the network language has become more and more nowstandard, this paper proposed an unsupervised subjective sentence extraction method. This method firstly improved the corpus and its segmentation results first, and then got the training samples automatically by using a lexicon-based method and a rule-based method. After the SVM classifier being trained, it is used to extract subieetive sentences. The proposed method has been evaluated on a manually annotated corpus and the electronic products corpus of COAE2011, and achieved good results.
作者 张文文 王挺
出处 《计算机与数字工程》 2013年第1期64-68,共5页 Computer & Digital Engineering
基金 国家自然科学基金(编号:61170156)资助
关键词 观点句 不规范 无监督 精抽取 泛抽取 subjective sentence, non-standard, unsupervised, accurate extraction broad extraction
  • 相关文献

参考文献11

  • 1Bing Liu. Sentimental Analysis and Subjectivity[M]. In: N. Indurkhya and F. J. Damerau. Handbook of Natural Language Processing[M]. 2nd Edition, Cambridge: CRC Press, 2010: 627-665.
  • 2黄萱菁 赵军.中文文本情感倾向性分析.中国计算机学会通讯,2008,4(2):41-46.
  • 3许洪波,孙乐,姚天畴,等.第三届中文倾向性分析评测(COAE2011)总结报告[c]//第三届中文倾向性分析评测(COAE2011),济南:中国科学院计算机技术研究所,2011:1-24.
  • 4蒙新泛,王厚峰.主客观识别中的上下文因素的研究[A].中国计算机语言学研究前沿进展(2007-2009)[c]//烟台:第十届全国计算语言学学术会议(CNCCL2009),2009:594-599.
  • 5Janyce Wiebe M, Bruce, Rebecca F, et al. Development and use of a gold standard dataset for subjectivity classifications [A]. Proc. 37th Annual Meeting of the Assoc. for Computa- tional Linguisties(ACL-99)[C]//University of Maryland: As sociation for Computational Linguistics, 1999 : 46-253.
  • 6Ellen Riloff, J'anyce Wiebe. Learning extraction patterns for Subjective Expressions [ C]//Proceedings of EMNLP2003, 2003:105-112.
  • 7Janyce Wiebe, Ellen Riloff. Creating Subjective and Objective Sentence Classifiers from Unannotated Texts[C]//Proceedings of the 6th International Conference on Computational Linguis- tics and Intelligent Text Proeessing(CICLin05), 2005 : 1-12.
  • 8Wiebe J, Wilson T, Cardie C. Annotating Expressions o{ O- pinions and Emotions in Language[J]. Language Resources and Evaluation (formerly Computers and the Humanities), 2005,39(2/3) : 164-210.
  • 9徐睿峰,王亚伟,徐军,等.基于多知识源融合和多分类器表决的中文观点分析[C]//第三届中文倾向性分析评测(COAE2011),济南:中国科学院计算机技术研究所,2011:77-87.
  • 10王中卿,王荣洋,庞磊,等.Suda-SAM-(JMS情感倾向性分秒技术报告[c]//第三届中文倾向性分析评测(COAE2011),挣南:中国科学院计算机技术研究所,2011:25-32.

共引文献10

同被引文献53

引证文献8

二级引证文献32

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部