基于主题模型的主观性句子识别

Subjectivity Sentence Identification Based on Topic Model

下载PDF

导出

摘要主观性句子识别旨在发现文本集合中具有观点的句子。本文基于概率主题模型,提出融合主题的主观性句子识别模型。该模型通过考虑主题因素识别句子主观性,同时挖掘文本集合中的潜在主观性主题。提出的模型是一个弱监督生成模型,不需要大量的标记语料进行训练,仅需要一小部分领域独立的主观性词典修改模型的先验。实验证明,提出的模型能有效地提高句子识别召回率和F值,同时抽取的主观性主题具有较强的语义信息。 Subjectivity sentence identification aims to detect the opinionated sentences in text. This paper proposes mixing topics and subjectivity sentence identification model based on probabilistic topic model. Through considering the topics, the model can detect the subjective sentences, and can also extract the subjective topics from texts simultaneously. The proposed model is a weakly-supervised generative model, which only needs a small set of domain independent subjectivity lexicon to modify prior of model. The experiment results demonstrate that the model can highly improve the sentence subjectivity identification recall and the F-value, and the extracted subiectivity topics are semantically informative.

作者吴超荣廖祥文

机构地区福州大学数学与计算机科学学院

出处《计算机与现代化》 2012年第12期127-130,135,共5页 Computer and Modernization

基金福建省自然科学基金资助项目(2010J05133) 福州大学科技发展基金资助项目(2010-XQ-22)

关键词主观性句子识别观点挖掘概率主题模型弱监督 subjectivity sentence identification opinion mining probabilistic topic model weakly-supervised

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献15

1Pang B, Lee L. Asentimental education: Sentiment analy- sis using subjectivity summarization based on minimum cuts [ C ]//Proceedings of ACL. 2004:271-278.
2Li L, Yao T. Kernel-based sentiment classification for Chi- nese sentence [ C ]//Proceedings of the 6th International Conference on Advanced Language Processing and Web In- formation Technology. 2007 : 27-32.
3Ikeda D, Takamura H, Ratinov L, et al. Learning to shift the polarity of words for sentiment classification [ C ]//Pro- ceedings of the 3rd International Joint Conference on Natu- ral Language Processing. 2008:296-303.
4Riloff E, Wiebe J. Learning extraction patterns for subje- ctive expressions [ C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMN- LP). 2003:105-112.
5Wiebe J, Riloff E. Creating subjective and objective sen- tence classifiers from unannotated texts [ C ]//Proceedings of the Conference on Computational Linguistics and Intelli- gent Text Processing (C/CLing). 2005:486-497.
6Lin C, He Y, Everson R. Sentence subjectivity detection with weakly-supervised learning [ C ]//Proceedings of Inter- national Joint Conference on Natural Language Processing (IJCNLP). 2011:1153-1161.
7Lin C, He Y, Everson R. A comparative study of Bayesian models for unsupervised sentiment detection [ C ]//Proceed- ings of Conference on Computational Natural Language Learning (CoNLL). 2010 : 144-152.
8Mei Q, Ling X, Wondra M, et al. Topic senti-ment mix- ture: Modeling facets and opinions in weblogs [ C ]//Pro- ceedings of the 16th International Conference on World Wide Web (WWW). 2007:171-180.
9Lin C, He Y. Joint sentiment/topic model for sentiment a- nalysis [C]//Proceedings of the 18th ACM Conference on Information and Knowledge Management. 2009:375-384.
10Yohan Jo, Mice H Oh. Aspect and sentiment unification model for online review analysis [ C ]//Proceedings of the Fourth ACM International Conference on Web Search and Data Mining. 2011:815-824.

1刘培宗.DCS系统安全验收的探讨[J].科技资讯,2009,7(6):14-14.
2周超然,张昕,赵建平.基于层次化模型的多智能体系统设计方法[J].长春理工大学学报（自然科学版）,2016,39(6):105-109.
3谢庭渝.DCS控制系统安全验收评价的探讨[J].化工安全与环境,2004,17(8):16-18.
4谢庭渝.DCS控制系统安全验收评价的探讨[J].电气工程应用,2005(4):29-34.
5杨建锋,马军成,王令超.基于多光谱遥感的耕地等别识别评价因素研究[J].农业工程学报,2012,28(17):230-236. 被引量：30
6Biao Liu Dong Li.Determinants of Successful IT-Enabled Business Innovation： A Case Study from the Perspective of Institutional Entrepreneurship Theory[J].Frontiers of Business Research in China,2014,8(2):227-244. 被引量：1

计算机与现代化

2012年第12期

浏览历史

内容加载中请稍等...

基于主题模型的主观性句子识别

参考文献15

相关作者

相关机构

相关主题

浏览历史