期刊文献+

基于博主背景的博客倾向性检索归一化策略 被引量:3

A Blog-Profile Based Normalization Strategy for Blog Opinion Retrieval
下载PDF
导出
摘要 博客倾向性检索的目标是检索出不仅与特定查询主题相关而且包含针对该主题的评论的博文单元,并依据倾向性强度进行排序。目前大多数研究工作仅仅通过单个博文单元包含的主题倾向性强弱对博文进行排序。然而,博客是博主表达自己观点情感的媒介,博主的个性风格很大程度上影响着倾向性强度,忽略博主因素仅仅使用单个博文单元获取倾向性评分,会给倾向性评分带来偏差。针对这个问题,该文首先分析博主背景因素对倾向性评分的影响并建立博主背景模型,然后提出基于博主背景的博客倾向性检索归一化策略,最后使用该策略对基于概率推理模型的博客倾向性检索算法进行归一化。实验结果表明,基于博主背景的倾向性检索归一化策略能够更加合理地对博主单元进行排序。 The goal of Blog Opinion Retrieval is to retrieve the blog units that not only relate to a given query but also comment on the query. Previous works ranked blog units by the opinion strength of a single blog unit. However, since blog is the media expressing the btogger's opinions and feelings, the personality of a blogger could affect the strength of his opinion. Therefore, it is disadvantageous defect to use only a single blog unit to get opinion score while neglecting the blogger's factor. In this paper we build a hlogger profile and then present a blogger-profile based normalization strategy for blog opinion retrieval. We apply it to normalize the Blog Opinion Retrieval algorithm based on probabilistic inference model. Experiment results show that the proposed normalization strategy could rank blog units more reasonably and improve the retrieval performance.
出处 《中文信息学报》 CSCD 北大核心 2010年第3期75-80,104,共7页 Journal of Chinese Information Processing
基金 福建省科技创新平台计划项目(2009J1007) 福州大学引进人才基金(022224)
关键词 计算机应用 中文信息处理 博客倾向性检索 博主背景模型 归一化策略 computer application Chinese information processing blog opinion retrieval blogge profile normalization strategy
  • 相关文献

参考文献16

  • 1Arun Qamra,Belle Tseng and Edward Y.Chang.Mining Blog Stories Using CommunityBased and Temporal Clustering[C] // Proc.of CIKM' 06.Arlington,Virginia,USA:ACM 2006.
  • 2Ounis Iadh,de Rijke Maarten,et al.Overview of the TREC-2006 Blog Track[C/OL] //Proc.of the Fifteenth Text REtrieval Conference (TREC 2006).Gaithersburg,Maryland,USA:NIST 2006.[2007-01-23],http://trec.nist.gov/pubs/trecl5/papers/BLOG06.OVERVIEW,pdf.
  • 3Craig Macdonald,Iadh Ounis,Ian.Soboroff Overview of the TREC-2007 Blog Track[C/OL] // Proc.of The Sixteenth Text REtrieval (TREC 2007).Gaithersburg,Maryland,USA,NIST 2007.[2007-12-12],http://trec.nist.gov/pubs/trecl6/papers/BLOG.O-VERVIEW16.pdf.
  • 4杨宇航,赵铁军,于浩,郑德权.Blog研究[J].软件学报,2008,19(4):912-924. 被引量:19
  • 5Turney P.Thumbs up or Thumbs down? Semantic orientation applied to unsupervised classification of reviews[C] // Proc.of ACL'02.Philadelphia,PA,USA:Association for Computational Linguistics,2002:417-424.
  • 6Pang B,Lee L and Vaithyanathan S.Thumbs up?Sentiment Classification Using Machine Learning Techniques[C] // Proc.of ACL'02.Philadelphia,PA,USA:Association for Computational Linguistics,2002-79-86.
  • 7Pang Bo,Lee Lillian.A Sentimental Education:Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts[C] // Proc.of ACL'04.Barcelona,Spain:Association for Computational Linguistics,2004:1030-1035.
  • 8M.Hurst and K.Nigam.Retrieving Topical Sentiments from Online Document Collections[C]// Document Recognition and Retrieval XI,2004:27-34.
  • 9K.Eguchi,V.Lavrenko.Sentiment Retrieval using Generative Models[C]// Proceedings of Empirical Methods on Natural Language Processing (EMNLP),2006-345-354.
  • 10Min Zhang,Xingyao Ye.A Generation Model to Unify Topic Relevance and Lexicon-based Sentiment for Opinion Retrieval[C]// the Proceedings of SIGIR' 08,Singapore,July 20-24,2008.

二级参考文献24

  • 1杨楠,弓丹志,李忺,孟小峰.Web社区发现技术综述[J].计算机研究与发展,2005,42(3):439-447. 被引量:35
  • 2Ounis Iadh, de Rijke Maart:en, et al. Overview of the TREC- 2006 Blog track [OL]. [2007-01-23]. http://trec, nist. gov/ pubs/trecl 5/papers/BLOG06. OVERVIEW. pdf.
  • 3Craig Macdonald, Iadh Ounis, Ian. Soboroff. Overview of the TREC-2007 Blog track [OL]. [2007-12-12]. http:// trec. nist. gov/pubs/trecl6/papers/BLOG. OVERVIEW16. pdf.
  • 4Hannah D, Macdonald C. University of Glasgow at TREC 2007: Experiments in Blog and Enterprise Tracks with Terrier [OL].[2007-12-12]. http://trec, nist. gov/pubs/ trecl6/papers/uglasgow, biog. ent. final, pdf.
  • 5Zhou Guangxu, Joshi Hemant, Bayrak Coskun. Topic categorization for relevancy and opinion detection [OL]. [2007-12-12]. http://trec, nist. gov/pubs/trecl6/papers/ualr. biog. final, pdf.
  • 6Zhang Wei, Yu Clement, Meng Weiyi. Opinion retrieval from Blogs [C] //Proc of CIKM'07. New York: ACM, 2007.831-840.
  • 7Pang B, Lee L, Vaithyanathan S. Thumbs up? Sentiment classification using machine learning techniques [C] //Proc of ACL'02. Philadelphia, PA, USA: Association for Computational Linguistics, 2002:79-86.
  • 8Mullen T, Collier N. Sentiment analysis using support vector machines with diverse information sourees [C] //Proc of EMNLP'04. Barcelona, Spain: Association for Computational Linguistics, 2004:412-418.
  • 9Turney P. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews [C]//Proc of ACL'02. Philadelphia, PA, USA: Association for Computational Linguistics, 2002:417-424.
  • 10Whitelaw C, Garg N, Argamon S. Using appraisal groups for sentiment analysis [C]//Proc of CIKM'05, New York: ACM, 2005:625-631.

共引文献30

同被引文献26

  • 1刘永丹,曾海泉,李荣陆,胡运发.基于语义分析的倾向性文本过滤[J].通信学报,2004,25(7):78-85. 被引量:34
  • 2杨淑娥,黄礼.基于BP神经网络的上市公司财务预警模型[J].系统工程理论与实践,2005,25(1):12-18. 被引量:199
  • 3朱嫣岚,闵锦,周雅倩,黄萱菁,吴立德.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20. 被引量:325
  • 4应伟,王正欧,安金龙.一种基于改进的支持向量机的多类文本分类方法[J].计算机工程,2006,32(16):74-76. 被引量:28
  • 5徐琳宏,林鸿飞,杨志豪.基于语义理解的文本倾向性识别机制[J].中文信息学报,2007,21(1):96-100. 被引量:119
  • 6China Internet Network Information Center. The 23th statistical re- port of China Internet network development [ EB/OL]. [ 2011 -01 -10]. http://www, cnnic, net. en/uploadfiles/pdf/2009/1/13/ 92458. pdf.
  • 7YI J, NASUKAWA T, BUNESCU R, et al. Sentiment analyzer: Extracting sentiments about a given topic using natural language pro- cessing techniques [ C]//Proceedings of the 3rd IEEE International Conference on Data Mining. Washington, DC: IEEE Computer So- ciety, 2003:427 - 434.
  • 8PANG B, LEE L, VAITHYANATHAN S. Thumbs up? Sentiment classification using machine learning techniques [ C]// Proceedings of the Conference on Empirical Methods in Natural Language Pro- cessing. Stroudsburg: Association for Computational Linguistics, 2002:79 - 86.
  • 9TURNEY P D, LITYMAN M L. Measuring praise and criticism: In- ference of semantic orientation from association [ J]. ACM Transac- tions on Information Systems, 2003, 21(4) : 315 -346.
  • 10ESULI A, SEBASTIANI F. Sentiwordnet: A publicly available lexi- cal resource for opinion mining [ C]// Proceedings of LREC-06, the 5th Conference on Language Resources and Evaluation. Genova, It- aly: [s.n.], 2006: 417-422.

引证文献3

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部