期刊文献+

信息检索加权理论与技术:基于VSM模型的分析 被引量:5

Term Weighting Schemes and Techniques in Information Retrieval:An Analysis Based on VSM Model
下载PDF
导出
摘要 分析了信息检索加权技术的理论基础,探讨了局部统计分布特性和全局分布特性在词加权技术中的应用以及不对称分布对加权性能的影响,结合词加权技术的基本原理提出了词加权形式化描述与计算模型,并运用该模型对基于向量空间模型的加权技术及其优化策略进行了分析。针对加权技术需解决的关键问题描述文献内容和区分文献,提出计算文献权重应同时利用特征词局部分布和全局分布信息,并消除文献长度和语义信息缺乏等不对称分布问题的影响。
作者 方清华
出处 《情报杂志》 CSSCI 北大核心 2008年第6期73-76,共4页 Journal of Intelligence
  • 相关文献

参考文献17

  • 1Robertson,S. Understanding Inverse Document Frequency: On Theoretical Arguments[J ]. Journal of Documentation, 2004,60 ( 5 ) : 503 - 520
  • 2Salton G, Buckley C. Term - Weighting Approaches in Automatic Text Retrieval[J] ]. Information Processing & Management, 1988,24 (5) :513- 523
  • 3Lan M, Sung SY, Low HB et al. A Comparative Study on Term Weighting Schemes for Text Categorization [ J ]. Proceedings of 2005 IEEE International Joint Conference on Neural Neural Networks, 2005(1) :546 - 551
  • 4Cummins R, O' Riordan. Evolving General Term - Weighting Schemes for Information Retrieval: Tests on Larger Collections [ J ]. Artificial Intelligence Review, 2005 (24) : 277 - 299
  • 5Papineni K. Why Inverse Document Frequency[ C]. NAACL. Proceedings of the North American Association for Computational Linguistics, New York, NY: Association for Computational Linguistics, 2001 : 25 - 32
  • 6Singhal A. Term Weighing Revised[ D]. Ithaca, NY, USA: Cornell University, 1997
  • 7Greiff WA Theory of Term Weighting Based on Exploratory Data Analysis[C]. Croft WB, Moffat A, van Rijsbergen CJ et al. Proceedings of the 21st International ACM SIGIR Conference on Researchand Development in Information Retrieval (SIGIR' 98). New York, NY,USA: ACM,1998:11 - 19
  • 8Wistsch HF. Global Term Weighting in Distributed Environments [J ]. Information Processing and Management ,2007,43(2):1 -13
  • 9Sparck Jones K. A Statistical Interpretation of Term Specificity and Its Application in Retrieval[ J ]. Journal of Documentation, 1972,28 (1):11 -21
  • 10Manning CD,Raghavan P,Schutze H. An Introduction to Information Retrieval [ M]. Cambridge, England: Cambridge University Press,2007:1 - 461

同被引文献33

引证文献5

二级引证文献15

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部