A Vector Space Projection HITS Algorithm Based on Similarity Value
Abstract: The result sets returned by traditional search engines often suffer from low precision and redundant information. The HITS algorithm, which is based on Web structure mining, can improve the effectiveness of Web page search. Building on an in-depth analysis of HITS and its related improved algorithms, this paper proposes a vector space projection HITS algorithm based on similarity values. The algorithm combines page text content with hyperlink structure analysis, which largely eliminates the topic drift that affects HITS without adding extra system overhead.
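Source: Modern Computer (《现代计算机》), 2009, No. 10, pp. 20-22, 37 (4 pages)
Funding: Natural Science Foundation Project of the Chongqing Science and Technology Commission (CSTC No. 2007BB2439); Chongqing Municipal Education Commission Foundation Project (No. 0634167)
Keywords: hyperlink; HITS algorithm; similarity value; vector projection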
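
The abstract does not give the paper's formulas, so the following is only a minimal sketch of the general idea it describes: running the standard HITS hub/authority iteration while letting page text influence the link weights. Here each link is weighted by the cosine similarity between a query term vector and the target page's term vector, which is one common way to damp topic drift and is not necessarily the projection scheme used in the paper. All function names, parameters, and the toy data are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two sparse term-weight dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

def normalize(scores):
    """Scale a score dict to unit Euclidean length."""
    norm = math.sqrt(sum(s * s for s in scores.values()))
    return {p: (s / norm if norm else 0.0) for p, s in scores.items()}

def weighted_hits(links, page_vectors, query_vector, iterations=50):
    """HITS power iteration with similarity-weighted links.

    Assumption (not from the paper): the weight of a link src -> dst is
    the cosine similarity between the query and the text of dst.
    """
    pages = sorted(page_vectors)
    sim = {p: cosine(query_vector, page_vectors[p]) for p in pages}
    hub = {p: 1.0 for p in pages}
    auth = {p: 1.0 for p in pages}
    for _ in range(iterations):
        # Authority update: sum of hub scores of pages linking in.
        new_auth = {p: 0.0 for p in pages}
        for src, dst in links:
            new_auth[dst] += sim[dst] * hub[src]
        auth = normalize(new_auth)
        # Hub update: sum of authority scores of pages linked to.
        new_hub = {p: 0.0 for p in pages}
        for src, dst in links:
            new_hub[src] += sim[dst] * auth[dst]
        hub = normalize(new_hub)
    return hub, auth

# Toy graph: pages "a" and "d" both link to "b" (on topic) and "c" (off topic).
links = [("a", "b"), ("a", "c"), ("d", "b"), ("d", "c")]
page_vectors = {
    "a": {"hits": 1.0},
    "b": {"hits": 1.0, "link": 1.0},
    "c": {"cooking": 1.0},
    "d": {"link": 1.0},
}
query_vector = {"hits": 1.0, "link": 1.0}
hub, auth = weighted_hits(links, page_vectors, query_vector)
```

In plain HITS, pages "b" and "c" would receive identical authority scores because they have the same in-links. With the similarity weighting, the off-topic page "c" shares no terms with the query, so its incoming links carry zero weight and it drops out of the authority ranking.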