期刊文献+

基于Tf-Idf和网页链接的PageRank改进算法 被引量:1

IMPROVED PAGERANK ALGORITHM BASED ON TF-IDF AND WEBPAGE LINK
下载PDF
导出
摘要 提出基于Tf-Idf和网页链接对传统的PageRank算法不足之处进行改进。该算法不仅较好地解决了PageRank主题漂移问题,而且在查准率和查全率方面也有较大的提高。通过实验证明,该算法可以获得优于传统PageRank算法的查询结果集。 This paper proposes to improve the inadequacy of traditional PageRank algorithm based on Tf-Idf and Webpage link.This improved algorithm solves the theme drift problem of PageRank,and also makes quite big progress in precision and recall.Through experiment we prove that the improved algorithm can acquire better query result set than the traditional PageRank algorithm.
出处 《计算机应用与软件》 CSCD 北大核心 2013年第5期301-302,330,共3页 Computer Applications and Software
关键词 PAGERANK 查全率 搜索引擎 网页连接 主题漂移 PageRank Recall Search engine Webpage link Theme drift
  • 相关文献

参考文献12

  • 1Michael J,Alon Halery.Web Tables Exploring the power of Tables on the Web[J].IEEE Press,2007,9(6):280-286.
  • 2Chakrabarti S,Mvanden Berg,Dom B.Focused Crawling:A New Ap-proach to Topic-Specific Web Resource Discovery[J].Computer Net-works,1999,31(16):1623-1640.
  • 3Watkins C J C H.Learning form delayed reward[D].England:Univer-sity of Cambridge,1989.
  • 4Heydon A,Najork M.Mercator:a scalable,extensible web crawler[J].World Wide Web Journal,1999,8(3):219-229.
  • 5Haveliwala T.Efficient computation of pagerank[R].Stanford Univer-sity:Stanford CA,1999.
  • 6Gospodnetic O,Hatcher E.Lucene in action[M].Greenwich:Manning Publications,2005:80-98.
  • 7Brin S,Page L.The anatomy of a large-scale hypertextual web search engine[C]//Proceedings of7th World Wide Web Conference,1998:107-117.
  • 8Dean J,Henzinger M R.Finding related pages in the WorldWide Web[J].Computer Networks,1999,31(11-16):1467-1469.
  • 9高琪,张永平.PageRank算法中主题漂移的研究[J].微计算机信息,2010,26(9):117-119. 被引量:13
  • 10温泉,丁祥武.基于主题聚焦模型的PageRank改进算法[J].计算机应用与软件,2011,28(3):173-175. 被引量:2

二级参考文献27

  • 1杨占华,杨燕.数据挖掘在智能搜索引擎中的应用[J].微计算机信息,2006,22(04X):244-246. 被引量:22
  • 2蔡明,张体首.基于本体的搜索引擎研究[J].微计算机信息,2006(12X):242-244. 被引量:14
  • 3鲁松 白硕 等.文本中词语权重计算方法的改进[A]..2000 International Conference on Multilingual Information Processing[C].,2000.31-36.
  • 4国互联网络信息中心CNNIC.《第21次中国互联网络发展状况统计报告》[R].http://www.cnnic.net.cn/index/0E/00/11/index.htm.2007.
  • 5Sergey Brin and Lawrence Page.The PageRank Citation Ranking:Bring Order to the Web [C]. Computer Science Department, Stanford University. 1998.
  • 6Sergey Brin and Lawrence Page.The Anatomy of a Large-Scale Hypertexttual WebSearchEngine[C]. Computer Science Department, Stanford University.1998.
  • 7Taher H.Havaliwala.Topic-Sensitive PageRank [C]. Computer Science Department,Stanford Universit.2002.
  • 8George A.Mihaila.HillTop:A Search Engine based on Expert Documents [C].Department of Cumputer Science University of Toronto.2004.
  • 9Gospodneic,ErikHatcher, Otis.Lucene in Action[M].北京.电子工业出版社.2007.
  • 10Wenpu Xing,Ali Ghorbani. Weighted PageRank Algorithm. IEEE 2004.

共引文献62

同被引文献13

引证文献1

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部