期刊文献+

分布式网页排序算法及其传输模式分析 被引量:1

Algorithms for Distributed Page Ranking and Analysis on Transmission Mode
下载PDF
导出
摘要 网页规模的飞速发展要求分布式网页排序技术的出现。在分析了分布式环境下网页划分的策略后;基于集中式PageRank,给出了适于开放系统的GroupPageRank算法;接着提出了两个分布式网页排序算法并给出了一些相关理论结果。同时还对传输模式进行了探讨,提出了具有良好扩展性的间接传输模式。最后在真实数据集上进行了实验,验证了实验的结果。 Distributed page ranking is needed for the remarkable growth of the web.After the analysis of the strategy for web page partitioning under distributed environment ,an algorithm called GroupPageRank is proposed based on the centralized PageRank algorithm.It is also suitable for open system.Then two distributed PageRank algorithms and also some related theoretical results are given.At the same time ,we perform discussion on transmission modes.A scalable transmission mode,Indirect Transmission Mode,is presented.Finally,we verify some of the discussions by experiments based on real datasets.
作者 余锦 史树明
出处 《计算机工程与应用》 CSCD 北大核心 2004年第29期182-187,共6页 Computer Engineering and Applications
基金 国家自然科学基金项目(编号:60173007) 国家863高技术研究发展计划项目(编号:2001AA111080 2002AA104580)资助
关键词 分布式 网页排序 PAGERANK 间接传输 distributed,page ranking,PageRank,indirect transmission
  • 相关文献

参考文献18

  • 1Jon M Kleinberg.Authoritative sources in a hyperlinked environment[C].In:Proceedings of the Ninth Annual ACMSIAM Symposium on Discrete Algorithms,San Francisco,California, 1998-01
  • 2Lawrence Page,Sergey Brin,Rajeev Motwani et al.The PageRank citation ranking:Bringing order to the Web[R].Technical report,Stanford University Database Group,1998
  • 3http://www.google.com
  • 4T H Haveliwala. Efficient computation of PageRank[R].Technical Report ,Stanford University, 1999
  • 5G Jeh,J Widom. Scaling personalized web search[R].Technical Report,Stanford University,2002
  • 6Rowstron A, P Drnschel.Pastry: Scalable, distributed object location and routing for largescale peer-to-peer systems[C].In:IFIP/ACM Middleware, Heidelberg, Germany, 2001
  • 7Owe Axelsson.Iterative Solution Methods[M].Cambridge University Press, 1994
  • 8S D Kamvar,T H Haveliwala,C D Manning et al. Extrapolation Methods for Accelerating PageRank Computations[R].Technical Report,Stanford University, 2002
  • 9T H Haveliwala.Topic-sensitive PageRank[C].In :Proceedings of the Eleventh International World Wide Web Conference ,2002
  • 10D Rafiei,A O Mendelzon. What is this page known for?Computing web page reputations[C].In:Proceedings of the Ninth International World Wide Web Conference,2000

同被引文献35

  • 1沈贺丹,潘亚楠,邵良杉.关于搜索引擎的研究综述[J].计算机技术与发展,2006,16(4):147-149. 被引量:17
  • 2蒋宗礼,赵钦,肖华,王蕊.高性能并行爬行器[J].计算机工程与设计,2006,27(24):4762-4766. 被引量:7
  • 3张三峰,吴国新.一种面向动态异构网络的容错非对称DHT方法[J].计算机研究与发展,2007,44(6):905-913. 被引量:1
  • 4中国互联网络发展状况统计报告[EB/OL].http://tech.qq.com/a/20080724/000277.htm.2008-9-27.
  • 5Arasu A, Cho J. Searching the Web[J]. ACM Transactions on Internet Technology, 2001,1 (1) : 2-43.
  • 6Dean J, Ghemawat S. MapReduce: Simplified Data Processing on Large Clusters[A]//Proceedings of the 6th Conference on Symposium on Opear-ting Systems Design & Implementation[C]. San Francisco, CA, 2004: 10-10.
  • 7Ghemawat S, Gobioff H, Leung Shun-Tak. The Google File System[A]//Proeeedings of the 19th ACM Symposium on Operating Systems Principles[C]. 2003:20-43.
  • 8Pike R, Dorward S, Griesemer R. Interpreting the Data:Parallel Analysis with Sawzall [J]. Scientific Programming Journal, 2005,13:277-298.
  • 9Chang F, Dean J, Ghemawat S. Bigtable: A Distributed Storage System for Structured Data[A]//7th USENIX Symposium on Operating Systems Design and Implementation[C]. 2006:205- 218.
  • 10Brin S, Page L. The Anatomy of a Large - scale Hypertextual Web Search Engine[J]. Computer Networks, 1998,30:107-117.

引证文献1

二级引证文献89

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部