摘要
网页规模的飞速发展要求分布式网页排序技术的出现。在分析了分布式环境下网页划分的策略后;基于集中式PageRank,给出了适于开放系统的GroupPageRank算法;接着提出了两个分布式网页排序算法并给出了一些相关理论结果。同时还对传输模式进行了探讨,提出了具有良好扩展性的间接传输模式。最后在真实数据集上进行了实验,验证了实验的结果。
Distributed page ranking is needed for the remarkable growth of the web.After the analysis of the strategy for web page partitioning under distributed environment ,an algorithm called GroupPageRank is proposed based on the centralized PageRank algorithm.It is also suitable for open system.Then two distributed PageRank algorithms and also some related theoretical results are given.At the same time ,we perform discussion on transmission modes.A scalable transmission mode,Indirect Transmission Mode,is presented.Finally,we verify some of the discussions by experiments based on real datasets.
出处
《计算机工程与应用》
CSCD
北大核心
2004年第29期182-187,共6页
Computer Engineering and Applications
基金
国家自然科学基金项目(编号:60173007)
国家863高技术研究发展计划项目(编号:2001AA111080
2002AA104580)资助