期刊文献+

基于云计算的Pagerank算法的改进

An improved Pagerank algorithm based on cloud computing
原文传递
导出
摘要 针对Pagerank算法在Web结构挖掘中存在的需要大量迭代的问题,提出一种新的方法.该方法通过对原始Pagerank值的计算公式进行改进,降低了迭代次数.实验表明,在云计算环境下,新方法减少了网络通信和访问HDFS的消耗,在时间花费上优于传统的Pagerank算法. With the advent of the era of cloud computing, it is a new important research topic to discuss the problem of the web mining based the cloud computing. A new method is proposed to solve the large number of iterations problems in the Web structure mining for the Pagerank algorithm. Through improving the formula of the original pagerank value, it reduces the number of iterations. The experiments show that this method reduces the network traffic and the consumption of accessing HDFS in the cloud computing enviroment, and it is superior to the original Pagerank algorithm in the time consumption.
作者 郑晶
出处 《福州大学学报(自然科学版)》 CAS CSCD 北大核心 2014年第1期45-49,共5页 Journal of Fuzhou University(Natural Science Edition)
基金 国家自然科学基金资助项目(30671680) 国家科技型中小企业技术创新基金资助项目(11C26213502126) 福建省教育厅科技资助项目(JA11269) 福建江夏学院青年资助项目(2011C005)
关键词 云计算 WEB结构挖掘 PAGERANK MAPREDUCE cloud computing Web structure mining Pagerank Mapreduce
  • 相关文献

参考文献9

  • 1Chen M S, PARK J S, YU P S. Data mining for path traversal patterns in a Web environment[ C]//Proceedings of the 16th In- ternational Conference on Distributed Computing Systems. Hong Kong : IEEE, 1996 : 385 - 392.
  • 2Brin S, Page L. The anatomy of a large - scale hypertextual Web search engine [ C ]//Proceedings of the Seventh International World Wide Web Conference. Brisbane: Elsevier Science Publishers, 1998:107 - 117.
  • 3Haveliwala T H. Topic -sensitive Pagerank [ C ]//Proceedings of the Eleventh International World Wide Web Conference. New York: ACM, 2002:517-526.
  • 4Richardson M, Domingos P. The intelligent surfer : probabilistic combination of link and content information in Pagerank [ J ]. Advances in Neural Information Processing Systems, 2002, 14, 1 441 - 1 448.
  • 5宋聚平,王永成,尹中航,滕伟.对网页PageRank算法的改进[J].上海交通大学学报,2003,37(3):397-400. 被引量:40
  • 6戚华春,黄德才,郑月锋.具有时间反馈的PageRank改进算法[J].浙江工业大学学报,2005,33(3):272-275. 被引量:27
  • 7程苗.基于云计算的Web数据挖掘[J].计算机科学,2011,38(B10):146-149. 被引量:51
  • 8Dean J, Ghemawat S. Mapreduce: simplied data processing on large cluster[ C ]//Proceedings of the 6'h Conference on Sympo- sium on Opearting Systems, Design and Implementation. [ s. 1. ] : USENIX Association, 2004.
  • 9Stanford Universtity. Standfor network analysis platform [ EB/OL ]. [ 2002 - 05 - 08 ]. http : //snap. stanford, edu/data/index. html.

二级参考文献22

  • 1席景科,闫大顺.Web数据挖掘中数据集成问题的研究[J].计算机工程与设计,2006,27(8):1366-1368. 被引量:6
  • 2Cannataro M, Talia D, Trunfio P. KNOWLEDGE GRID.. High Performance Knowledge Discovery on the Grid [C] // Lecture Notes In Computer Science, Vol. 2242, Proceedings of the Second International Workshop on Grid Computing. 2001:38-50.
  • 3Ye Yan-bin, Chiang C-C. A Parallel Apriori Algorithm for Frequent Item sets Mining[C]//Proeeedings of the Fourth International Conference on Software Engineering Research Manage- ment and Applications(SERA'06). 2006:87-94.
  • 4Armbrust M, Fox A, Griffith R, et al. Above the Clouds: A Berkeley View of Cloud Computing.
  • 5王鹏.云计算的关键技术与应用实例.
  • 6Cooley R, Mobasher B, Srivastava J. Web mining: Information and pattern discovery on the World Wide Web[A]. 9th International Conference on Tools with Artificial Intelligence (ICTAI'97). IEEE Computer Society[C]. 1997. 558-567.
  • 7Page L, Brin S, Motwani R, et al. The pagerank citation ranking: Bringing order to the WEB [EB/OL]. http://newdbpubs. stanford. edu/8090/pub/1999-66/1999-11-11.
  • 8Jon M K. Authoritative sources in a hyperlinked environment [J]. Journal of the ACM, 1999,46(5):668-677.
  • 9Oren Zamir, Oren Etzioni. Grouper: a dynamic clustering interface to Web search results [J]. Computer Networks, 1999, 31:58-63.
  • 10Brin S, Page L. The anatomy of a large-scale hypertextual Web-search engine [A]. Proc 7th International World Wide Web Conference[C]. Brisbane:SIGIR, 1998. 146-164.

共引文献109

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部