期刊文献+

一种通用分布式数据抓取系统的设计与实现 被引量:5

Design and implementation of universal distributed data crawling system
下载PDF
导出
摘要 设计并实现了一种通用的具有高可靠性和可扩展性的分布式网络数据抓取系统.给出了服务器和抓取节点的执行算法,并利用实时数据库Influx DB和可视化框架Grafana设计了抓取节点的性能监控系统.利用系统可以跟据需求对互联网的数据进行快速地抓取和收集. In this paper, a universal distributed data crawling system with high reliability and scalability was designed and implemented. The algorithms that run on server and crawling nodes respectively were described. A performance monitoring system based on InfluxDB and Grafana was also created for real - time monitoring. This system can be used to rapidly crawl and collect the data from internet by requirements.
作者 潘庆和
出处 《哈尔滨商业大学学报(自然科学版)》 CAS 2016年第3期307-312,共6页 Journal of Harbin University of Commerce:Natural Sciences Edition
关键词 分布式网络系统 数据抓取 InfluxDB Grafana distributed network data crawling InfluxDB Grafana
  • 相关文献

参考文献10

  • 1SCHRAM A, ANDERSON K M. MySQL to NoSQL: data model- ing challenges in supporting scalability [ C ]//Tucson: The 3rd annual conference on Systems, programming, and applications: software for humanity, 2013. 191 -202.
  • 2BOICEA A, RADULESCU F, AGAPI L I. MongoDB vs Oracle - - database comparison [ C ]// Bucharest : 2012 Third Inter-national Conference on Emerging Intelligent Data and Web Tech- nologies, 2012. 330 - 335.
  • 3PARKER Z, POE S, VRBSKY SV, et al. Comparing nosql mongodb to an sql db [ C ]//USA: Acre Southeast Conference, 2013.1 -6.
  • 4SHVACHKO K, KUANG H,RADIA S C, et al. The Hadoop Distributed File System[ C ]// USA: 2010 IEEE 26th Symposi- um on Mass Storage Systems and Technologies ( MSST), 2010. 1 -10.
  • 5Apache Spark project, http ://spark. apache, org.
  • 6DAVID B, BRIAN K J. Python cookbook [M]. 3rd ed. USA Sebastopol : O' Reilly Media, 2013.
  • 7Influxdata [ DB/OL]. https ://influxdata. com/.
  • 8Grafana [ DB/OL]. http ://www. grafana, org,/.
  • 9MARK L. Programming Python, [ M ]. 3 rd ed. USA Sebastopol : O' Reilly Media, 2006.
  • 10JULIA E, MARK L. Lightweight Django [ M ]. USA Sebas- topoi: O' Reilly Media, 2014.

同被引文献45

引证文献5

二级引证文献21

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部