The Design and Implementation of a Web Crawler (网络爬虫的设计与实现)

Cited by: 7
Abstract: Search engine technology has developed rapidly as the Internet has grown. As an indispensable component of a search engine, the web crawler plays a particularly important role: its performance directly determines the quality of web page information gathered across the vast Internet. This paper designs and implements a general crawler and a limitative (restricted) crawler.
Authors: Wang Juan (王娟), Wu Jinpeng (吴金鹏)
Source: Software Guide (《软件导刊》), 2012, No. 4, pp. 136-137 (2 pages)
Keywords: web crawler; general crawler; limitative crawler