期刊文献+

爬行虫算法设计与程序实现 被引量:14

Design of Crawler's Algorithm and Implementation of Crawler's Program
下载PDF
导出
摘要 爬行虫算法是搜索引擎探讨的热点。通过分析现有爬行虫算法设计和程序实现的主要方式 ,权衡其利弊 ,总结出一个适合于中小型网站网页下载的爬行虫算法。并使用jBuider8.0工具实现了该算法。通过实验分析 ,该程序下载的网页数的速度为 1882 4 2个网页 /分和 4 1.92 74 .5 9KB/秒。 The research of crawler's algorithm is a hotspot in search engine. This paper,first analyses the current method of designing crawler's algorithm and realizing crawler's program and concludes its disadvantage and advantage. Then it gives a crawler's algorithm of retrieval web page suitable for medium and small-sized web site and realizes this algorithm by jbuider8.0?It is proved that the program speed of downloading web pages is 188~242/minus and 41.92~74.59KB/second.
出处 《计算机应用》 CSCD 北大核心 2004年第1期33-35,共3页 journal of Computer Applications
关键词 爬行虫算法 爬行虫程序 搜索引擎 crawler's algorithm crawler's program search engine
  • 相关文献

参考文献2

二级参考文献15

  • 1[1]Mark A.C.Overmeer.My personal search engine.Computer Networks,1999,31:2271~2279
  • 2[2]S.Lawrence,C.Lee Giles.Accessibility of information on the Web.Nature,1999,400
  • 3[3]M.Koster.Robots in the web:threat or treat.Conne Xions,1995,9(4) http://info.webcrawler.com/mak/projects/robots/threat-or-treat.html
  • 4[4]Krishan Bharat,Andrei Broder,Monika Henzinger,etc..The connectivity derver:fast access to linkage information on the web.Proc.7th International World Wide Web Conference,1998
  • 5[5]Soumen Chakrabarti.Mining the Web's link structure.Computer,IEEE,1999,August:60~67
  • 6[6]Altigran S.Da Silva,Eveline A.Veloso,Paulo B.Golgher,etc..CoBWeb--A crawler for the Brazilian Web.String Processing and Information Retrieval Symposium,1999:184~191
  • 7[7]C.M.Bowman,P.B.Danzig,D.R.Hardy,U.Manber,and M.F.Schwartz.Harvest:a scalable,customizable discovery and access system.Technical Report CU-CS-732-94,1994
  • 8[8]H.Yamana,K.Tamur,H.Kawano,S.Kamei,M.Harada,etc.Experiments of collecting www information using distributed www robots.In Proceedings of the 21st International ACM SIGIR Conference,Australian,1998
  • 9[9]Y.S.Maarek,et al.WebCutter:a system for dynamic and tailorable site mapping.Proc.of 6th WWW Conference,Santa Clara,USA,April,1997
  • 10[10]Gun-Woo Nam,Jong-Hee Park,Tai-Yun Kim.Dynamic management of URL based on object-oriented paradigm.Parallel and Distributed Systems,IEEE,1998:226~230

共引文献20

同被引文献69

引证文献14

二级引证文献68

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部