Study and Design of an Ajax-Supported Deep Web Crawler (cited by: 1)
Abstract: With the rapid development of the Internet, network resources are becoming increasingly abundant, and how to extract information from the Web, especially the Deep Web, has become a focus of attention. Extracting information from the new generation of Ajax-based web pages has likewise become an active research topic. By analyzing the key technologies of an Ajax-supported Deep Web crawler, this paper proposes an architecture for such a crawler and describes an algorithm for automatically crawling Ajax websites, laying the foundation for the design of the crawler's overall framework.
Author: Zhou Yang (周杨)
Source: Computer Systems & Applications (《计算机系统应用》), 2012, No. 2, pp. 167-171 (5 pages)
Keywords: Deep Web; crawler; Ajax; search engine