支持Ajax的Deep Web爬虫研究与设计被引量：1

Study and Design of an Ajax-Supported Deep Web Crawler

下载PDF

导出

摘要随着互联网的迅速发展,网络资源日益丰富,如何从Web尤其是Deep Web中获取信息成为人们关注的焦点,以Ajax为基础的新一代网页信息抓取问题也逐渐成为研究热点。通过分析支持Ajax的Deep Web爬虫关键技术,提出了支持Ajax的Deep Web爬虫的体系结构,阐述了一种自动爬行Ajax网站的算法,为该爬虫的总体框架设计奠定了基础。 With the rapid development of Intemet, the network resources are getting more and more abundant, how to extract information from network, especially from Deep Web his been focused on. A new generation of Ajax-based web information extraction has become a hot topic. By analyzing the key technology of the Ajax-supported Deep Web Crawler, this paper puts forward the architecture of the Ajax-Supported Deep Web Crawler, and illustrates an algorithm to crawl the Ajax-supported Deep Web automatically, which lay the foundation for the design of the overall framework of an Ajax-supported Deep Web Crawler.

作者周杨

机构地区军事经济学院基础部计算机教研室

出处《计算机系统应用》 2012年第2期167-171,共5页 Computer Systems & Applications

关键词 DEEP WEB 爬虫 AJAX 搜索引擎 deep Web crawler Ajax search engine

分类号 TP393.09 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献4

1杨丽萍马继涛张虹霞.网络搜索引擎分类与发展.情报学报,2006,25(10):421-424.
2Bergman MK. The Deep Web:Surfacing Hidden Value. http://www.brightplanet.com/resources/details/deepweb.html.
3He H, Meng WY. Automatic integration of Web search interfaces with WiSE-integrator. VLDB Journal, 2004, 13(3): 269.
4郑冬冬,崔志明.Deep Web爬虫爬行策略研究[J].计算机工程与设计,2006,27(17):3154-3158. 被引量：13

二级参考文献12

1Raghavan S,Garcia-Molina H.Crawling the hidden web[C].Roma,Italy:Proceedings of the 27th International Conference on Very Large Data Bases,2001.129-138.
2Cormen T H,Leiserson C E,Rivest R L.Introduction to algorithms[M].2nd Edition.MIT Press/McGraw Hill,2001.
3Ipeirotis P,Gravano L.Distributed search over the hidden web:Hierarchical database sampling and selection[C].VLDB,2002.
4Ntoulas A,Cho J,Olston C.What's new on the web? The evolution of the web from a search engine perspective[Z].WWW,2004.
5Barbosa L,Freire J.Siphoning hidden-web data through keyword-based interfaces[C].SBBD,2004.
6Cope J,Craswell N,Hawking D.Automated discovery of search interfaces on the web[C].14th Australasian conference on Data Base technologies,2003.
7He B,Chang K C C.Statistical schema matching across web query interfaces[C].SIGMOD Conference,2003.
8Ipeirotis P G,Gravano L,Sahami M.Probe,count,and classify:Categorizing hidden web databases[C].SIGMOD,2001.
9Liu V Z,Luo J C Richard C,Chu W W.Dpro:A probabilistic approach for hidden web database selection using dynamic probing[C].ICDE,2004.
10Wang Jiying.Information discovery,extraction and integration for the hidden web[C].2002.

共引文献14

1杨占江,李英杰.一种Web商情采集系统需求及结构模型[J].商场现代化,2007(10S):57-58.
2董旻,方曙.Deep Web信息抽取研究[J].图书情报工作,2007,51(10):25-28. 被引量：5
3曾伟辉,李淼.深层网络爬虫研究综述[J].计算机系统应用,2008,17(5):122-126. 被引量：39
4张云冬,徐和祥,胡运发,邓河.基于个性化图书馆的Deep Web Crawler研究与实现[J].计算机应用与软件,2009,26(4):148-149. 被引量：1
5周二虎,张水平,胡洋.基于Deep Web检索的查询结果处理技术的应用[J].计算机工程与设计,2010,31(1):106-109.
6黄聪会,张水平,胡洋.主题Deep Web爬虫框架研究[J].计算机工程与设计,2010,31(5):929-931. 被引量：3
7李贵,韩子扬,郑新录,李征宇.基于Apriori算法的Deep Web网页关系挖掘研究[J].山东大学学报（理学版）,2011,46(5):67-70.
8郭少友,赵善义,李建平,王斌.基于数据库分类的deep web爬行器研究[J].情报科学,2011,29(10):1575-1579.
9钱程,阳小兰.一种支持Ajax框架的网络爬虫的设计与实现[J].计算机与数字工程,2012,40(4):69-71. 被引量：3
10赵昊,卫刚,赵晓东.基于主题Deep Web数据挖掘的研究与探索[J].电脑知识与技术,2012,8(6):3792-3795.

同被引文献5

1张翼,揭金良.基于MVC的企业级应用开发[J].铁路计算机应用,2006,15(11):8-10. 被引量：2
2王得宝,孙美娟,陆守一,崔赛华,闫珺.SVG和GML在WebGIS中的应用研究[J].铁路计算机应用,2007,16(6):40-42. 被引量：2
3史宏.基于信息整合技术的铁路安全监督管理信息系统研究[J].铁路计算机应用,2009,18(5):18-22. 被引量：2
4刘旭光.WebService服务的探析[J].数字技术与应用,2013,31(2):189-189. 被引量：2
5刘英丹,董传良.利用Web Service实现企业应用集成[J].计算机应用,2003,23(7):124-126. 被引量：74

引证文献1

1李琦,蔡海勇,林楷.车务站段安全生产指挥辅助系统的研究与实现[J].铁路计算机应用,2019,28(7):36-39. 被引量：3

二级引证文献3

1林楷.关于铁路制动铁鞋北斗定位防盗系统的研究[J].数码世界,2020,0(1):277-277.
2张亮,赵明.车务段安全生产指挥中心现状及作用提升分析[J].中国铁路,2020(7):73-77. 被引量：4
3曹海鹏,唐伟忠,宋哲超,刘海宁,李思维.车务段安全生产指挥中心管理系统设计[J].铁路计算机应用,2024,33(6):46-56.

1程世奇,武新军,郭锴,赵昆明.架空工业管道漏磁无线检测系统的研制[J].化工自动化及仪表,2015,42(5):492-495 522. 被引量：1
2林琳.首款单兵式掌上自动检测系统[J].中国特种设备安全,2011,27(2):68-68.
3胡春江,刘学仁,赵德利,温定筠,张凯国.一种绝缘子检测机器人的应用研究[J].信息技术与信息化,2014(3):126-128. 被引量：2
4祝宇,夏诏杰,聂峰光,郭力.支持向量机在化学主题爬虫中的应用[J].计算机与应用化学,2006,23(4):329-332. 被引量：8
5吴功平,肖晓晖,郭应龙,胡基才.架空高压输电线自动爬行机器人的研制[J].中国机械工程,2006,17(3):237-240. 被引量：35
6王明维,邵守斌.储罐在线检测技术研究[J].油气田地面工程,2008,27(7):16-17. 被引量：1
7四爪机器人：可自动清扫房间不留死角[J].读写算（科技知识动漫）,2014,0(9):4-4.

计算机系统应用

2012年第2期

浏览历史

内容加载中请稍等...

支持Ajax的Deep Web爬虫研究与设计被引量：1

参考文献4

二级参考文献12

共引文献14

同被引文献5

引证文献1

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

支持Ajax的Deep Web爬虫研究与设计 被引量：1

参考文献4

二级参考文献12

共引文献14

同被引文献5

引证文献1

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

支持Ajax的Deep Web爬虫研究与设计被引量：1