摘要
分析了Web2.0网络的网络爬虫面临的新挑战,对目前学术界出现的多种实现方案和策略进行了全面的综述,提出了AJAX爬虫的设计并加以实现,最后进行了实验验证,验证了这种AJAX Crawler能够很好地获取AJAX的动态页面,并与普通的爬虫在下载速度方面进行了对比。
The paper analyzes the new challenges to web crawler in Web2.0, and conducts a comprehensive overview of methods and strategy in current academic. Then the paper puts forward AJAXCrawler and implements it. At last, it makes experiments to verify that AJAXCrawler can do well in getting AJAX dynamic web pages, and makes a contrast with com- mon web crowler in download speed.
出处
《智能计算机与应用》
2013年第6期57-59,62,共4页
Intelligent Computer and Applications