摘要
文章从介绍一般爬虫的逻辑结构开始,分类综述了发展历史中出现不同协作方式的顺序、并行和分布式爬虫,通用爬虫、深度爬虫以及增量爬虫等特殊分类的爬虫,着重介绍了主题爬虫的原理和相关策略,优势、应用和问题,最后提出主题爬虫未来的研究方向。
This article begins from the introduction of the logical structure of general crawler, reviews the different coordination modes of sequential, parallel and distributed crawler development history, general reptiles, Deep reptiles and incremental reptiles and other special classification of reptiles, focusing on the principles of the topical crawler and related strategies, advantages, applications and problems, and finally proposed the future research direction of the crawler.
出处
《电脑知识与技术》
2017年第9X期213-214,共2页
Computer Knowledge and Technology