Abstract
Traditional web crawlers process only the hyperlinks in a page and ignore the large number of valuable deep web search forms. This paper designs a form detector for a deep web crawler that identifies search forms, describes its functional modules and their implementation, and validates the detector's effectiveness experimentally.
Source
Science & Technology Information (《科技资讯》)
2009, No. 16, p. 21 (1 page)