期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
A New Framework for Focused Web Crawling 被引量:3
1
作者 PENG Tao HE Fengling ZUO Wanli 《Wuhan University Journal of Natural Sciences》 CAS 2006年第5期1394-1397,共4页
Focused crawlers are important tools to support applications such as specialized Web portals, online searching, and Web search engines. A topic driven crawler chooses the best URLs and relevant pages to pursue during ... Focused crawlers are important tools to support applications such as specialized Web portals, online searching, and Web search engines. A topic driven crawler chooses the best URLs and relevant pages to pursue during Web crawling. It is difficult to deal with irrelevant pages. This paper presents a novel focused crawler framework. In our focused crawler, we propose a method to overcome some of the limitations of dealing with the irrelevant pages. We also introduce the implementation of our focused crawler and present some important metrics and an evaluation function for ranking pages relevance. The experimental result shows that our crawler can obtain more "important" pages and has a high precision and recall value. 展开更多
关键词 focused crawlers irrelevant pages relevance metrics
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部