Abstract
This paper combines thesauri with traditional information retrieval techniques and proposes a method in which a family of terms from a thesaurus is used to describe a crawler's predefined topic. A focused crawler is designed and implemented with this method, and its efficiency is compared with that of other well-known Web search engines. The experimental results demonstrate the effectiveness of the proposed model and algorithms.
Source
《现代图书情报技术》
CSSCI
PKU Core Journal (北大核心)
2007, No. 5, pp. 41-44 (4 pages)
New Technology of Library and Information Service
Funding
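One of the research results of the Beijing Natural Science Foundation project "Web Page Information Search Technology Based on Genetic Algorithms" (Project No. 4062013).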
Keywords
Focused crawler; Thesaurus; Search engine