摘要
Internet网络环境搜索引擎主要由三部分组成:搜索器、索引数据库和用户界面。检索器是搜索引擎中的核心和关键。通常的网络搜索引擎采用的是集合或模糊检索算法,其检索结果冗余大。主要介绍了搜索引擎索引数据库的结构;基于Spider的通用搜索器的实现;索引表的生成过程;精确检索原理、算法及实现。采用精确检索算法的搜索引擎,所搜索的信息冗余度小并且效率高。
Network search engine of a Internet consists of three parts : searcher, index Database and user interface. Searcher is the most important part of the search engine , which usually uses search algorithm of aggregate or fuzzy. Results of these search algorithms are great redundant. This paper introduces the structure of index Database of implement technologies in a search engine, currency searcher based on Spider, process creating index table and the principle and the algorithm of accurate search. The information is redundant and efficient through the accurate search algorithm of the search engine.
出处
《微处理机》
2007年第1期75-77,81,共4页
Microprocessors
基金
辽宁省教育厅高等学校科学技术研究基金项目(202023085)