摘要
用户需要检索的信息往往分散存储在多个搜索引擎各自的数据库里 .对普通用户而言 ,访问多个搜索引擎并从返回的结果中分辨出确实有用的网页是一件费时费力的工作 .集成搜索引擎则可以提供给用户一个同时访问多个搜索引擎的集成环境 .集成搜索引擎能将其接收到的用户查询提交给底层的多个搜索引擎进行搜索 .作为一种搜索工具 ,集成搜索引擎具有如 WEB查询覆盖面比传统引擎更大 ,引擎有更好的可扩展性等优点 .讨论了解决集成搜索引擎的数据库选择问题的多种技术 .针对用户提交的查询要求 。
Frequently a user's information needs are stored in the databases of multiple search engines. It is inconvenient and inefficient for an ordinary user to invoke multiple search engines and identify useful documents from the returned results. To support unified access to multiple search engines, a metasearch engine can be constructed. When a metasearch engine receives a query from a user, it invokes the underlying search engines to retrieve useful information for the user. Metasearch engines have other benefits as a search tool, such as increasing the coverage of the Web and improving the scalability of the search. In this paper, techniques are surveyed, which are proposed to tackle the database selection problem in a metasearch engine environment. Database selection is to identify search engines that are likely to return useful documents to a given query.
出处
《计算机研究与发展》
EI
CSCD
北大核心
2001年第4期396-404,共9页
Journal of Computer Research and Development
基金
US NSF基金提供部分资助! (IIS-990 2 872 )
关键词
搜索引擎
信息检索
WEB
方维网
文本数据库
metasearch, information resource discovery, search engine, information retrieval