摘要
随着Web规模的不断扩大,搜索引擎正成为因特网上最常用的应用之一。本文以天网搜索为实例,分析了大规模通用型中文搜索引擎检索系统的设计与实现技术。围绕检索效率和检索效果两个方面,本文介绍天网检索系统的集成框架结构和分布式架构,并分析了索引创建和索引检索中的相关实现技术。
With the flourish of the Web, search engine becomes one of the most popular applications on the Internet. In this paper, we analyze the design and implementation of Tianwang, which is a large-scale general Chinese search engine. Based on the principle of efficiency and effectiveness, we describe the integrated retrieval system framework and the distributed retrieval architecture of Tianwang. Then we analyze the technical details in the index creation and index retrieval, which lead to a high-performance search engine retrieval system.
出处
《计算机工程与科学》
CSCD
2006年第3期1-4,共4页
Computer Engineering & Science
基金
国家973计划资助项目(G1999032706)
教育部博士点基金课题(20030001076)
关键词
搜索引擎
信息检索
天网
search engine
information retrieval
Tianwang