Abstract
Lucene is an excellent open-source full-text search framework, and its full-text retrieval technology is widely used as a basic technique in the field of information retrieval. It makes it easy to add full-text indexing and search to a wide range of applications, and it can quickly and efficiently index the large volumes of information that enterprises accumulate. This paper explains why a full-text retrieval system is needed, introduces the concept of such a system, and analyzes the architecture and implementation mechanism of Lucene. Finally, drawing on a practical application, it describes the implementation of a full-text retrieval system in terms of system design, document extraction, index construction, and query execution. In practice, the system's precision and query speed both met the design requirements.
Source
Computer Technology and Development (《计算机技术与发展》)
2011, No. 3, pp. 67-71 (5 pages)
Funding
Supported by the Natural Science Foundation of Guizhou Province (黔教科2008090)
Keywords
full-text retrieval
index
information retrieval