摘要
快速有效地索引企业累积的大量的信息资源,是提供高质量检索服务的基础。Lucene是一个用Java写的全文索引引擎工具包,访问索引时间快,支持多用户访问,可以跨平台使用。本文研究了Lucene和中文切分词技术,分析了Lucene的索引原理,实现了一个基于Lucene并支持中英文文档检索的应用实例。
To fast and effectively index vast information resources of enterprises is the basis of providing high quality information retrieval service. Lucene is a full text indexing engine package written in Java language. It has high access speed, supports multi-user accesses and can be used in a cross-platform way. This paper studies Lucene and Chinese word segmentation technology, analyzes the index theory of Lucene and gives an example of the Lucene-based index system which supports Chinese and English document retrieval.
出处
《情报理论与实践》
CSSCI
北大核心
2006年第1期125-128,共4页
Information Studies:Theory & Application