摘要
当今社会已经进入信息量爆发式增长的大数据时代,如何从海量数据中快速查找信息成为当前研究的热点,Lucene软件作为优秀的开源全文检索工具已被广泛应用于各种搜索引擎。文章通过对全文检索原理与Lucene工具架构的研究,从优化内存索引、索引压缩处理、优化磁盘索引等方面探讨Lucene检索效率的优化。实验结果证明,通过优化内存索引、索引压缩处理等方法可以有效地提高全文检索的效率。
Today's society has entered the big data era with the explosive growth of information, how to find information quickly from massive data has become the current research hotspot, as an excellent open source full-text retrieval tool, Lucene has been widely used in all kinds of search engines. This paper studies the principle of full-text retrieval and the architecture of Lucene tool, and discusses the Lucene retrieval efficiency optimization from aspects of optimizing memory index, index compression processing and optimizing disk index. The experimental result proved that the efficiency of full-text retrieval can be improved effectively by optimizing memory index and index compression processing.
出处
《计算机时代》
2017年第11期16-19,共4页
Computer Era
关键词
全文检索
LUCENE
倒排索引
检索优化
full-text retrieval
Lucene
inverted index
retrieval optimization