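Abstract
To gain a deeper understanding of the ranking techniques used by Lucene and their practical application, this paper studies the vector space model of information retrieval and the tf-idf weighting scheme. Based on the factors that influence ranking, it presents Lucene's document scoring algorithm and analyzes the effect of each factor on the ranked results. Worked examples then apply the Lucene sorting APIs, with the aim of improving Lucene's sorting performance.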
Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java, and developers have widely used it to build search engines. This article focuses on how search engines sort their results. It first examines the theory behind sorting, then studies Lucene's sorting algorithm, and finally applies the basic Lucene sorting API in practice.
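The abstract does not reproduce the scoring formula, so the following is only a minimal sketch of tf-idf scoring in the style of Lucene's classic similarity, assuming tf = sqrt(freq), idf = 1 + ln(numDocs / (docFreq + 1)), and a length norm of 1 / sqrt(document length). The class `TfIdfSketch` and its method names are hypothetical and do not use the Lucene API; the coordination factor, query normalization, and boosts are omitted.

```java
import java.util.Collections;
import java.util.HashSet;
import java.util.List;

// Hypothetical illustration class; not part of Lucene.
final class TfIdfSketch {

    // Term-frequency factor: grows with the square root of the raw count (assumed form).
    static double tf(int freq) {
        return Math.sqrt(freq);
    }

    // Inverse document frequency: terms found in fewer documents weigh more (assumed form).
    static double idf(int docFreq, int numDocs) {
        return 1.0 + Math.log((double) numDocs / (docFreq + 1));
    }

    // Simplified score of one tokenized document against a tokenized query:
    // sum over distinct query terms of tf * idf^2, scaled by 1 / sqrt(doc length).
    static double score(List<String> query, List<String> doc, List<List<String>> corpus) {
        if (doc.isEmpty()) {
            return 0.0;
        }
        double sum = 0.0;
        for (String term : new HashSet<>(query)) {
            int freq = Collections.frequency(doc, term);
            if (freq == 0) {
                continue; // term absent from this document contributes nothing
            }
            int docFreq = 0;
            for (List<String> other : corpus) {
                if (other.contains(term)) {
                    docFreq++;
                }
            }
            double weight = idf(docFreq, corpus.size());
            sum += tf(freq) * weight * weight;
        }
        return sum / Math.sqrt(doc.size());
    }
}
```

Calling `TfIdfSketch.score(query, doc, corpus)` for every document and sorting by the returned value in descending order gives a relevance ranking. The factors in the sum (term frequency, rarity of the term, document length) are the kind of ranking influences the abstract refers to.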
Source
Journal of Chongqing Institute of Technology (Natural Science Edition) (《重庆工学院学报(自然科学版)》)
2008, No. 12, pp. 102-105 (4 pages)