摘要
Lucene是一个高效全文检索工具包,但它不能直接处理文件和数据库。主要研究Lucene的体系架构及其索引的不足之处,并在其基础上设计实现了一个全文检索构件。该构件能够直接对文件及数据库进行全文检索,使用户在不用编写程序的情况下,快速为自己的桌面系统或Web系统添加全文检索功能。使用插件架构,同时实现了多媒体文本提取插件。
Lucene is a highly efficient full-text retrieval kit, however, it can' t process files and database directly. This paper mainly focuses on the architecture of Lucene and the defects of Lucense' s indexing, and then base on that a full-text retrieval component is designed and implemented. The component can process files and database directly. Using this component, users can add full-text retrieval function to their desktop/web application quickly without coding. By using plug-in architecture, it provides multimedia text retrieval plug-in picking up at same time.
出处
《计算机应用与软件》
CSCD
2010年第2期197-199,230,共4页
Computer Applications and Software
关键词
LUCENE
全文检索
构件
插件架构
多媒体文本提取
Lucene Full-text retrieval Component Plug-in architecture Multimedia text retrieval