摘要
本文在对Lucene全文检索、索引开发包深入研究的基础上,设计了多索引器索引方案,有效地减少了索引时间,提高了索引效率;在建立倒排索引时,通过对索引表中词条排序过程进行优化,改进排序方式,提高了化工词汇的查找效率;对于待索引的文档加入不同类型的权值,可以提高检索文档的准确度,适应化工专业搜索引擎的需求。
On the base of researching Lucene API of full text index and search, the project of muhi-indexer is designed to decrease the time of developing index effectively and improve the efficiency of index database; when inverted index database is being built, the efficiency of searching chemical terms can be improved by optimizing sorting process of term of index table; The indexed documents are added different types of parameters values to improve accuracy of searching chemical terms. This index database is more suitable for chemical search engine.
出处
《微计算机信息》
2009年第6期209-210,311,共3页
Control & Automation