摘要
在Lucene.Net的基础上,设计并实现了一种知识检索系统——基于奥运知识库的检索系统,该系统对Lucene.Net的中文分词功能进行了改进,即采用双字哈希机制的中文分词器,提高了对中文分词的支持度,并增加了新增生词的功能,能提高检索的准确度。
On the basis of Lucene.Net, designs and realizes a knowledge retrieval system, based on the knowledge base of the Olympic retrieval system, improves the function on the Chinese word segmentation in the Lucene.Net, that is a dual character Hash the Chinese word segmentation mechanism, enhance the Chinese-term support, and increases the functions of the new words, which can improve the accuracy of the search.
出处
《现代计算机》
2008年第11期124-125,128,共3页
Modern Computer
关键词
检索系统
LUCENE.NET
中文分词
双字哈希
Retrieval System
Lucene.Net
Chinese Word Segmentation
Double Character Hash Indexing