Abstract
This paper outlines the need for in-site full-text search and presents a full-text search system model based on Lucene. The model offers clear advantages over both Google's in-site search and traditional database search. It introduces improved Chinese word segmentation and allows the ordering of the final results to be customized. The model is able to ensure that the first 100 records retrieved are those that best meet the user's needs.
Source
《计算机应用与软件》
CSCD
PKU Core Journal
2008, Issue 10, pp. 6-8 (3 pages)
Computer Applications and Software
Funding
Supported by the National High Technology Research and Development Program of China (863 Program) (2003AA118070)
Keywords
Full-text search
Lucene
Chinese word segmentation
Information extraction
Full-text search engine; Lucene; Chinese word segmentation; Information retrieval