Abstract
Database-backed FTP search engines in current use lack a standard resource document and support neither Chinese word segmentation nor dynamic data updates. To address these shortcomings, this paper proposes a design for an FTP search engine built on Lucene, a powerful full-text indexing engine toolkit. The engine automatically generates resource documents in a standard XML format, and it uses dictionary-based forward maximum matching for Chinese word segmentation while dynamically updating the full-text index in Lucene. The design also analyzes and retrieves search keywords that mix Chinese and English.
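As an illustration of the dictionary-based forward maximum matching that the abstract names, the following is a minimal sketch and not the paper's implementation. The function name `forward_max_match`, the tiny `DICTIONARY`, and the rule that treats a run of ASCII letters and digits as a single lowercased token (to suggest how mixed Chinese and English keywords might be handled) are all assumptions made for this example.

```python
def forward_max_match(text, dictionary, max_len=None):
    """Segment text by forward maximum matching against a word dictionary.

    Hypothetical helper for illustration only; the paper's actual analyzer
    is not described in the abstract.
    """
    if max_len is None:
        # Longest dictionary entry bounds the window; never less than 1.
        max_len = max([len(word) for word in dictionary] + [1])
    tokens = []
    i = 0
    while i < len(text):
        ch = text[i]
        if ch.isspace():
            i += 1
            continue
        # Assumption: a run of ASCII letters/digits is one English token.
        if ch.isascii() and ch.isalnum():
            j = i
            while j < len(text) and text[j].isascii() and text[j].isalnum():
                j += 1
            tokens.append(text[i:j].lower())
            i = j
            continue
        # Try the longest window first and shrink until a word matches.
        # A single character is always accepted as a fallback.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                i += size
                break
    return tokens


# Toy dictionary, invented for this example.
DICTIONARY = {"搜索", "引擎", "搜索引擎", "全文", "索引", "全文索引"}

print(forward_max_match("Lucene全文索引搜索引擎", DICTIONARY))
# prints ['lucene', '全文索引', '搜索引擎']
```

Because the longest candidate is tried first, "搜索引擎" is kept as one token rather than split into "搜索" and "引擎". Characters that match no dictionary entry fall back to single-character tokens.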
Source
《图书情报工作》
CSSCI
Peking University Core Journals (北大核心)
2006, No. 4, pp. 122-125 (4 pages)
Library and Information Service