Abstract
This paper presents the implementation of a vertical search engine system based on Nutch. It focuses on how information retrieval and clustering are implemented with Lucene and Carrot2, and also describes the implementation of Chinese word segmentation and vertical information collection.
Source
《河北工业科技》
CAS
2012, Issue 3, pp. 155-157 (3 pages)
Hebei Journal of Industrial Science and Technology
Funding
Science and Technology Support Program of Hebei Province (12213516D)