摘要
FTP(file transfer protcol)搜索引擎的重点在于中文分词技术和检索技术。使用了一种面向FTP搜索优化的最大前向匹配分词算法,并将用户查询作为反馈来更新分词算法中所使用的字典,结合倒排索引技术实现了一个高性能的FTP搜索引擎的原型系统。压力测试结果表明此FTP搜索引擎具有很高的性能。
The key of FTP Search Engine is Chinese word segmentation and retrieval technique. We use a Forward Maximum Matching Chinese word segmentation algorithm optimized for FTP Search Engine, and take the retrieval keywords as feedback to update the dictionary affiliated with the segmentation algorithm. With the conbination of this scheme with the revert index technique, a high performance FTP Search Engine prototype is implemented. The results of load test have shown that the engine is of high performance.
出处
《南京邮电大学学报(自然科学版)》
2007年第3期67-70,75,共5页
Journal of Nanjing University of Posts and Telecommunications:Natural Science Edition
关键词
FTP
搜索引擎
分词
倒排索引
file transfer protcol
search engine
word wegment
revert index