期刊文献+

一种基于DGKAD模型的FTP搜索引擎索引算法

An Indexing Algorithm Based on DGKAD Model and FTP Search Engine
下载PDF
导出
摘要 为解决FTP资源快速定位问题,提出了基于双字母倒排索引和引入地理位置信息的Kademlia模型(DGKAD)。在Kademlia(KAD)网络中加入了地理位置信息,弥补了KAD网络的层叠网逻辑拓扑和物理拓扑不匹配的问题,提高了网络通信效率;同时,鉴于FTP搜索引擎的检索对象是文件名,使用双字母倒排索引可以避免分词,提高了检索精确率。模拟实验表明,与基于中文分词的标准KAD(CKAD)相比,该算法的资源定位时间减少了约50%,检索查全率提高了约30%。 In order to solve the problem of FTP resource rapid positioning, this article mainly proposed an improved model, in which an kademlia model based on double-letters inverted the index introduction and geographical location information( abbreviation for DGKAD). In order to improve the efficiency of the network, Kademlia (KAD) was added to the network location information, it made up for logical topology and physical topology mismatch problems of KAD overlay network and improved the efficiency of network communication. At the same time, because the FTP search engine retrieval object was the name of the file, the use of double-letters inverted index could avoid word segmentation, and improved retrieval precision. Simulation results show that with an standard KAD based on chinese word segmentation (abbreviation for CKAD) , the algorithm of resource locating time was reduced by approximately 50% , retrieval accuracy increased by about 30%.
出处 《西华大学学报(自然科学版)》 CAS 2013年第3期50-53,76,共5页 Journal of Xihua University:Natural Science Edition
基金 国家自然科学基金项目(61271413) 四川省教育厅重点项目(08ZA023) 西华大学网络智能信息处理省重点高校实验室开放基金项目(SGXZD1002-10) 西华大学研究生创新基金项目(ycjj201228)
关键词 双字母倒排索引 KADEMLIA FTP搜索引擎 double-letters inverted index Kademlia FTP search engine
  • 相关文献

参考文献10

二级参考文献40

共引文献33

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部