Abstract
To locate FTP resources quickly, this paper proposes DGKAD, a Kademlia model that combines a double-letter inverted index with geographic location information. Adding geographic location information to the Kademlia (KAD) network compensates for the mismatch between the overlay's logical topology and the underlying physical topology, which improves network communication efficiency. In addition, because an FTP search engine retrieves file names, a double-letter inverted index avoids word segmentation and improves retrieval precision. Simulation experiments show that, compared with a standard KAD based on Chinese word segmentation (CKAD), the proposed algorithm reduces resource location time by about 50% and improves retrieval recall by about 30%.
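The idea of indexing file names by two-character units instead of segmented words can be illustrated with a minimal sketch. This is not the paper's implementation: the function names `bigrams`, `build_index`, and `search`, the lowercasing step, and the final substring check are all assumptions made for illustration, and the KAD routing and geographic components are not modeled.

```python
from collections import defaultdict


def bigrams(text: str) -> list[str]:
    """Return the overlapping two-character substrings of text (hypothetical helper)."""
    text = text.lower()
    return [text[i:i + 2] for i in range(len(text) - 1)]


def build_index(file_names: list[str]) -> dict[str, set[int]]:
    """Map every bigram to the set of positions of the file names containing it."""
    index: dict[str, set[int]] = defaultdict(set)
    for doc_id, name in enumerate(file_names):
        for gram in bigrams(name):
            index[gram].add(doc_id)
    return index


def search(index: dict[str, set[int]], file_names: list[str], query: str) -> list[str]:
    """Return file names containing the query as a substring, ignoring case."""
    q = query.lower()
    if len(q) < 2:
        # Queries shorter than one bigram cannot use the index; scan linearly.
        return [name for name in file_names if q and q in name.lower()]
    # Candidates must contain every bigram of the query.
    candidates = set.intersection(*(index.get(g, set()) for g in bigrams(q)))
    # Sharing all bigrams does not guarantee adjacency, so verify the substring.
    return [file_names[i] for i in sorted(candidates) if q in file_names[i].lower()]


files = ["西华大学学报.pdf", "大学物理讲义.doc", "kademlia_paper.pdf"]
idx = build_index(files)
print(search(idx, files, "大学"))
print(search(idx, files, "KADEM"))
```

No dictionary or word segmenter is involved: a Chinese or Latin-script query is split the same way as the file names, so matching depends only on shared character pairs. In a KAD deployment each bigram would presumably be hashed to a DHT key, but the abstract does not describe that mapping, so it is left out here.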
Source
Journal of Xihua University: Natural Science Edition
CAS
2013, No. 3, pp. 50-53, 76 (5 pages in total)
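Funding
National Natural Science Foundation of China (61271413)
Key Project of the Sichuan Provincial Department of Education (08ZA023)
Open Fund of the Provincial Key University Laboratory of Network Intelligent Information Processing, Xihua University (SGXZD1002-10)
Graduate Innovation Fund of Xihua University (ycjj201228)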