Abstract
To address the limited URL string-handling capacity of a search engine's back-end mobile agents when they fetch information resources from the Internet, this paper proposes a hash algorithm for fast data storage and retrieval. By establishing a one-to-one correspondence between real values and key values, the algorithm quickly converts the strings entering the mobile agent's queue into a linear table structure, which improves the queue's ability to process strings. The corresponding functionality was implemented on top of the Heritrix framework using the Eclipse 3.3 development platform. The experimental results show that, once the fast hash algorithm is added, the mobile agent's search efficiency and fetching speed improve significantly.
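The idea of mapping queued URL strings to keys in a linear table can be illustrated with a minimal sketch. It is not the paper's algorithm, whose details the abstract does not give. The FNV-1a hash function, the linear-probing collision strategy, the fixed table capacity, and the names `fnv1a_64` and `UrlFrontier` are all assumptions chosen for illustration.

```python
from collections import deque

# 64-bit FNV-1a constants (assumed hash function; the paper does not name one).
FNV_OFFSET = 0xCBF29CE484222325
FNV_PRIME = 0x100000001B3
MASK64 = 0xFFFFFFFFFFFFFFFF


def fnv1a_64(text: str) -> int:
    """Return the 64-bit FNV-1a hash of the UTF-8 bytes of text."""
    h = FNV_OFFSET
    for byte in text.encode("utf-8"):
        h ^= byte
        h = (h * FNV_PRIME) & MASK64
    return h


class UrlFrontier:
    """Hypothetical FIFO crawl queue backed by an open-addressing hash table."""

    def __init__(self, capacity: int = 1024) -> None:
        # Flat array ("linear table") holding each URL seen so far.
        self._slots = [None] * capacity
        self._count = 0
        self._queue = deque()

    def _find_slot(self, url: str) -> int:
        """Return the index holding url, or the first empty slot on its probe path."""
        size = len(self._slots)
        i = fnv1a_64(url) % size
        while self._slots[i] is not None and self._slots[i] != url:
            i = (i + 1) % size  # linear probing on collision
        return i

    def add(self, url: str) -> bool:
        """Enqueue url if it has not been seen; return True when it was added."""
        i = self._find_slot(url)
        if self._slots[i] == url:
            return False  # duplicate: already queued or fetched
        # Keep one slot empty so that probing always terminates.
        if self._count + 1 >= len(self._slots):
            raise OverflowError("table full; a real crawler would resize")
        self._slots[i] = url
        self._count += 1
        self._queue.append(url)
        return True

    def next_url(self):
        """Pop the next URL to fetch, or return None when the queue is empty."""
        return self._queue.popleft() if self._queue else None
```

In this sketch each incoming URL costs one hash computation and, on average, a short probe sequence, instead of a string comparison against every entry already in the queue.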
Source
Journal of South-Central University for Nationalities (Natural Science Edition) (《中南民族大学学报(自然科学版)》)
CAS
2013, Issue 3, pp. 90-93 (4 pages)
Funding
Supported by the Fundamental Research Funds for the Central Universities, South-Central University for Nationalities (ZZQ10011)
Keywords
search engine
mobile agent
hash algorithm