Abstract
When Internet users retrieve Web information through common search engines, the results often contain a large number of duplicate pages, which lowers search efficiency. Taking advantage of the maturity and good portability of the MD5 algorithm, this paper proposes an MD5-based algorithm for eliminating duplicate web pages. Experiments show that the algorithm removes duplicate pages effectively, that its time and space complexity are modest, and that it has strong practical value.
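The abstract does not give the details of the algorithm, so the following is only a minimal sketch of the general idea of MD5-based duplicate removal, not the authors' method. It assumes that two pages count as duplicates when their whitespace-normalized text is identical; the function names `page_fingerprint` and `remove_duplicates` and the `(url, text)` input format are hypothetical and chosen for illustration.

```python
import hashlib


def page_fingerprint(text: str) -> str:
    """Return the MD5 hex digest of a page's normalized text."""
    # Assumption: collapsing runs of whitespace is enough normalization
    # for copies that differ only in layout to hash to the same value.
    normalized = " ".join(text.split())
    return hashlib.md5(normalized.encode("utf-8")).hexdigest()


def remove_duplicates(pages):
    """Keep the first page seen for each distinct fingerprint.

    `pages` is an iterable of (url, text) pairs (a hypothetical format).
    """
    seen = set()   # fingerprints of pages already kept
    unique = []    # pages in original order, duplicates dropped
    for url, text in pages:
        fingerprint = page_fingerprint(text)
        if fingerprint not in seen:
            seen.add(fingerprint)
            unique.append((url, text))
    return unique
```

Under these assumptions each page is hashed once, and only one 128-bit digest is stored per distinct page, which is in line with the modest time and space cost the abstract reports. Exact hashing of this kind only catches pages whose normalized text is identical; near-duplicates that differ by a few words would need a different technique.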
Source
《电脑知识与技术》
2005, No. 10, pp. 15-16 (2 pages)
Computer Knowledge and Technology
Funding
Key Project of the Ministry of Education (教技司2001224)