Abstract
Because Web information retrieval returns an enormous number of pages, ranking the search results has become an important factor in search quality. This paper analyzes the PageRank algorithm used by the Google search engine and identifies two shortcomings: it favors old pages and it overlooks specialized sites. The paper improves PageRank by taking the page date into account, and experimental results show that the improved algorithm judges page importance more accurately. Finally, the paper describes how personalized services can be used to discover resources that match a user's interests.
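The abstract does not state the modified formula, so the following is only a minimal sketch under stated assumptions, not the paper's method. It shows standard power-iteration PageRank, plus one hypothetical way a page date could enter the computation: biasing the teleport vector toward newer pages with an exponential decay. The function names, the `half_life_days` parameter, and the decay form are all assumptions made for illustration.

```python
from datetime import date


def pagerank(links, damping=0.85, teleport=None, iters=100, tol=1e-10):
    """Power-iteration PageRank on a dict {page: [pages it links to]}."""
    pages = sorted(set(links) | {q for outs in links.values() for q in outs})
    n = len(pages)
    if teleport is None:
        # Standard PageRank: random jumps land uniformly on all pages.
        teleport = {p: 1.0 / n for p in pages}
    rank = dict(teleport)
    for _ in range(iters):
        # Rank held by pages without outlinks is redistributed via the teleport vector.
        dangling = sum(rank[p] for p in pages if not links.get(p))
        new = {p: (1 - damping + damping * dangling) * teleport[p] for p in pages}
        for p in pages:
            outs = links.get(p, [])
            for q in outs:
                new[q] += damping * rank[p] / len(outs)
        delta = sum(abs(new[p] - rank[p]) for p in pages)
        rank = new
        if delta < tol:
            break
    return rank


def freshness_teleport(dates, today, half_life_days=365.0):
    """Hypothetical date weighting (an assumption, not taken from the paper):
    a page's teleport weight halves for every half_life_days of age."""
    raw = {p: 0.5 ** ((today - d).days / half_life_days) for p, d in dates.items()}
    total = sum(raw.values())
    return {p: w / total for p, w in raw.items()}


# Illustrative data: two pages that link to each other and a hub linking to both.
links = {"old": ["new"], "new": ["old"], "hub": ["old", "new"]}
dates = {"old": date(2000, 1, 1), "new": date(2008, 1, 1), "hub": date(2005, 1, 1)}

plain = pagerank(links)
dated = pagerank(links, teleport=freshness_teleport(dates, today=date(2008, 6, 1)))
```

In this toy graph "old" and "new" have identical link structure, so plain PageRank cannot tell them apart, while the date-biased variant ranks the newer page higher. `dates` must contain an entry for every page in the graph.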
Source
《广西师范大学学报(自然科学版)》
CAS
Peking University Core Journals (北大核心)
2008, Issue 3, pp. 210-213 (4 pages)
Journal of Guangxi Normal University: Natural Science Edition
Funding
Supported by the National Natural Science Foundation of China (grants 60473115, 60773084, 60603023)