摘要
介绍大型搜索引擎应用的主流网页排序算法,改进其中的HITSS算法,提出一种基于网页分块技术的BHITS算法。BHITS算法通过对分好的页面板块进行主题标定,根据待采集信息的主题为不同主题的板块设定不同的权值实现相关度判定,在保持算法高效率的前提下,提高了算法区分链接重要性的能力。与相关算法的对比实验结果表明,BHITS算法网页排序的准确率明显优于其他算法。
This paper reviews dominating Webpage ranking algorithms, improves HITSS algorithm among of them, and proposes a new algorithm BHITS based on Webpage sub-block. BHITS algorithm uses the right values of different theme plates, the platea are calibrated by its topic and the right values of different subject sections are set according to the subjects of information to be collected, which improves the capacity of the hyperlinks distinguishing, while high efficiency is kept. From the eontrastive experiment with the related algorithms, the result shows that the precision of BHITS algorithm is significantly higher than that of other algorithms.
出处
《计算机工程》
CAS
CSCD
北大核心
2010年第11期64-66,72,共4页
Computer Engineering
关键词
网页排序
搜索引擎
WEB信息检索
权值
Webpage ranking
search engine
Web information retrieval
right value