期刊文献+

基于时间链接分析的页面排序优化算法 被引量:6

Temporal link-analyze based on Web page ranking algorithm
下载PDF
导出
摘要 传统的页面排序算法偏重于旧网页,使得一些旧的页面经常出现在检索结果的前面。为了改进此类算法,引入时间链接分析,使用爬虫抓起页面时HTTP协议反馈回来的修改时间作为页面和链接的时间,并综合考虑页面的出入链接个数和时间来计算页面的权重值。开发出的WTPR算法能使新网页集在排序中上升,高质量的旧网页比普通的旧网页能获得较高的排序值。 The traditional ranking algorithm favors the old pages, which makes old pages always appear in the top of the ranking results when pages are ranked according to the dynamic Web by the static ranking algorithm. In order to improve these algorithms, this paper introduced the temporal link-analyze. The algorithm used the last modification time returned by the HTTP response as the timestamp of nodes and links concerned. And integrated the weight of the in-link and out-link also in order to compute the overall weight of the pages. The WTPR algorithm developed can make the old pages decline and new pages rose in the ranking result, while the old pages of high-quality get higher rank value than common old pages.
出处 《计算机应用研究》 CSCD 北大核心 2009年第7期2438-2441,2477,共5页 Application Research of Computers
基金 国家自然科学基金资助项目(60773049)
关键词 页面排序算法 网页 网络挖掘 pagerank algorithm Web pages Web data mining
  • 相关文献

参考文献19

  • 1NTOULAS A, CHO J, OLSTON C. What's new on the Web: the evolution of the Web from a search engine perspective [ C ]//Proc of the 13th International Conference on World Wide Web. New York: ACM Press, 2004:1-12.
  • 2BRIN S, PAGE L. The anatomy of a large-scale hypertextual Web search engine[ C]//Proc of the 7th International World Wide Web Conference. 1998 : 107-117.
  • 3YU P S, LI Xin, LIU Bing. On the temporal dimension of search [ C]//Proc of WWW 2004. New York: [ s. n. ], 2004:448-449.
  • 4YU P S, LI Xin, LIU Bing. Adding the temporal dimension to search: a case atudy in publication search [ C ]//Proc of WI' 05. 2005.
  • 5XING Wen-pu, GHORBANI A. Weighted pagerank algorithm [ C ]// Proc of the 2nd Annual Conference on Communication Networks and Services Research. 2004.
  • 6BERBERICH K, VAZIRGIANNIS1 M, WEIKUM G. T-Rank:timeaware authority ranking [ C ]//Proc of WAW 2004. 2004 : 131 - 142.
  • 7KLEINBERG J M. Authoritative sources in a hyperlinked environment [J]. Journal of the ACM, 1999,46(5) :604-632.
  • 8CHO J, ROY S. Impact of Web search engines on page popularity [ C ]//Proc of WWW. New York : [ s. n. ], 2004:20- 29.
  • 9TOYODA M, KITSUREGAWA M. What' s really new on the Web? Identifying new pages from a series of unstable Web snapshots [ C ]// Proc of IW3C2. 2006.
  • 10BAR-YOSSEF Z, BRODER A Z. Sic transit gloria telae: towards an understanding of the Web' s decay [ C ]//Proc of WWW2004. New York : [ s. n. ] ,2004.

二级参考文献17

  • 1[1]J Cho, H Garcia-Molina, L Page. Efficient crawling through URL ordering. The 7th World Wide Web Conference, Brisbane, 1998
  • 2[2]S Brin, L Page. The anatomy of a large-scale hypertexual web search engine. The 7th World Wide Web Conference, Brisbane, 1998
  • 3[3]Taher H Haveliwala. Efficient computing of PageRank. Stanford Database Group, Tech Rep, 1999
  • 4[4]Monika Henzinger. Link analysis in web information retrieval. IEEE Data Engineering Bulletin, 2000, 23(3): 3~8
  • 5[5]Dell Zhang, Yisheng Dong. An efficient algorithm to rank web resources. Computer Netwoks, 2000, 33: 449~455
  • 6[6]Lei Ming, Wang Jianyong .et al.. Improved relevance ranking in web gather. Journal of Computer Science and Technology, 2001, 16(5): 410~417
  • 7[7]S Lawrence, C L Giles. Accessibility of information on the web. Nature, 1999, 400: 107~109
  • 8Yates R B,Neto B R.Moderm Information Retrieval[M].New York,USA:Addison Wesley,1999.
  • 9Chakrabarti S,Dom B,Gibson D.Hypersearching the Web[Z].http://www.sciam.com/,1999-06.
  • 10Brin S,Page L.The Anatomy of a Large-scale Hypertextual Web Search Engine[C].Proceedings of the 7th ACM-WWW International Conference.Brisbane:ACM Press,1998:107-117.

共引文献91

同被引文献80

引证文献6

二级引证文献25

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部