
The Application of Non-Greedy Policy in WEB Search
Abstract: Traditional topic-specific search engines use a greedy link-selection policy, which leads to a low overall reward rate. This paper proposes a non-greedy link-selection policy and, building on it, a heuristic search algorithm. Experiments that search for papers on the computer science department Web sites of four well-known foreign universities show that the new algorithm effectively improves search efficiency.
Source: Journal of Minzu University of China (Natural Sciences Edition), 2004, No. 3, pp. 235-239, 257 (6 pages)
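Funding: Supported by the National Natural Science Foundation of China (60203017) and the National Special Fund for Basic Science and Technology Research (2001DEA20016-02-04)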
Keywords: non-greedy policy; Web search; Web spider; topic-specific search engine; heuristic search strategy

References (12)

  • [1] LAWRENCE S, GILES L. Accessibility and distribution of information on the Web[J]. Nature, 1999, 400(8): 107-109.
  • [2] BREWINGTON B E, CYBENKO G. How dynamic is the Web[A]. Proc of the 9th International World Wide Web Conference[C], 2000.
  • [3] ESTER M, GROB M, KRIEGEL H. Focused Web crawling: a generic framework for specifying the user interest and for adaptive crawling strategies[A]. Proc of the International Conference on Very Large Database (VLDB'01)[C], 2001.
  • [4] BRA D P, HOUBEN G, KORNATZKY, et al. Information retrieval in distributed hypertexts[A]. Proc of the 4th RIAO Conference[C], 1994, 481-491.
  • [5] HERSOVICI M, HEYDON A, MITZENMACHER M, NAJORK Y S, PELLEG D, SHTALHAN M, UR S. The shark search algorithm - An application: Tailored Web site mapping[A]. Proc of the 7th International World Wide Web Conference[C], 1998.
  • [6] AGGARWAL C, AI GARAWI F, YU S P. Intelligent crawling on the World Wide Web with arbitrary predicates[A]. Proc of the 10th International World Wide Web Conference[C], 2001.
  • [7] CHO J, GARCIA MOLINA H, PAGE L. Efficient crawling through URL ordering[J]. Computer Networks, 1998, 30(2): 161-172.
  • [8] CHAKRABARTI S, VAN DEN BERG M, DOM B. Focused crawling: a new approach to topic specific Web resource discovery[J]. Computer Networks, 1999, 31(11): 1623-1640.
  • [9] RENNIE J, MCCALLUM A. Using reinforcement learning to spider the Web efficiently[A]. Proc of the International Conference on Machine Learning (ICML99)[C], 1999.
  • [10] PANT G, SRINIVASAN P, MENCZER F. Exploration versus exploitation in topic driven crawler[A]. Proc of The WWW 02 Workshop on Web Dynamics[C], 2002.
