期刊文献+

主题网络蜘蛛搜索策略贪婪性解决方法 被引量:4

One Solution About Topic Web Crawler’s Greedy Search Strategy
下载PDF
导出
摘要 主题网络蜘蛛搜索策略是专业搜索引擎的核心技术。但是目前的主题搜索算法往往存在很大贪婪性,难以在全局范围内找到最优解。通过比较分析发现Best-First算法虽然有它的不足,但是它在几种算法中表现的性能最优。故以Best-First算法为基础,提出了BS-BS算法。对BS-BS算法进行性能评价,发现应用此算法搜索不但“召回率”有所提高,还能在一定程度上找到全局范围内的最优解。 Topic web crawler search strategy is the core of professional search engine technology. However, the current topic search algorithms always exist large greedy It is difficult to find optimal solutions in the overall situation. Through comparative analysis found that despite Best-First algorithm having shortcomings, but its performance is optimal in several algorithms So based on Best-First algorithms it raised BS-BS algorithms. Then it evaluated BS-BS algorithm .And found that not only 'recall rate' had improved, but could get the optimal solutions in the overall situation.
出处 《微电子学与计算机》 CSCD 北大核心 2006年第z1期278-280,共3页 Microelectronics & Computer
关键词 主题网络蜘蛛 Best-First算法 召回率 Topic web crawler, Best-first algorithm, Recall ratio
  • 相关文献

参考文献4

  • 1[1]J Cho,H Garcia-Molina,L Page.Crawling through URL ordering.Computers networks and ISDN systems[c].1998,30:161~172
  • 2[2]Srinivasan P,Menczer F,Pant G.A general evaluation framework for topical crawlers.Information Retrieval[c].2004
  • 3[3]Niram Angkawattanawit,Arnon Rungsawang.Learnable topic-specific web crawler.Massive Information & Knowledge Engineering[c].2002
  • 4[4]Pant G,Srinivasam P,Menczer F.Exploration versus ex ploitation in topic driven crawlers.In Proceedings of the WWW-02 Workshop on Web Dynamics[OL].http://www.muscat.com/martin/stem.html

同被引文献32

引证文献4

二级引证文献22

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部