期刊文献+

个性化Web采集算法研究及其应用 被引量:1

Based on Customized Web Crawling Arithmetic Study and Application
下载PDF
导出
摘要 全面详细地研究了用户个性化W eb信息采集算法,并提出了一个基于指定站点的用户个性化W eb信息采集模型;实验结果表明,在一个用户指定的站点内,该模型可以快速的采集到根据用户需求定制的页面,并存储到本地的文件系统中.这个采集模型具有较强的实用价值,可以为创建某方面的资源库快速的采集信息. This paper study customized crawling arithmetic roundly and in detail, and raise a based on customized Web crawling model , The experimental result indicates the model can crawl web pages requested by user quickly and store in local file system. This web crawler model has stronger practical value, it can gather information in order to establish the resources bank of some respect fast.
作者 刘彤
出处 《贵州大学学报(自然科学版)》 2006年第3期305-313,共9页 Journal of Guizhou University:Natural Sciences
基金 广东省科技攻关项目(A10202001) 广州市科技攻关项目(20004Z2-D0091)
关键词 WEB 信息采集 个性化采集算法 Web web crawling customized crawling arithmetic
  • 相关文献

参考文献16

  • 1刘彤.基于用户个性化的Web信息采集技术研究华南理工大学硕士学位论文.
  • 2R MILLER, K BHARAT. Sphinx: A framework for creating personal, site-specific web crawlers [ R ]. In Proceedings of the seventh conference on World Wide Web, Brisbane, Australia, April 1998.
  • 3CLAUDIO SCORDINO CRAWLING. the Web:problems and techniques[ M ]. Ph D Student. May 2004 Computer Science Department-University of Pisa.
  • 4SOUMEN CHAKRABARTI, MARTIN VAN DEN BERG, BYRON DOM: Focused Crawling: A New Approach to Topic-Specific Resource Discovery[ M]. IBM Almaden Research Center.
  • 5J CHO and H GARCIA-MOLINA. The evolution of the web and implications for an incremental crawler[ EB/OL]. In Proceedings of the 26th International Conference on Very Large Databases, 2000. http ://rose. cs. ucla. edu/? cho/papers/cho - evol. pdf.
  • 6J CHO and H GARCIA-MOLINA. Parallel crawlers[ R]. In Proceedings of the llth Intemational World Wide Web Conference, 2002.
  • 7J CHO, H GARCIA-MOLINA,and L PAGE. Efficient crawling through URL ordering[ R ]. In Proceedings of the 7th International World Wide Web Conference, pages 161 -172,Brisbane, 1998. http ://www7. scu. edu. au/programme/fullpapers/1919/com1919, htm.
  • 8M NAJORK and J L WIENER. Breadth-First Crawling yields high-quality pages[ R]. In Proceedings of the 10th International World Wide Web Conference, pages 114 - 118, May 2001.
  • 9MICHAEL CHAU,HISINCHUN CHEN. Personalized and Vocused Web Spiders.
  • 10M DILIGENTI, F COETZEE, S. LAWRENCE, C L GILES, M GORI. Focused crawling using context graphs[ R]. In Proceedings of 26th International Conference on Very Large Databases (VLDB), pages 527. 534, Cairo, Egypt, September 2000.

同被引文献7

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部