期刊文献+

主题网络爬虫研究综述 被引量:9

Overview of Research on Topic-focused Web Crawler
下载PDF
导出
摘要 随着人们对信息资源的个性化需求不断加大,主题网络爬虫应时而生。阐述主题网络爬虫定义及工作原理;介绍了主题网络爬虫研究进展,对主题网络爬虫爬行策略、网页抓取优先级以及系统设计实现进行阐述;总结当前研究的不足,对未来研究方向进行了展望。 With the increase of people’s personalized demand for information resources,topic-focused web crawler emerged at the right time.The topic-focused web crawler and its working principle are stated.The research progress of theme web crawler is systematically analyzed,and three fields of topic-focused web crawler crawling strategy,web page crawling priority and design and implementation oftopic-focused web crawler system are expounded.The deficiencies of current research are summarized and the future research direction is prospected.
作者 左薇 张熹 董红娟 于梦君 ZUO Wei;ZHANG Xi;DONG Hong-juan;YU Meng-jun(School of Professional and Continuing Education,Yunnan University;School of Information,Yunnan University,Kunming 650000,China)
出处 《软件导刊》 2020年第2期278-281,共4页 Software Guide
基金 云南大学职业与继续教育学院一般项目(YK1704ZJ)。
关键词 主题网络爬虫 主题爬虫 搜索引擎 topic-focused web crawler topic-focused crawler search engine
  • 相关文献

参考文献12

二级参考文献89

  • 1汪涛,樊孝忠.链接分析对主题爬虫的改进[J].计算机应用,2004,24(B12):174-176. 被引量:12
  • 2钱榕,徐新华,郑莹,杨炳儒.智能专题化信息搜集Crawler[J].计算机工程,2006,32(3):57-59. 被引量:4
  • 3赵佳鹤,王秀坤,刘亚欣.基于语义分析的主题信息采集系统的设计与实现[J].计算机应用,2007,27(2):406-408. 被引量:14
  • 4斯图尔特 G W.矩阵计算引论[M].王守根等译.上海:科学技术出版社,1980.
  • 5Rungsawang A, Angkawattanawit N. Learnable Topic-specific Web Crawler[J]. Journal of Network and Computer Applications, 2005, 28(2): 97-114.
  • 6Chakrabhik S, Vandenburg M, Dom B. Focused Crawling: A New Approach to Topic-specific Web Resource Discovery[C]//Proceedings of the 8th International World-Wide Web Conference. Toronto, Canada: [s. n.], 1999.
  • 7Liu Hongyu, MIuOS E, Janssen J. Probabilistic Models for Focused Web Crawling[C]//Proceedings of the 6th Annual ACM International Workshop on Web Information and Data Management. New York, USA: ACM Press, 2004.
  • 8Florescu D, Levy A, Mendelzon A. Database Techniques for the World-Wide Web: A Survey[J]. SIGMOD Record, 1998, 27(3): 59-74.
  • 9Wei Jiying, Wen Jirong. instance-based Schema Matching for Web Databases by Domain-specific Query Probing[C]//Proceedings of the 30th international Conference on VLDB. Toronto, Canada: [s. n.], 2004.
  • 10[7]Page L,Brin S,Motwani R,et al. The PageRank citation ranking:Bringing order to the Web [ EB/OL]. http://www-db. stanford. edu/~ backrub/pageranksub. ps, 1998 -01 - 20/2003 - 03 - 25.

共引文献110

同被引文献55

引证文献9

二级引证文献15

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部