期刊文献+

ChangeSpider:一个自适应的网页信息跟踪系统 被引量:1

ChangeSpider:An Adaptive Tracing System for the Information Changing on the Web
下载PDF
导出
摘要 网页信息的更新是网络一个非常重要的性质。同网络的其他应用类似,随着WWW信息内容更新的不断加快,如何有效地跟踪特定网站和页面的更新情况日渐成为人们关心的课题。论文讨论一个自适应的网页信息跟踪系统ChangeSpider,研究其体系结构、关键技术等方面的内容。实验表明ChangeSpider能够有效地跟踪网页的信息变化,及时地将变化的内容提交给用户。 The information changing of web pages is a very important property of the web.With the fast changing content on the web,similar to some other web,how to trace the update of the specified pages effectively is very important. In this paper,an adaptive tracing system for the web is discussed together with its architecture,key technologies etc. With the results of experiments,we can see that the ChangeSpider can trace the web changes effectively as well as notify the changes to the user in time.
出处 《计算机工程与应用》 CSCD 北大核心 2003年第34期160-164,共5页 Computer Engineering and Applications
基金 国家自然科学基金(编号:NFSC60131160743)
关键词 信息更新 更新频率信息采集 信息分发 Information Changing,Update Frequency,Information Gathering,Information Delivery
  • 相关文献

参考文献8

  • 1[1]Vijay Boyapati, Kristie Chevrier,Avi Finkel et al. Chip Whitmer WhizBang! Labs: ChangeDetector(TM ) :A Site-Level Monitoring Tool for the WWW, WWW2002, May7~ 11,2002, Honolulu, Hawaii, USA.ACM 1-58113-449-5/02/0005
  • 2[2]E Coffman Jr,Z Liu,R R Weber. Optimal robot scheduling for web search engines[R].Technical report,INRIA, 1997
  • 3[3]Liu L, CPu, W Tang. WebCQ: Detecting and Delivering Information Changes on the Web.In the Proceedings of International Conference on Information and Knowledge Management(CIKM).Washington D C:ACM Press,2000
  • 4[4]Chen Y -F et al.The AT&T Internet Difference Engine:Tracing and Viewing Changes on the web. World Wide Web, 1998; 1 (1)
  • 5[5]Junghoo Cho, Hector Garcia-Molina. Estimating Frequency of Change.Submitted for Publication,2000-02
  • 6[6]CongPeng Ma,Shuang Gao,GuangWen Yang. THMonitor:A new Schema of WWW updating monitor. ICA3PP 2002
  • 7[7]http://www.netmind.com
  • 8[8]Mercator: A Scalable, Extensible Web Crawler Allan Heydon and Marc Najork Compaq Systems Research Center

同被引文献15

  • 1孟涛,闫宏飞,王继民.一个增量搜集中国W eb的系统模型及其实现[J].清华大学学报(自然科学版),2005,45(S1):1882-1886. 被引量:7
  • 2孟涛,闫宏飞,王继民.Web网页信息变化的时间局部性规律及其验证[J].情报学报,2005,24(4):398-406. 被引量:8
  • 3程菲,汪建海,罗键.增量更新Crawler进行Web收集方法研究[J].计算机工程与科学,2006,28(12):28-30. 被引量:2
  • 4中国互联网络信息中心.第27次中国互联网络发展状况统计报告[R],2011.
  • 5CHOJ,GARCIA-MOL1NA H. The evolution of the Web and implications for an incremental crawler[A].San Francisco,ca:morgan Kaufmann Publishers,2000.
  • 6CHO J,GARCIA-MOLINA H. Effective page refresh policies for Web crawlers[J].ACM Transactions on Database Systems,2003,(04).
  • 7CHO J,GARCIA-MOLINA H. Estimating frequency of change[J].ACM Transactions on Internet Technology,2003,(03).
  • 8周艳;吴跃;鲁珂.Web搜索的网页更新检测算法研究[A]2009年西南地区网络与信息系统学术年会,2009.
  • 9CASTILLO C,BAEZA-YATES R. A new model for Web craw ling[A].2002.
  • 10CHO J,NTOULAS A. Effective change detection using sampling[A].San Francisco:Morgan Kaufmann Publishers,2002.

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部