期刊文献+

基于Websphinx网络爬虫的研究与改进

Research And Improvement of Network Reptile Based on Websphinx
下载PDF
导出
摘要 搜索引擎技术随着互联网的日益壮大而飞速发展。它成功的商业运作也造就了Google、百度等这样的商业奇迹。作为搜索引擎的重要组成部分,网络爬虫的爬行效率对搜索引擎至关重要。基于Websphinx对网络爬虫进行了相关介绍,概述了Websphinx的结构框架、搜索方式及提出了一些看法。 With the development of intemet technical, Search engine is becoming more and more powerful. There are also some fantastic success cases like google and baidu. Network Reptile, as an important part of search engine, play an irreplaceable role in it, especially the performance, here we discuss about Network Reptile based on an exist open source "Websphinx", explain the structure and search style of Websphinx, and show out some new opinion.
作者 周翔 ZHOU Xiang (Tongji University Software College, Shanghai 200000, China)
出处 《电脑知识与技术》 2008年第10期75-77,83,共4页 Computer Knowledge and Technology
关键词 搜索引擎 Websphinx网络爬虫 超时 智能化 search engine websphinx network reptileslnterface overtime intelligent
  • 相关文献

参考文献1

二级参考文献11

  • 1[1]R Botafogo, E Rivlin, B Shneiderman. Structural analysis of hypertext: Identifying hierarchies and useful metrics. ACM Trans on Information System, 1992, 10(2): 142~180
  • 2[2]J Carriere, R Kazman. WebQuery: Searching and visualizing the Web through connectivity. The 6th Int'l WWW Conf (WWW6), Santa Clara, 1997
  • 3[3]Jon M Kleinberg. Authoritative sources in a hyperlinked environment. The 9th Annual ACM-SIAM Symp on Discrete Algorithms, California, 1997
  • 4[4]K Bharat, M R Henzinger. Improved algorithms for topic distillation in a hyperlinked environment. The 21st Int'l ACM SIGIR Conf on Research and Development in Information Retrieval (SIGIR 98), Melbourne, 1998
  • 5[5]S Brin, L Page. The anatomy of a large-scale hypertextual web search engine. The 7th Int'l WWW Conf (WWW7), Brisbane, Australia, 1998
  • 6[6]L Page, S Brin .et al.. The pagerank citation ranking: Bringing order to the web. 1998. http://dbpubs.stanford.edu:8090/pub/1999-66
  • 7[7]N Craswell, D Hawking, S E Robertson. Effective site finding using link anchor information. The SIGIR 2001, Louisiana, 2001
  • 8[8]Gao Jianfeng .et al.. TREC-10 Web track experiments at MSRA. The 10th Text Retrieval Conf, Gaithersburg, 2001
  • 9[9]S Chakrabarti, B Dom, D Gibson .et al.. Automatic resource compilation by analyzing hyperlink structure and associated text. The 7th Int'l WWW Conf (WWW7), Brisbane, 1998
  • 10[10]B D Davison. Topic locality in the web. The 23rd Int'l ACM SIGIR Conf on Research and Development in Information Retrieval(SIGIR 2000), Athens, 2000

共引文献21

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部