期刊文献+

关于中国Web的大小、形状和结构 被引量:17

ON THE STRUCTURE OF CHINESE WEB 2002
下载PDF
导出
摘要 基于天网搜索引擎连续 4次的大规模搜集记录 ,揭示了中国 2 0 0 2年初中国 Web的大小、形状和结构 .主要结论包括有 :1中国大约有 5 0 0 0万网页和 5万个 Web站点 ;2全国不到 1/ 3的省市拥有 2 / 3强数目的网站 ;3中国网络是高度连通的 ,Web直径是 17;4网页入度分布很好地符合幂级数定律 ;5有确凿证据显示 。 Based on the data produced from four consecutive crawling processes, a comprehensive report on the structure of Chinese web as of dawn of the year 2002 is presented. The prominent results include ① the scale of Chinese web is about 50 million web pages and 50 thousand active websites; ② more than 2/3 websites are deployed in less than 1/3 big cities and provinces; ③ the Chinese web is highly connected with diameter of 17; ④ the distribution of in-degrees of web pages follows a power-law nicely; and ⑤ strong evidence exists for large amount of web communities that are formed autonomously.
出处 《计算机研究与发展》 EI CSCD 北大核心 2002年第8期958-967,共10页 Journal of Computer Research and Development
基金 国家重点基础研究发展规划项目 ( G19990 32 70 6 ) "创建世界一流大学工程"项目基金资助
关键词 WEB 网站 网页 互连结构 搜索引擎 INTERNET world wide web, web page, web site, hyperlink structure, web community
  • 相关文献

参考文献9

  • 1[2]赵江华,闫宏飞,王建勇等. 天网中的并行与分布处理. 北京大学,技术报告:PKU CS NET TR2002001, 2002. Http://162.105.80.88/crazysite/home/report(Zhao Jianghua, Yan Hongfei, Wang Jianyong et al. Parallel and distributed processing in WebGather(in Chinese). Peking University, Tech Rep: PKU CS NET TR2002001, 2002.Http://162.105.80.88/crazysite/home/report)
  • 2[3]Yan Hongfei, Wang Jianyong, Li Xiaoming. A dynamically reconfigurable model for a distributed web crawling system. In: 2001 Int'l Conf Computer Networks and Mobile Computing. Beijing, 2001. 157~162
  • 3[4]Marc Najork, Janet L Wiener. Breadth-first search crawling yields high-quality pages. In: Proc of the 10th Int'l World Wide Web Conf. Hongkong, 2001. 114~118
  • 4[5]Li Xiaoming, Wang Jianyong. WebGather: Towards quality and scalability of a web search service. In: Proc of the 10th Int'l World-Wide Web Conf. Hongkong, 2001
  • 5[7]中国互联网络信息中心(CNNIC). 信息服务. 2000. http://www.nic.edu.cn/INFO/cindex.html(CNNIC. Information service(in Chinese), 2000. http://www.nic.edu.cn/INFO/cindex.html)
  • 6[9]Andrei Broder, Ravi Kumar, Farzin Maghoul et al. Graph structure in the web: Experiments and models. In: Proc of the 9th Int'l World-Wide Web Conf. Amsterdam, 2000. 309~320
  • 7[10]Reka Albert, Hawoong Jeong, Albert-Laszlo Barabasi. Internet: Diameter of the world-wide web. Nature, 1999, 401: 130~131
  • 8[11]S R Kumar, P Raghavan, S Rajagopalan et al. Trawling the Web for emerging cyber-communities. In Proc of the 8th Int'l World-Wide Web Conf. Toronto, Canada, 1999. http://www8.org/w8-papers/4a-search-mining/trawling/trawling.html
  • 9[12]J Kleinberg. Authoritative sources in a hyperlinked environment. In: Proc of 9th ACM-SIAM Symp on Discrete Algorithms, 1998. Extended version in Journal of the ACM 1999, 46(5): 604~632

同被引文献120

引证文献17

二级引证文献80

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部