摘要
基于天网搜索引擎连续 4次的大规模搜集记录 ,揭示了中国 2 0 0 2年初中国 Web的大小、形状和结构 .主要结论包括有 :1中国大约有 5 0 0 0万网页和 5万个 Web站点 ;2全国不到 1/ 3的省市拥有 2 / 3强数目的网站 ;3中国网络是高度连通的 ,Web直径是 17;4网页入度分布很好地符合幂级数定律 ;5有确凿证据显示 。
Based on the data produced from four consecutive crawling processes, a comprehensive report on the structure of Chinese web as of dawn of the year 2002 is presented. The prominent results include ① the scale of Chinese web is about 50 million web pages and 50 thousand active websites; ② more than 2/3 websites are deployed in less than 1/3 big cities and provinces; ③ the Chinese web is highly connected with diameter of 17; ④ the distribution of in-degrees of web pages follows a power-law nicely; and ⑤ strong evidence exists for large amount of web communities that are formed autonomously.
出处
《计算机研究与发展》
EI
CSCD
北大核心
2002年第8期958-967,共10页
Journal of Computer Research and Development
基金
国家重点基础研究发展规划项目 ( G19990 32 70 6 )
"创建世界一流大学工程"项目基金资助