摘要
基于开源搜索引擎Nutch,通过修改、调整和创新研制了文中介绍的6搜——一个专门搜索支持IPv6协议网站的专用IPv6搜索引擎。6搜的特点和创新点有:采集IPv6网页的速度在每秒100页以上;采集了54 195个IPv6网站,存储有2 000万IPv6网页,并且网页在不断更新和增加;有中文分词功能和自主创新的搜索网站功能。通过运行,6搜为用户提供了优质IPv6搜索服务;通过对6搜采集数据的分析,得到世界IPv6网站的分布。展现了IPv6网络的发展。
Based on open source search engine Nutch, through modification, tuning and innovation, 6sou, a search engine that only searches IPv6 protocol supporting web sites, is developed. It contains following features and innovations: 6sou crawls IPv6 web sites at more than 100 pages per second; 6sou has crawled 54195 web sites and has stored 20 million IPv6 web pages; the number of pages is increasing and the pages are being updated continuously; 6sou has Chinese word segmentation feature and independently innovated search web site feature. After going online, 6sou has provided users with high quality IPv6 search service. Through the analysis of data collected by 6sou, world IPv6 web site distribution is presented. It reflects the development of IPv6 network.
出处
《电子设计工程》
2011年第23期34-37,40,共5页
Electronic Design Engineering