摘要
随着Internet/Intranet的快速发展和普及,丰富的Web资源构成一个巨大的全球信息仓库。在海量数据空间中快速、准确地获取用户所需成为Web检索系统研究的焦点。将一种全新的网页自动分类技术引入WWW信息抽取领域,解决网上信息有效获取的问题。获取网站分类体系,设计的Web信息自动归类算法,可通过Web数据抽取机制以及Web信息分类技术实现检索结果的分类和层次化展示,使用户快捷准确地从WWW上获取所需信息。
As Internet/Intranet developing quickly and being popular,affluent Web resources have composed a huge global information warehouse. It becomes more and more important in information retrieval research that how to obtain the Web in- formation what users need among magnanimity data space fast and accurately. In order to improve the performance of search engine,this paper applies a new technology of Web page classification to the existing search engine. We obtain Website classification system and design arithmetic of Web information classification. Result can be classified into groups and displayed hierarchically by Web information extraction mechanism and users obtain what they need on WWW fast.
出处
《现代电子技术》
2008年第10期76-78,84,共4页
Modern Electronics Technique
关键词
信息检索
信息归类
分类体系
层次化展示
information retrieval
information classification
classification system
hierarchical display