摘要
专业搜索引擎提供特定主题的信息检索服务,是新一代搜索引擎的发展方向之一,而网页主题相关度分析是专搜索引擎的核心技术,它指导着robot进行有价值的搜索,专门搜索与主题相关的页面;提出一种综合的网页主题相关度析方法,方法同时对网页内容价值和链接价值进行了考察,从而保证了robot搜索的网页与主题有着较高的相关度;在网内容价值评价时,对传统的方法进行了改进,新的方法能高好的实现。该方法也用于服装行业的搜索引擎,效果明显。
Special search engine provides service of informational retrieval in special area, and this technology is one of the hot topic in search engine recent years. And the analysis of related subject is the key of the special search engine, it conducts the net robot search valuable pages, only search the related subject page. A methods ofintegrated page related subject evaluation is proposed, which consider the page content value and page link value in the same time, and guarantee the web robot do a value search. When consider the page content value, the traditional method is improved, the new method is more suitable to realization and the clothing profession has adopted this technology, the efficiency is valuable.
出处
《计算机工程与设计》
CSCD
北大核心
2008年第23期6020-6022,6046,共4页
Computer Engineering and Design
关键词
主题爬虫
专业搜索
网页内容分析
链接分析
特征词
focused robot
special search
web-text evaluation
link analysis
special words