
Implementation of a Relevance Method Based on Web Page Content and Link Value (cited by: 4)

Realization of method of related subject based on page content value and link value
Abstract: Specialized search engines provide information retrieval for a specific topic and are one of the development directions for the next generation of search engines. Page topic relevance analysis is their core technology: it guides the robot (crawler) toward valuable pages so that it fetches only pages related to the topic. This paper proposes a combined page topic relevance analysis method that evaluates both the content value and the link value of a page, which ensures that the pages the robot crawls are highly relevant to the topic. For content value evaluation, the traditional method is improved so that the new method is easier to implement. The method has also been applied in a search engine for the clothing industry, where it produced a clear improvement.
Source: Computer Engineering and Design (《计算机工程与设计》), CSCD, Peking University Core Journal, 2008, No. 23, pp. 6020-6022, 6046 (4 pages)
Keywords: focused crawler; specialized search; web page content analysis; link analysis; feature words
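The abstract says the method scores a page on both content value and link value, but this record gives no formulas. The sketch below is therefore only an illustration under stated assumptions, not the authors' method: content value is taken as the cosine similarity between page term frequencies and a set of weighted topic feature words, link value as the share of anchor-text terms that are feature words, and the two are combined linearly. The function names, the weight `alpha`, and the sample feature words are all hypothetical.

```python
import math
from collections import Counter


def content_score(page_terms, topic_weights):
    """Cosine similarity between page term frequencies and topic feature-word weights (assumed measure)."""
    tf = Counter(page_terms)
    dot = sum(tf[term] * weight for term, weight in topic_weights.items())
    norm_page = math.sqrt(sum(count * count for count in tf.values()))
    norm_topic = math.sqrt(sum(w * w for w in topic_weights.values()))
    if norm_page == 0 or norm_topic == 0:
        return 0.0
    return dot / (norm_page * norm_topic)


def link_score(anchor_terms, topic_weights):
    """Share of anchor-text terms that are topic feature words (assumed measure)."""
    if not anchor_terms:
        return 0.0
    hits = sum(1 for term in anchor_terms if term in topic_weights)
    return hits / len(anchor_terms)


def relevance(page_terms, anchor_terms, topic_weights, alpha=0.7):
    """Linear combination of content and link scores; alpha is a hypothetical tuning weight."""
    content = content_score(page_terms, topic_weights)
    link = link_score(anchor_terms, topic_weights)
    return alpha * content + (1 - alpha) * link


if __name__ == "__main__":
    # Hypothetical feature words for a clothing-industry topic.
    topic = {"fabric": 1.0, "garment": 1.0}
    score = relevance(["fabric", "garment"], ["fabric", "news"], topic)
    print(f"relevance = {score:.2f}")
```

A crawler could compare this score against a threshold to decide whether to follow a link; the threshold, like `alpha`, would need tuning on real data.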
