期刊文献+

基于文本计算和链接分析的主题导航优化——以ERS网站为例 被引量:4

Combining Link and Content Analysis to Optimize Website Navigation——Illustrated by the Case of ERS Website
下载PDF
导出
摘要 网站的信息组织是图书情报领域研究的重要内容,尤其在导航优化方面也有较多探讨。本文综合运用了文本相似度计算、链接分析、社会网络分析、聚类分析等方法,提出了兼顾页面内容和已有链接关系的网站主题导航优化方案,并针对ERS网站提出了优化的具体做法。论文通过与仅基于内容和仅基于链接的网站主题导航构建方案进行比较,证明所提方案的可行性和有效性。针对ERS网站,实例也证明通过增加语义相似度高于阈值的主题间链接,并按照相似程度对相关主题的链接进行排序,可以有效地实现主题导航优化。研究表明链接关系也是一种隐含语义关系,网站导航既要考虑语义相似度高的页面,也要考虑语义相似度低但存在链接关系的页面。 Information organization of website is an important part of Library and Information Science study, especially the website navigation guidance optimization. By using text similarity calculation, link analysis, social network analysis, and cluster analysis, this paper proposes a new method of website navigation guidance, which combines both webpage content and existing link relations, and then takes ERS Website as an example to present a detailed proposal of the optimization. Besides, this paper verifies the feasibility and effectiveness of the new method by comparing with two other methods (content-based only, and link-based only). For ERS website, the topic-based guidance is optimized by adding links for topics that have high semantic similarity degrees above threshold, and sorting the links by topic semantic similarity degrees. The research shows that existing links imply the relevance of content. Meanwhile, both the website of high semantic similarity and the low semantic similarity with links should be taken into consideration when constructing the website guidance system.
作者 许鑫 苏晓兰
出处 《情报学报》 CSSCI 北大核心 2015年第9期938-948,共11页 Journal of the China Society for Scientific and Technical Information
关键词 文本相似度计算 链接分析 网站导航 text similarity calculation, link analysis, website navigation guidance
  • 相关文献

参考文献20

  • 1lCNNIC发布第34次《中国互联网络发展状况统计报告》[EB/OL].http://www.cnnic.net.cn/gywm/xwzx/rdxw/2014/ 201407/t20140721 47439.htm, 2014-07-21/2015-01-12.
  • 2PERUGINI S. Symbolic links in the open directory project [ J]. Information Processing and Management, 2008, 44 (2) : 910-930.
  • 3Sitemaps & SEO: An Introductory Guide [ EB/OL]. [ 2014-09-20 ]. http://searchenginewatch, corn/article/ 2048706/Sitemaps-SEO-An-Introductory-Guide # disqus _ thread.
  • 4Zhu S, Yu K, Chi Y, Gong Y. Combining content and link for classification using matrix factorization [ J ]. SIGIR, 2007: 487-494.
  • 5Yang H C,Lee C H. A text mining approach on automatic generation of Web directories and hierarchies[ J]. Expert Systems with Applications, 2004,27 (4) : 645-663.
  • 6Ingerwerson P. The calculation of web impact factors[ J ]. Journal of Documentation, 1998, 54 ( 2 ) : 236-243.
  • 7Perkowitz M, Etzioni O. Towards adaptive web sites: conceptual framework and case study [ J ]. Computer Networks, 1999, 31: 1245-1258.
  • 8Almpanidis G, Kot ropoulos C, Pitas I. Combining text and link analysis for focused crawling-An application for vertical search engines [ J ]. Information Systems, 2007,32(6) : 886-908.
  • 9Pal A, Tomar D S, Shrivastava S C. Effective focused crawling based on content and link structure analysis [ J ]. International Journal of Computer Science and Information Security ,2009,2( 1 ) :67-69.
  • 10闫光辉,舒昕,马志程,李祥.基于主题和链接分析的微博社区发现算法[J].计算机应用研究,2013,30(7):1953-1957. 被引量:28

二级参考文献144

共引文献60

同被引文献47

引证文献4

二级引证文献37

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部