
PageCluster: A Method of Web Page Hierarchical Clustering
Abstract: This paper proposes PageCluster, a Web page clustering algorithm, together with an improved variant, ImPageCluster. Both methods take the Web site structure and the page hyperlinks into account, and weight each hyperlink by the importance of each page, expressed as an in-weight and an out-weight. Unlike traditional clustering algorithms, they do not require a similarity threshold to be specified in advance. Experimental results show that the algorithms are feasible and efficient.
Source: Computer Engineering and Applications (《计算机工程与应用》), indexed in CSCD and the Peking University Core Journals list, 2004, No. 29, pp. 84-86 (3 pages)
Keywords: clustering, Web page, hyperlink, similarity matrix, PageCluster, ImPageCluster
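The abstract does not give the weighting formula or the merge procedure, so the following is only an illustrative sketch of the general idea: build a similarity matrix from weighted hyperlinks, then merge clusters bottom-up without any similarity threshold. The function names, the weight `1 / sqrt(out_degree(u) * in_degree(v))`, and the average-linkage merge rule are all assumptions made for illustration and are not taken from the PageCluster paper.

```python
import math
from itertools import combinations


def similarity_matrix(pages, links):
    """Build a symmetric page-to-page similarity from directed hyperlinks.

    ASSUMPTION: each link u -> v is weighted by
    1 / sqrt(out_degree(u) * in_degree(v)); this is a stand-in for the
    in-weight/out-weight scheme, which the abstract does not specify.
    """
    out_deg = {p: 0 for p in pages}
    in_deg = {p: 0 for p in pages}
    for u, v in links:
        out_deg[u] += 1
        in_deg[v] += 1

    sim = {p: {q: 0.0 for q in pages} for p in pages}
    for u, v in links:
        w = 1.0 / math.sqrt(out_deg[u] * in_deg[v])
        sim[u][v] += w
        sim[v][u] += w
    return sim


def hierarchical_merge(pages, sim):
    """Agglomerative clustering with average linkage (hypothetical choice).

    Repeatedly merges the two most similar clusters until one remains,
    so no similarity threshold has to be supplied. Returns the list of
    merges in order, which describes the full hierarchy.
    """
    def avg(a, b):
        return sum(sim[x][y] for x in a for y in b) / (len(a) * len(b))

    clusters = [frozenset([p]) for p in pages]
    merges = []
    while len(clusters) > 1:
        a, b = max(combinations(clusters, 2), key=lambda pair: avg(*pair))
        clusters = [c for c in clusters if c not in (a, b)]
        clusters.append(a | b)
        merges.append((a, b))
    return merges


if __name__ == "__main__":
    # Toy site: A and B link to each other, C and D link to each other,
    # and a single link B -> C connects the two groups.
    pages = ["A", "B", "C", "D"]
    links = [("A", "B"), ("B", "A"), ("C", "D"), ("D", "C"), ("B", "C")]
    for left, right in hierarchical_merge(pages, similarity_matrix(pages, links)):
        print(sorted(left), "+", sorted(right))
```

On the toy site, the two mutually linked pairs are merged first and the two resulting groups are joined last; a caller can cut the merge list at any level to obtain a flat clustering.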