期刊文献+

基于有向带权图的页面聚类算法研究 被引量:2

Study on Page Clustering Algorithms Based on Weighted Directed Graph
下载PDF
导出
摘要 聚类算法是数据挖掘中的一个重要的分析工具。Web使用挖掘中的聚类分析一般分为用户聚类和页面聚类。其中页面聚类是指导网站结构离线优化的重要方法。利用有向带权图表示用户的访问会话记录,对建立的有向带权图模型运用聚类算法实现页面聚类。选取真实数据对典型的聚类算法K-means算法、DBSCAN算法和COBWEB算法进行实验。实验结果表明,在选取的数据集范围内,COBWEB算法准确率要高于K-means算法和DBSCAN算法,时间性能与用户访问频率矩阵大小有密切关系。 Clustering algorithm is an important analytical tool in data mining. Clustering analysis is generally fallen into user clustering and page clustering in Web usage mining. Page clustering is an important methods for guiding for the structure of the site off- line optimization. This paper use weighted directed graph to describe user visit and conversation records, and use clustering algorithms to realize the page clustering by the weighted directed graph mode established. Select the real data carries on the experiment to the typical clustering algorithms K-means algorithm,DBSCAN algorithm and COBWEB algorithm. The experiments results indicate that in the selected data sets, the accuracy rate of COBWEB algorithm is higher than that of K - means algorithm and DBSCAN algorithm, and the time capability is closely related to the size of user visit frequency matrix.
机构地区 合肥工业大学
出处 《计算机技术与发展》 2009年第9期49-53,共5页 Computer Technology and Development
基金 国家自然科学基金项目(70672097) 国家自然科学基金重点项目(70631003)
关键词 有向带权图 聚类算法 页面聚类 K-MEANS算法 DBSCAN算法 COBWEB算法 weighted directed graph clustering algorithms page clustering K - means algorithm DBSCAN algorithm COBWEB algorithm
  • 相关文献

参考文献11

二级参考文献49

共引文献209

同被引文献21

  • 1郑佳谦,徐隽,姚静,牛军钰.论坛社区用户时空特征建模与挖掘[J].计算机研究与发展,2007,44(z3):7-12. 被引量:1
  • 2韩家炜.数据挖掘:概念与技术[M].北京:机械工业出版社,2000.
  • 3Han J,Kamber M.数据挖掘概念与技术[M].范明,译.北京:机械工业出版社,2007:32-59.
  • 4戴维迪,张璐.基于网格密度和距离信息特征的聚类算法[J].华南理工学报,2009(4):18-23.
  • 5Dai Weidi, Hou Yuexian, He Pitian. A Clustering Algorithm Based on Building a Density-tree[ C]//Proc. of the 4th Int' 1 Conf. on Machine Learning and Cybernetics. Guangzhou : [ s.n. ],2005:18-21.
  • 6Blake C L, Merz C J. UCI machine learning repository of ma- chine learning databases [ EB/OL]. 1998. http://www, ics. uci. edu/~ mleam/MLSummary, html.
  • 7Elder J, Pregibon D. A statistical perspective on knowledge discovery in database[ M]//Advances in Knowledge Discover- y and Data Mining. [s. 1. ] :AAAI/MIT Press,1996:83-115.
  • 8Meila M. Comparing Clusterings [ R/OL ]. 2005. http ://www. stat. washington, edu/www/research/reports/2002.
  • 9Getoor L, Diehl C. Link mining : a survey [ J ]. ACM SIGKDD Explorations Newsletter, 2005, 7 (2) : 3-12.
  • 10Cook D J, Holder L B. Mining graph data[ M]. New Jersey: John Wiley & Sons, 2007.

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部