期刊文献+

CK-Modes Clustering Algorithm Based on Node Cohesion in Labeled Property Graph

原文传递
导出
摘要 The designation of the cluster number K and the initial centroids is essential for K-modes clustering algorithm. However, most of the improved methods based on K-modes specify the K value manually and generate the initial centroids randomly, which makes the clustering algorithm significantly dependent on human-based decisions and unstable on the iteration time. To overcome this limitation, we propose a cohesive K-modes (CK-modes) algorithm to generate the cluster number K and the initial centroids automatically. Explicitly, we construct a labeled property graph based on index-free adjacency to capture both global and local cohesion of the node in the sample of the input datasets. The cohesive node calculated based on the property similarity is exploited to split the graph to a K-node tree that determines the K value, and then the initial centroids are selected from the split subtrees. Since the property graph construction and the cohesion calculation are only performed once, they account for a small amount of execution time of the clustering operation with multiple iterations, but significantly accelerate the clustering convergence. Experimental validation in both real-world and synthetic datasets shows that the CK-modes algorithm outperforms the state-of-the-art algorithms.
出处 《Journal of Computer Science & Technology》 SCIE EI CSCD 2019年第5期1152-1166,共15页 计算机科学技术学报(英文版)
基金 supported by the National Natural Science Foundation of China under Grant No. 61772534 the Excellent Chinese-Foreign Youth Exchange Foundation Program of Chinese Association of Science and Technology under Grant No. 311319000207.
  • 相关文献

参考文献2

二级参考文献16

  • 1徐永红,洪文学,陈铭明.基于Radviz及其优化的可视化故障诊断方法[J].计算机应用研究,2009,26(3):840-842. 被引量:5
  • 2Flake G W, Tarjan R E, Tsioutsiouliklis K. Graph cluste- ring and minimum cut trees. Internet Mathematics, 2004, 1(4) : 385-408.
  • 3Fjallstrom P. Algorithms for graph partitioning: A Survey, Linkoping Electronic Articles in Computer and Information Science, 1998, 3(10): 1-34.
  • 4Lowell W B, Robin J W, Peter J C. Topics in Algebraic Graph Theory. New York.. Cambridge University Press, 2004.
  • 5Yang B, Cheung W K, Liu J. Community mining from signed social networks. IEEE Transactions on Knowledge and Data Engineering, 2007, 19(10): 1333-1348.
  • 6Pizzuti C. A multi-objective genetic algorithm for community detection in networks//Proceeding of the 21st International Conference on Tools with Artificial Intelligence. Newark, New Jersey, USA, 2009: 379-386.
  • 7Tian Y, Hankins R A, Patel J M. Efficient aggregation for graph summarization//Proceedings of the ACM SIGMOD International Conference on Management of Data. Vancouver, BC, Canada, 2008:567-580.
  • 8Zhang N, Tian Y, Patel J M. Discovery-driven graph summarization//Proceeding of the 26th IEEE International Conference on Data Engineering. Long Beach, California, USA, 2010:880-891.
  • 9Cheng H, Zhou Y, Yu J X. Clustering large attributed graphs: A balance between structural and attribute similarities.ACM Transactions on Knowledge Discovery from Data, 2011, 5(2) : 1-33.
  • 10Liu Z, Yu J X, Cheng H. Approximate homogeneous graph summarization. Journal of Information Processing, 2012, 20(1) : 77-88.

共引文献30

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部