期刊文献+

基于密度峰和划分的快速聚类算法 被引量:3

A Fast Clustering Algorithm Based on Cluster-centers and Partition
下载PDF
导出
摘要 传统基于划分的聚类算法需要人工给定聚类数,且由于算法采取刚性划分,可能会导致将较大或延伸状的聚类簇分割的现象,导致错误的聚类结果。密度峰聚类是近年提出的一种新的基于密度的聚类算法,该算法不需要预先指定聚类数目,且能够发现非球形簇。将密度峰思想引入基于划分的聚类算法,提出一种基于密度峰和划分的快速聚类算法(DDBSCAN),该算法首先获取一组簇的核心对象(密度峰),用于描述簇的"骨骼",而后将周围的点划分到最近的核心对象,最后通过判断划分边界处的密度情况合并簇。实验证明,该算法能有效地适应任意形状、大小不一的数据集,与传统基于密度的聚类算法相比收敛速度更快。 The clustering algorithm based on traditional partition needs to give the number of clustering artificially,and due to the rigid partition of the algorithm,it may lead to the segmentation of large or extended clusters,leading to the wrong clustering results. Clustering by density peak is a new clustering algorithm based on density proposed in recent years. The algorithm does not need to specify the number of clusters in advance,and can detect nonspherical clusters. A fast clustering algorithm based on density peak and partition(DDBSCAN) is proposed in this paper. The algorithm first obtains the cluster center(density peak) of a group of clusters,which describes the"skeleton"of the cluster,then divides the surrounding points into the nearest core object,and finally the clusters is merged by judging the density at the dividing edge. Experiments show that the algorithm can effectively adapt to data sets of arbitrary shape and size,and converges faster than traditional clustering algorithms based on density.
作者 琚书存 程文杰 徐建鹏 徐祥 徐阳 JU Shu-cun;CHENG Wen-jie;XU Jian-peng;XU Xiang;XU Yang(Rural Comprehensive Economic Information Center of Anhui Province,Hefei 230001,China;Anhui Agrometeorological Center,Hefei 230001,China)
出处 《计算机与现代化》 2018年第8期16-20,共5页 Computer and Modernization
基金 国家科技支撑计划项目(2014BAD10B05-02) 国家星火计划项目(2014GA710001) 安徽省科技攻关项目(1804A07020124)
关键词 密度峰聚类 核心对象 基于划分 边界密度 任意形状 clustering by density peak cluster center partition-based boundary density irregular shape
  • 相关文献

参考文献12

二级参考文献58

  • 1黄永平,邹力鹍.数据仓库中基于密度的批量增量聚类算法[J].计算机工程与应用,2004,40(29):206-208. 被引量:9
  • 2李洁,高新波,焦李成.基于特征加权的模糊聚类新算法[J].电子学报,2006,34(1):89-92. 被引量:113
  • 3Dubes R C, Jain A K. Algorithms for Clustering Data. Englewood Cliffs, NJ: Prentice Hall, 1988.
  • 4Zhang Tian, Ramakrishnan Raghu, Livny Miron. Birch: An efficient data clustering method for very large databases// Proceedings of the ACM SIGMOD Conference on Management of Data. Montreal, Canada, 1996: 103-114.
  • 5Guha S, Rastogi R, Shim K. ROCK:A robust clustering algorithm for categorical attributes//Proceedings of the 15th International Conference on Data Engineering. Sydney, Australia, 1999:1-11.
  • 6Gupta G K, Ghosh J. Value balanced agglomerative connectivity clustering//Proceedings of the SPIE Conference on Data Mining and Knowledge Discovery Ⅲ. Orlando, USA, 2001:6-15.
  • 7Dutta M, Kakoti Mahanta A, Pujari Arun K. QROCK: A quick version of the ROCK algorithm for clustering of categorical data. Pattern Recognition Letters, 2005, 26(15): 2364-2373.
  • 8Gehrke J. New research directions in KDD. Report on the SIGKDD 2001 Conference Panel, SIGKDD Explorations,2002, 3(2): 76-77.
  • 9Steinbach M, Karypis G, Kumar V. A comparison of document clustering techniques. Minnesota: University of Minnesota, Technical Report: 00-034, 2002.
  • 10Sebastiani F. A tutorial on automatic text categorization// Proceedings of the 1st Argentinean Symposium on Artificial Intelligence (ASAI'99). Buenos Aires, AR, 1999:7-35.

共引文献1157

同被引文献29

引证文献3

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部