期刊文献+

基于密度可达的多密度聚类算法 被引量:7

Density-reachable Based Clustering Algorithm for Multi-density
下载PDF
导出
摘要 为对多密度数据集聚类,提出一种基于密度可达的多密度聚类算法。使用网格划分技术来提高计算每个点密度值的效率,每次聚类都是从最高密度点开始,根据密度可达的概念和广度优先的策略逐步向外扩展进行聚类。实验表明,该算法能够有效地对任意形状、大小的均匀数据集和多密度数据集进行聚类,并能较好地识别出孤立点和噪声,其精度和效率优于SNN算法。 In order to cluster multi-density dataset, a clustering algorithm based on density-reachable for multi-density is proposed. Grid partition method is used to improve efficiency when computing each point's density. A clustering starts with the highest density point and uses expansion to form a cluster based on density-reachable and breadth-first strategy. Experimental results show that this algorithm can effectively discover clusters of arbitrary shapes for multi-density and uniformity density data sets with noises. It can get good cluster quality and is more efficient than SNN algorithm.
出处 《计算机工程》 CAS CSCD 北大核心 2009年第17期66-68,共3页 Computer Engineering
基金 国家自然科学基金资助项目(60673087) 郑州大学骨干教师基金资助项目
关键词 聚类算法 邻域网格 密度可达 广度优先 多密度 clustering algorithm neighborhood grid density-reachable breadth-first multi-density
  • 相关文献

参考文献3

二级参考文献12

  • 1Zhao Yanchang, Song Junde. AGRID: an efficient algorithm for clustering large high-dimensional datasets[A]. Proc the 7th Pacific-Asia Conf on Knowledge Discovery and Data Mining (PAKDD-03)[C]. Seoul ,Korea : 2003.
  • 2Ester M, Kriegel H P, Sander J, et al. A density-based algorithm for discovering clusters in large spatial databases with noise[A]. Proc the 2nd Int Conf On Knowledge Discovery and Data Mining[C].Portland, Oregon : 1996. 226-- 231.
  • 3HanJW KamberM.数据挖掘概念与技术[M].机械工业出版社,2001..
  • 4Wang W, Yang J, Muntz R R. STING: A Statistical Information Grid Approach to Spatial Data Mining. In: Proc of the 23rd International Conference on Very Large Data Bases. Athens,Greece, 1997, 186-195
  • 5Sheikholeslami G, Chatterjee S, Zhang A D. WaveCluster: A Multi-Resolution Clustering Approach for Very I.arge Spatial Databases. In: Proc of the 24th International Conference on Very Large Data Bases. New York, USA, 1998, 428-439
  • 6Agrawal R, Gehrke J, Gunopulos D, Raghavan P. Automatic Subspace Clustering of High Dimensional Data for Data Mining Applications. In: Proc of the ACM SIGMOD International Conference on Management of Data. Seattle, USA, 1998, 94-105
  • 7Zhao Y C, Song J D. GDILC: A Grid-Based Density-lsoline Clustering Algorithm. In: Proc of the International Conference on Info-Teeh and Info-Net. Beijing, China, 2001, 140-145
  • 8Hsu C M, Chen M S. Subspace Clustering of High Dimensional Spatial Data with Noises. In: Proe of the Pacific-Asia Conference on Knowledge Discovery and Data Mining. Sydney, Australia, 2004, 31-40
  • 9Ma E W M, Chow T W S. A New Shifting Grid Clustering Algorithm. Pattern Recognition, 2004, 37(3): 503-514
  • 10Hinneburg A, Keim D A. Optional Grid-Clustering: Towards Breaking the Curse of Dimensionality in High-Dimensional Clustering. In: Proc of the 25th International Conference on Very Large Data Bases. Edinburgh, Scotland, 1999, 506-517

共引文献43

同被引文献88

引证文献7

二级引证文献23

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部