摘要
针对基于密度带有"噪声"的空间聚类应用(DBSCAN)聚类算法存在的3个主要问题:输入参数敏感、对内存要求高、数据分布不均匀时影响聚类效果,提出了一种基于遗传方法的DBSCAN算法改进方案数据分区中使用遗传思想的DBSCAN算法(DPDGA)来提高聚类质量.利用遗传算法改进K-means算法来获取初始聚类中心;对数据进行划分,在此基础上对划分的每一部分使用DBSCAN算法进行聚类;合并聚类的结果.仿真实验表明,新方法较好解决了传统DBSCAN聚类算法存在的问题,在聚类效率和聚类效果方面均优于传统DBSCAN聚类算法.
There are three problems along with the Density Based Spatial Clustering of Applications with Noise(DBSCAN) Clustering Algorithm: input sensitivity, desire for too much memory space and the effect of nonuniform data. To solve these problems, a fast Data Partition DBSCAN using Genetic Algorithm(DPDGA) Algorithm is developed which considerably improves the cluster quality. First, the Genetic Algorithm is used to improve the K-means Algorithm to get the initial clustering center. Second, data is partitioned and the DBSCAN Algorithm is applied to cluster partitions. Finally, all clustered result sets are merged. Simulation experiments indicate that the DPDGA Algorithm works well to solve these problems and that both the efficiency and the cluster quality are better than those of the original DBSCAN Algorithm.
出处
《西安电子科技大学学报》
EI
CAS
CSCD
北大核心
2008年第3期523-529,共7页
Journal of Xidian University
基金
国家自然科学基金资助(50474033)
关键词
聚类算法
遗传算法
数据划分
密度
clustering algorithm
genetic
data partition
density