期刊文献+

一种改进的自适应快速AF-DBSCAN聚类算法 被引量:30

An improved adaptive and fast AF-DBSCAN clustering algorithm
下载PDF
导出
摘要 基于密度的DBSCAN聚类算法可以识别任意形状簇,但存在全局参数Eps与Min Pts的选择需人工干预,采用的区域查询方式过程复杂且易丢失对象等问题,提出了一种改进的参数自适应以及区域快速查询的密度聚类算法。根据KNN分布与数学统计分析自适应计算出最优全局参数Eps与Min Pts,避免聚类过程中的人工干预,实现了聚类过程的全自动化。通过改进种子代表对象选取方式进行区域查询,无需漏检操作,有效提高了聚类的效率。对4种典型数据集的密度聚类实验结果表明,本文算法使得聚类精度提高了8.825%,聚类的平均时间减少了0.92 s。 The density-based DBSCAN clustering algorithm can identify clusters with arbitrary shape,however,the choice of the global parameters Eps and Min Pts requires manual intervention,the process of regional query is complex and loses objects easily. Therefore,an improved density clustering algorithm with adaptive parameter for fast regional queries is proposed. Using KNN distribution and mathematical statistical analysis,the optimal global parameters Eps and Min Pts are adaptively calculated,so as to avoid manual intervention and enable full automation of the clustering process. The regional query is conducted by improving the selection manner of the object,which is represented by a seed and thus avoiding manual intervention,and so the clustering efficiency is effectively increased. The experiment results looking at density clustering of four typical data sets show that the proposed method effectively improves clustering accuracy by 8.825% and reduces the average time of clustering by 0.92 s.
出处 《智能系统学报》 CSCD 北大核心 2016年第1期93-98,共6页 CAAI Transactions on Intelligent Systems
基金 国家自然科学基金资助项目(61373126) 江苏省产学研联合创新资金-前瞻性联合研究基金资助项目(BY2013015-33)
关键词 密度聚类 DBSCAN 区域查询 全局参数 KNN分布 数学统计分析 density clustering DBSCAN region query global parameters KNN distribution mathematical statis tics and analysis
  • 相关文献

参考文献4

二级参考文献22

  • 1岳士弘,李平,郭继东,周水庚.Using Greedy algorithm: DBSCAN revisited Ⅱ[J].Journal of Zhejiang University Science,2004,5(11):1405-1412. 被引量:2
  • 2李锁花,孙志挥,周晓云.基于特征向量的分布式聚类算法[J].计算机应用,2006,26(2):379-382. 被引量:6
  • 3..http://www.ics.uci.edu/mleam/MLSununary.html,.
  • 4MacQueen J.Some methods for classification and analysis of multivariate observations[C]//LeCam L,Neyman J,eds.Proceedings of the Fifth Berkeley Symposium on Mathematics,Statistics and Probability.Berkeley:University of California Press,1967:281-297.
  • 5Leonard Kaufman,Peter J Rousseenw.Finding groups in data:An introduction to cluster analysis[M].New York:Wiley Press,2005.
  • 6Tan P N,Steinbach M,Kumar V 著,范明,范宏建,等译,数据挖掘导论(Introduction to DataMining).北京:人民邮电出版社,2006.
  • 7Ester M,Kriegel H P,Sander J.A density-based algorithm for discovering clusters in large spatial databases with noise[C]//Simoudis E,Hart JW,Fayyad UM,eds.Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining.Portland:AAAI Press,1996:226-231.
  • 8Ankerst M,Breunig M M,Kriegcl H P.OPTICS:ordering points to identify the clustering structure[C]//Alex Dells,Christns Faloutscs,Shahram Ghandeharizadeh eds.Proceedings of the ACM SIGMOD'99 lnt Conf on Management of Data.Philadelphia Pennsylvania:ACM Press,1999:49-60.
  • 9Hinneburg A,Keim D A.An efficient approach to clustering in large multimedia databases with noise[C]//Rakesh Agrawal,Paul Stolorz,eds.Proceedings of the 4th lnt Conf on Knowledge Discovery and Data Mining.New York:AAAI Press,1998:58-65.
  • 10Feng P J,C,e L D.Adaptive DBSCAN-bused algorithm for constellation reconstruction and modulation identification[C]//Keyun Tang,Dayong Lio,eds.Proceedings of Radio Science Conference 2004.Beijing:Pub House of Electronics Industry,2004:177-180.

共引文献170

同被引文献289

引证文献30

二级引证文献222

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部