期刊文献+

FDBSCAN:一种快速 DBSCAN算法(英文) 被引量:42

FDBSCAN: A Fast DBSCAN Algorithm
下载PDF
导出
摘要 聚类分析是一门重要的技术 ,在数据挖掘、统计数据分析、模式匹配和图象处理等领域具有广泛的应用前景 .目前 ,人们已经提出了许多聚类算法 .其中 ,DBSCAN是一种性能优越的基于密度的空间聚类算法 .利用基于密度的聚类概念 ,用户只需输入一个参数 ,DBSCAN算法就能够发现任意形状的类 ,并可以有效地处理噪声 .文章提出了一种加快 DBSCAN算法的方法 .新算法以核心对象邻域中所有对象的代表对象为种子对象来扩展类 ,从而减少区域查询次数 ,降低 I/ O开销 .实验结果表明 ,FDBSCAN能够有效地对大规模数据库进行聚类 ,速度上数倍于 DBSCAN. Clustering is an important application area for many fields including data mining, statistical data analysis, pattern recognition, image processing, and other business applications. Up to now, many algorithms for clustering have been developed. Contributed from the database research community, DBSCAN algorithm is an outstanding representative of clustering algorithms for its good performance in clustering spatial data. Relying on a density based notion of clusters, DBSCAN is designed to discover clusters of arbitrary shape. It requires only one input parameter and supports the user in determining an appropriate value of it. In this paper, a fast DBSCAN algorithm (FDBSCAN) is developed which considerably speeds up the original DBSCAN algorithm. Unlike DBSCAN, FDBSCAN uses only a small number of representative points in a core point's neighborhood as seeds to expand the cluster such that the execution frequency of region query and consequently the I/O cost are reduced. Experimental results show that FDBSCAN is effective and efficient in clustering large scale databases, and it is faster than the original DBSCAN algorithm by several times.
出处 《软件学报》 EI CSCD 北大核心 2000年第6期735-744,共10页 Journal of Software
基金 国家重点基础研究计划 !No.G19980 30 414 国家自然科学基金!No.6 97430 0 国家博士后项目基金 !No.19990 2 46 2 1&&
关键词 大规模数据库 数据挖掘 聚类 快速DBSCAN算法 代表点 Large scale database, data mining, clustering, fast DBSCAN algorithm, representative point.
  • 相关文献

参考文献6

  • 1Sheikholeslami G,Proceedings of the 2 4th VL DB Conference,1998年,428页
  • 2Zhang W,Proceedings of the 2 3rd VL DB Conference,1997年,186页
  • 3Chen M S,IEEE Transactions on Knowledge andData Engineering,1996年,8卷,6期,866页
  • 4Ester M,Proceedings of the 2nd International Conference on Knowledge Discovering in Data,1996年,226页
  • 5Zhang T,Proceedings of the ACM SIGMOD International Conference on Management of Data,1996年,103页
  • 6Ng R T,Proceedings of the2 0 th VL DB Conference,1994年,144页

同被引文献357

引证文献42

二级引证文献411

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部