
An Improved Density-based Spatial Clustering Algorithm with Sampling (cited by: 4)
Abstract: DBSCAN, a density-based clustering algorithm, is an effective spatial clustering algorithm that can discover clusters of arbitrary shape and handle noise effectively. However, it has some drawbacks: (1) it clusters on spatial attributes only and ignores non-spatial attributes; (2) clustering a large-scale spatial database requires substantial memory and I/O. Building on an analysis of these shortcomings, this paper proposes an improved density-based spatial clustering algorithm with sampling (IDBSCAS), which handles large-scale spatial databases effectively and takes both spatial and non-spatial attributes into account. Tests on 2-D spatial data show that the algorithm is feasible and effective.
Source: Journal of Image and Graphics (《中国图象图形学报》), CSCD, Peking University Core Journal, 2007, No. 11, pp. 2031-2036 (6 pages)
Funding: National Natural Science Foundation of China (60673127); Natural Science Foundation of Jiangsu Province (BK2001045)
Keywords: spatial data mining, spatial clustering, density, seeds, non-spatial attributes
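
The abstract states that the proposed algorithm considers non-spatial attributes alongside spatial ones, but it does not say how. As a rough illustration of that idea only, the sketch below runs a textbook DBSCAN in which two points count as neighbors when they are spatially close and their non-spatial attribute values are also close. This is not the authors' IDBSCAS: the function names, the single numeric attribute, and the `delta` threshold are assumptions made for this example, and the sampling step is omitted because the abstract does not describe it. It uses `math.dist`, which requires Python 3.8 or later.

```python
from math import dist

NOISE = -1
UNVISITED = None


def region_query(points, attrs, i, eps, delta):
    """Return indices of points within eps of point i (Euclidean distance)
    whose non-spatial attribute differs from point i's by at most delta.
    The delta test is an assumption for illustration, not the paper's rule."""
    return [
        j for j in range(len(points))
        if dist(points[i], points[j]) <= eps and abs(attrs[i] - attrs[j]) <= delta
    ]


def dbscan_with_attributes(points, attrs, eps, min_pts, delta):
    """Textbook DBSCAN with an extra attribute-similarity condition.
    Returns one label per point: a cluster id (0, 1, ...) or NOISE."""
    labels = [UNVISITED] * len(points)
    cluster_id = 0
    for i in range(len(points)):
        if labels[i] is not UNVISITED:
            continue
        neighbors = region_query(points, attrs, i, eps, delta)
        if len(neighbors) < min_pts:
            labels[i] = NOISE  # may later be relabeled as a border point
            continue
        labels[i] = cluster_id
        seeds = [j for j in neighbors if j != i]
        while seeds:
            j = seeds.pop()
            if labels[j] == NOISE:
                labels[j] = cluster_id  # border point: join, but do not expand
            if labels[j] is not UNVISITED:
                continue
            labels[j] = cluster_id
            j_neighbors = region_query(points, attrs, j, eps, delta)
            if len(j_neighbors) >= min_pts:
                seeds.extend(j_neighbors)  # j is a core point, keep growing
        cluster_id += 1
    return labels


# Two spatial groups, plus one point that sits inside the first group
# spatially but has a very different attribute value.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11),
          (0.5, 0.5)]
attrs = [1.0, 1.1, 0.9, 1.0, 5.0, 5.1, 4.9, 5.0, 9.0]
labels = dbscan_with_attributes(points, attrs, eps=1.5, min_pts=3, delta=0.5)
print(labels)
```

In this toy data the last point is spatially inside the first group, yet it is labeled noise because its attribute value is far from its neighbors' values. Plain DBSCAN on the coordinates alone would have absorbed it into that cluster. The brute-force neighbor search here is quadratic in the number of points; a spatial index would be needed for the large databases the abstract targets.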