期刊文献+

一种改进的BIRCH分层聚类算法 被引量:15

Improved BIRCH Hierarchical Clustering Algorithm
下载PDF
导出
摘要 由于传统的BIRCH算法是用直径来控制聚类的边界,因此如果簇不是球形,它就不能很好地工作,而且传统的BIRCH算法只适用于单表。针对BIRCH的这些缺点,本文提出了一种改进的BIRCH——IBIRCH算法,该算法首先通过ID传播把多个表联系起来,使得BIRCH算法可以适用于多表的情况,再通过计算共享最近邻密度,可以发现任意形状的簇。实验表明,该算法不仅具有较强的可伸缩性,还可以得到较高精确的聚类结果。 The traditional BIRCH clustering algorithm has many shortcomings, such as it is only fit for single table and only finds the global clusters. For these shortcomings, we introduce an improved algorithm—IBIRCH algorithm. First, this algorithm joins every table through the tuple ID propagation to be applied in relational databases. Then, find arbitrary clusters using the shared nearest neighbor density algorithm. The experiment shows the efficiency and scalability of this approach.
出处 《计算机科学》 CSCD 北大核心 2008年第3期180-182,208,共4页 Computer Science
基金 国家自然科学基金(60673136)
关键词 BIRCH算法 层次聚类 ID传播 SNN密度 BIRCH algorithm, Hierarchical clustering, Tuple ID propagation, SNN density
  • 相关文献

参考文献6

  • 1Han Jiawei, Kamber M. Data mining: concepts and technique.China Machine Press, 2006
  • 2Zhang T, Ramakrishnan R, Livny M. BIRCH: An efficient data clustering method for very large databases[C]. In: Proc. ACM SIGMOD Conf. Management of Data, Montreal ,1996. 103-114
  • 3Ertoz L,Steinbach M,Kumar V. Finding Clusters of Different Sizes,Shapes, and Densities in Noisy, High Dimensional Data[C]. In: Proc. of the 2003 SIAM International Conference on Data Mining,San Francisco, 2003. 150-155
  • 4Yin X, Han J, Yang J, et al. CrossMine: Efficient Classification Across Multiple Database Relations[C]. In: ICDE, Boston, 2004. 399-410
  • 5Ester M, Kriegel H, Sander J, et al. A density-based algorithm for discovering clusters in large spatial databases with noise[C]. In: Proc. 2nd Int Conf Knowledge Discovery and Data Mining (KDD'96), Portland , 1996. 226-231
  • 6Karypis G, Han E H, Kumar V. CHAMELEON: A Hierarchical Clustering Algorithm Using Dynamic Modeling [J]. IEEE Computer, 1999,32(8) : 68-75

同被引文献117

引证文献15

二级引证文献217

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部