期刊文献+

ISNN:一种基于密度的高效增量聚类算法

An Efficient Incremental Cluster Algorithm Based on Density
下载PDF
导出
摘要 目的提高算法效率,减少磁盘访问次数,提出一种基于密度的高效增量聚类算法ISNN.方法将更新对象的空间进行划分,定义了基于该划分的最近邻居概念,在此基础上应用一种剪枝策略来确定受影响对象的集合,数据更新时,只需要对受影响对象集合进行处理.结果受影响对象集合远小于原数据集合,显著地提高了算法效率.结论实验表明,ISNN在效率和磁盘访问次数上都显著优于SNN算法. This paper proposes an incremental algorithm, ISNN which is based on density-based clustering algorithm SNN. The algorithm partitions the space around the update object, and redefines the nearest neighbors in each partition. In addition, a prune strategy is adopted; in this way we can find the influenced object dataset. When updating, the algorithm only deals with the set of the influenced objects instead of the whole dataset. Since the size of the influenced object dataset is far smaller than that of the whole dataset, the performance of the algorithm is improved. The evaluation shows that ISNN has much better efficiency and less I/O processing than SNN.
出处 《沈阳建筑大学学报(自然科学版)》 CAS 2006年第6期1015-1018,共4页 Journal of Shenyang Jianzhu University:Natural Science
基金 辽宁省自然科学基金(20052006) 辽宁省教育厅攻关计划(05L354)
关键词 聚类分析 SNN 增量聚类算法 基于密度的算法 ISNN cluster analysis SNN incremental clustering algorithm the density-based algorithm ISNN
  • 相关文献

参考文献9

  • 1Kaufman L,Rousseeuw P J.Finding groups in data:An introduction to cluster analysis[M].New York:John Wiley & Sons,1990.
  • 2Raymond T,Han J W.CLARANS:A method for clustering objects for spatial data mining[J].IEEE Transactions on Knowledge and Data Engineering,2002,14(5):1003-1016.
  • 3Zhang T,Ramakerishnan R,Livany.BIRCH:An efficient data clustering method for very large databases[C]//Proceedings of the ACM SIGMOD International Conference on Management of Data.New York:ACM Press,1996:103-114.
  • 4Guha S,Rastogir,Sh I M K.CURE:An efficient clustering algorithm for large databases[C]//Proceedings of the ACM SIGMOD International Conference on Management of Data.New York:ACM Press,1998:73-84.
  • 5Ankerst M,Breunig M,Kriegel H P,et al.OPTICS:Ordering points to identify the clustering structure[C]//Proceedings of the ACM SIGMOD International Conference on Management of Data.New York:ACM Press,1999:49-60.
  • 6Levent E,Meacheal S,Vipin K.Finding clusters with different size,shapes,and densities in noise high dimensional data[C]//Proc.SDM'.New York:SDM Press,2003:57-69.
  • 7Agrawal R,Gehrke J,Gunopulos D,et al.Automatic subspace clustering of high dimensional data for data mining applications[C]//Proceedings of the ACM SIGMOD International Conference on Management of Data.New York:ACM Press,1998:94-105.
  • 8Sun H L,Bao Y B,Zhao F X,et al.CD-Trees:An efficient index structure for outlier detection[C]//Proc.of WAIM' 04.Dalian:WAIM Press,2004:600-609.
  • 9Ester M,Kriegel H P,Sander J,et al.Incremental clustering for mining in a data warehousing environment[C]//Gupta A,Shmueli O,Widom J,et al.Proc.of the 24th Int' l Conf.on Very Large Data Bases.San Fransisco:Morgan Kaufmann Publishers,1998:323 -333.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部