ISNN:一种基于密度的高效增量聚类算法

An Efficient Incremental Cluster Algorithm Based on Density

下载PDF

导出

摘要目的提高算法效率,减少磁盘访问次数,提出一种基于密度的高效增量聚类算法ISNN.方法将更新对象的空间进行划分,定义了基于该划分的最近邻居概念,在此基础上应用一种剪枝策略来确定受影响对象的集合,数据更新时,只需要对受影响对象集合进行处理.结果受影响对象集合远小于原数据集合,显著地提高了算法效率.结论实验表明,ISNN在效率和磁盘访问次数上都显著优于SNN算法. This paper proposes an incremental algorithm, ISNN which is based on density-based clustering algorithm SNN. The algorithm partitions the space around the update object, and redefines the nearest neighbors in each partition. In addition, a prune strategy is adopted; in this way we can find the influenced object dataset. When updating, the algorithm only deals with the set of the influenced objects instead of the whole dataset. Since the size of the influenced object dataset is far smaller than that of the whole dataset, the performance of the algorithm is improved. The evaluation shows that ISNN has much better efficiency and less I/O processing than SNN.

作者孙焕良邱菲朱叶丽王永会

机构地区沈阳建筑大学信息与控制工程学院

出处《沈阳建筑大学学报（自然科学版）》 CAS 2006年第6期1015-1018,共4页 Journal of Shenyang Jianzhu University：Natural Science

基金辽宁省自然科学基金(20052006) 辽宁省教育厅攻关计划(05L354)

关键词聚类分析 SNN 增量聚类算法基于密度的算法 ISNN cluster analysis SNN incremental clustering algorithm the density-based algorithm ISNN

分类号 TP311.131 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献9

1Kaufman L,Rousseeuw P J.Finding groups in data:An introduction to cluster analysis[M].New York:John Wiley & Sons,1990.
2Raymond T,Han J W.CLARANS:A method for clustering objects for spatial data mining[J].IEEE Transactions on Knowledge and Data Engineering,2002,14(5):1003-1016.
3Zhang T,Ramakerishnan R,Livany.BIRCH:An efficient data clustering method for very large databases[C]//Proceedings of the ACM SIGMOD International Conference on Management of Data.New York:ACM Press,1996:103-114.
4Guha S,Rastogir,Sh I M K.CURE:An efficient clustering algorithm for large databases[C]//Proceedings of the ACM SIGMOD International Conference on Management of Data.New York:ACM Press,1998:73-84.
5Ankerst M,Breunig M,Kriegel H P,et al.OPTICS:Ordering points to identify the clustering structure[C]//Proceedings of the ACM SIGMOD International Conference on Management of Data.New York:ACM Press,1999:49-60.
6Levent E,Meacheal S,Vipin K.Finding clusters with different size,shapes,and densities in noise high dimensional data[C]//Proc.SDM'.New York:SDM Press,2003:57-69.
7Agrawal R,Gehrke J,Gunopulos D,et al.Automatic subspace clustering of high dimensional data for data mining applications[C]//Proceedings of the ACM SIGMOD International Conference on Management of Data.New York:ACM Press,1998:94-105.
8Sun H L,Bao Y B,Zhao F X,et al.CD-Trees:An efficient index structure for outlier detection[C]//Proc.of WAIM' 04.Dalian:WAIM Press,2004:600-609.
9Ester M,Kriegel H P,Sander J,et al.Incremental clustering for mining in a data warehousing environment[C]//Gupta A,Shmueli O,Widom J,et al.Proc.of the 24th Int' l Conf.on Very Large Data Bases.San Fransisco:Morgan Kaufmann Publishers,1998:323 -333.

1孙焕良,邱菲,刘俊岭,朱叶丽.IncSNN——一种基于密度的增量聚类算法[J].计算机研究与发展,2006,43(z3):309-313. 被引量：5
2许洪玮,曹江中,何家峰,戴青云.基于密度与路径的稳健谱聚类[J].计算机工程与应用,2015,51(2):165-170. 被引量：1
3辜季艳.基于主动网络技术的网络管理模型研究[J].电子技术与软件工程,2014(19):19-19.
4IE零日漏洞,约70% PC会受影响[J].微电脑世界,2013(11):116-116.
5顾洪博,张继怀.不确定性数据的聚类分析研究及应用[J].河北工程大学学报（自然科学版）,2012,29(1):109-112. 被引量：1
6张钰,陆军.基于ISNN和HGA的沪深300指数预测方法[J].计算机应用研究,2010,27(6):2156-2159.
7曾泽林,段明秀.基于密度的聚类算法DBSCAN的研究与实现[J].科技信息,2012(30):163-163. 被引量：3
8刘敬光,刘桂雄,周德光,洪晓斌.SNN算法在测量信息处理中的应用[J].现代制造工程,2006(10):90-92. 被引量：1
9孙焕良,毕占举,刘俊岭,周祥国,许景科.一种发现多层次密度的聚类算法[J].沈阳建筑大学学报（自然科学版）,2006,22(2):329-333.
10肖必虎.手机Link车机的方向?[J].音响改装技术,2012(7).

沈阳建筑大学学报（自然科学版）

2006年第6期

浏览历史

内容加载中请稍等...

ISNN:一种基于密度的高效增量聚类算法

参考文献9

相关作者

相关机构

相关主题

浏览历史