Candidate Pruning Based Density Subspace Clustering (基于候选对象裁剪的密度子空间聚类)


Abstract: Existing subspace clustering algorithms suffer from low accuracy and low efficiency. To address this, the paper proposes DSUB, a subspace clustering algorithm. A candidate pruning method reduces the number of candidate objects for clustering and divides the remaining candidates into groups, so that every cluster to be searched must be a subset of a single group, which lowers the complexity of the subsequent clustering steps. A new neighborhood query method and a sampling coverage strategy are also introduced to speed up density clustering. Experimental results show that DSUB achieves high accuracy and can discover clusters of arbitrary shape, that its computational complexity is linear in the amount of data, that it is robust to noise, and that its clustering results do not depend on the processing order. The authors conclude that DSUB is well suited to subspace clustering.
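The abstract gives no algorithmic detail, so the following is only a minimal sketch of the general idea of pruning and grouping candidates before density clustering. It is not DSUB's actual procedure. The function names, the per-dimension density test, and the gap-based grouping rule are all assumptions made for illustration. The sketch relies on two simple facts: an object with fewer than `min_pts` objects within `eps` on some single dimension of the subspace cannot have `min_pts` neighbors within `eps` in the whole subspace, and a chain of surviving objects linked by steps of at most `eps` cannot cross a gap wider than `eps` in any one-dimensional projection.

```python
from bisect import bisect_left, bisect_right


def prune_candidates(points, dims, eps, min_pts):
    """Return sorted indices of points that pass a per-dimension density check.

    Hypothetical pruning rule (not taken from the paper): a point is kept
    only if, on every dimension in `dims`, at least `min_pts` points
    (itself included) project into [v - eps, v + eps].
    """
    candidates = set(range(len(points)))
    for d in dims:
        # Projections of ALL points on dimension d, sorted for binary search.
        values = sorted(p[d] for p in points)
        keep = set()
        for i in candidates:
            v = points[i][d]
            count = bisect_right(values, v + eps) - bisect_left(values, v - eps)
            if count >= min_pts:
                keep.add(i)
        candidates = keep
    return sorted(candidates)


def group_candidates(points, candidates, dim, eps):
    """Split candidates wherever their sorted projection on `dim` has a gap > eps.

    Hypothetical grouping rule: candidates separated by such a gap cannot be
    linked by a chain of candidates with steps of length <= eps, so each
    density-connected set of candidates lies inside a single group.
    """
    ordered = sorted(candidates, key=lambda i: points[i][dim])
    groups = []
    for i in ordered:
        if groups and points[i][dim] - points[groups[-1][-1]][dim] <= eps:
            groups[-1].append(i)
        else:
            groups.append([i])
    return groups


if __name__ == "__main__":
    # Two dense blobs in the subspace spanned by dimensions 0 and 1, plus two
    # isolated points; dimension 2 is irrelevant to the clusters.
    data = [
        (0.0, 0.0, 9.0), (0.1, 0.1, 1.0), (0.2, 0.0, 5.0),
        (5.0, 5.0, 2.0), (5.1, 5.1, 8.0), (5.2, 5.0, 3.0),
        (2.5, 9.0, 0.0), (9.0, 2.5, 4.0),
    ]
    kept = prune_candidates(data, dims=(0, 1), eps=0.5, min_pts=3)
    print(kept)
    print(group_candidates(data, kept, dim=0, eps=0.5))
```

A density clustering step would then run separately inside each group, which is where the reduction in work would come from under these assumptions.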

Authors: 张强 (Zhang Qiang), 吴腾飞 (Wu Tengfei), 杨颖 (Yang Ying)


Source: Journal of Tianjin University (Science and Technology) (《天津大学学报》), 2010, No. 7, pp. 623-628 (6 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.

Funding: Tianjin Municipal Higher Education Science and Technology Development Fund (20080810); China Postdoctoral Science Foundation (20090450767).

Keywords: high-dimensional data; subspace clustering; data mining

Classification number: TP391 [Automation and Computer Technology: Computer Application Technology]


References (8)

1 Jing Liping, Ng M K, Huang J Z. An entropy weighting k-means algorithm for subspace clustering of high-dimensional sparse data[J]. IEEE Transactions on Knowledge and Data Engineering, 2007, 19(8): 1026-1041.
2 Yiu M L, Mamoulis N. Iterative projected clustering by subspace mining[J]. IEEE Transactions on Knowledge and Data Engineering, 2005, 17(2): 176-189.
3 Liu G, Li J, Sim K, et al. Distance based subspace clustering with flexible dimension partitioning[C]//Proceedings of the 23rd International Conference on Data Engineering. Istanbul, Turkey, 2007: 1250-1254.
4 Agarwal N, Haque E, Liu H, et al. Research paper recommender systems: A subspace clustering approach[C]//Advances in Web-Age Information Management. Hangzhou, China, 2005, 3739: 475-491.
5 Agrawal R, Gehrke J, Gunopulos D, et al. Automatic subspace clustering of high dimensional data[J]. Data Mining and Knowledge Discovery, 2005, 11(1): 5-33.
6 Procopiuc C M, Jones M. A Monte Carlo algorithm for fast projective clustering[C]//Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data. New York, USA, 2002: 418-427.
7 Guha S, Rastogi R, Shim K. CURE: An efficient clustering algorithm for large databases[J]. Information Systems, 2001, 26(1): 35-58.
8 Wang X, Hamilton H J. DBRS: A density-based spatial clustering method with random sampling[C]//Advances in Knowledge Discovery and Data Mining. Seoul, Korea, 2003, 2637: 563-575.

