基于时间衰减和密度的任意簇数据流聚类

A data stream clustering algorithm based on time recession and arbitrary shape

下载PDF

导出

摘要数据挖掘的一个重要分支是数据流聚类技术。基于K均值算法的基础提出了CluTA算法。该算法在处理用K均值方法分类得到的结果时考虑时间衰减因素和相似簇的合并,达到用户对时间的要求并实现了任意形状簇聚类。理论分析和实验结果都表明算法具有可行性。 Data mining technology is an important branch of the data stream mine. This paper proposed a new algorithm named CluTA which based on kmeans algorithm. This algorithm consider time factor and merged similar sets when processed results of kmeans, it could realize users requirement of time limits and product arbitrary shape date set. Theoretic analysis and experimental results showed that CluTA is feasibility.

作者龚云赵鹏王守军

机构地区安徽大学计算机科学与技术学院

出处《微型机与应用》 2011年第6期17-19,共3页 Microcomputer & Its Applications

基金安徽省教育厅重点科研项目(KJ2009A001Z)

关键词数据流密度聚类均值关键点时间衰减 data stream density based clustering key point time recession

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献5

1Hart Jiawei. Micheline. Data Mining:Concepts and Techniques, Second Edition[M].China Machine Press,2008.
2AGGARWAL C C, et al. A framework for clustering evolving data streams.In:Proc.of the 29th VLDB Conf.,2003.
3GUHA S, MISHRA N, MOTWANI R. Clustenng data streams[C].Proceedings of the Annual Symposium on Foun dations of Computer Science .2000.
4倪巍伟,陆介平,陈耿,孙志挥.基于k均值分区的流数据高效密度聚类算法[J].小型微型计算机系统,2007,28(1):83-87. 被引量：8
5HALKIDI M, VAZIRGIANNIS M. Clustering validity assessment ;finding the optimal partitioning of adata set[C]. ICDM 2001 : 187-194.

二级参考文献9

1倪巍伟,孙志挥,陆介平.k-LDCHD——高维空间k邻域局部密度聚类算法[J].计算机研究与发展,2005,42(5):784-791. 被引量：18
2Han Jia-wei.Micheline.Data mining:concepts and techniques[M].Morgan Kaufmann Publishers,San Fransisco,CA,2000.
3Ester M,Kriegel HP,Sander J,et al.A density based algorithm of discovering clusters in large spatial databases with noise[C].In:Simoudis E,Han JW,Fayyad UM,eds.Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining Portland,AAAI Press,1996:226-231.
4Zhang T,Ramakrishnan R,Livny M.BIRCH:an efficient data clustering method for very large databases[C].In:Jagadish HV,Mumick IS,eds.Proc.of the 1996 ACM SIGMOD Int.Conf.on Management of Data.Montreal:ACM Press,1996:103-114.
5Guha S,Rostogi R,Shim K.CURE:an efficient clustering algorithm for large databases[C].In:Haas LM,Tiwary A,eds.Proceedings of the ACM SIGMOD International Conference on Management of Data Seattle.ACM Press,1998:73-84.
6Wang W,Yang J,Muntz R.STING:a statistical information grid approach to spatial data mining[C].Proc.Int.Conf.on Very Large Databases(VLDB97),1997:186-195.
7Guha S,Mishra N,Motwani R.Clustering data streams[C].In:Proceedings of the Annual Symposium on Foundations of Computer Science,2000:359-366.
8Liadan OCallaghan,Nina Mishra,Adam Meyerson,Sudipto Guha,Rajeev Motwani.Streaming-data algorithms for high-quality clustering[C].In:Proceedings of IEEE International Conference on Data Engineering,2002:685-696.
9Maria Halkidi,Michalis Vazirgiannis.Clustering validity assessment:finding the optimal partitioning of a data set[C].ICDM 2001:187-194.

共引文献7

1吾守尔.斯拉木,李丰军,陶梅.IBORA:一种改进的有效的边界点检测[J].小型微型计算机系统,2008,29(10):1845-1848.
2印桂生,于翔,宁慧.基于粗约简的数据流增量聚类算法[J].西南交通大学学报,2009,44(5):637-642. 被引量：2
3吴磊,彭德中,彭磊,曾家智.结合Mercer核与SOM的动态免疫网络聚类算法[J].小型微型计算机系统,2010,31(2):333-337. 被引量：3
4樊龙军,李艳,吴磊,陈鹏.基于动态免疫网络的聚类算法[J].福建电脑,2011,27(5):1-4.
5李杨,檀柏红.基于点击流的频繁模式聚类算法研究[J].天津科技大学学报,2011,26(3):69-73.
6钱晨嗣,陈伟鹤.基于转发关系和单词特征的微博话题识别模型[J].信息技术,2018,42(9):44-49.
7陈华,陈伟旭,雷衍,王亚伟.基于引力原理的聚类问题一个新算法[J].新型工业化,2014,4(6):67-71. 被引量：3

1郭芸,刘纯平,龚声蓉.3D Zernike径向多项式的性质和快速算法[J].江苏大学学报（自然科学版）,2016,37(2):188-193.
2董作霖,刘宏飞,李明.面向传感器网络的高能效任务分配算法研究[J].太原理工大学学报,2006,37(5):593-596.
3刘欣亮,裴亚辉.基于用户反馈的时序二部图推荐方法[J].河南大学学报（自然科学版）,2015,45(2):229-234. 被引量：1
4李奕诺,肖如良,倪友聪,苏小敏,杜欣,蔡声镇.基于动态环境衰减的粒子滤波室内定位算法[J].计算机应用,2015,35(9):2465-2469. 被引量：2
5尚兆梅,陈波,彭勇.R-Theta算法的一种快速实现方式[J].科学技术与工程,2010,10(31):7803-7806. 被引量：2
6刘志建,关维国,华海亮,孙泽鸿.基于克里金空间插值的位置指纹数据库建立算法[J].计算机应用研究,2016,33(10):3139-3142. 被引量：13
7冯焕霞,刘莉,李正淳.异构集群下的动态任务调度策略[J].软件导刊,2014,13(6):23-26. 被引量：2
8余以胜.一种基于上下文感知的网络交易信任评价模型[J].电脑编程技巧与维护,2017(8):70-71.
9李昕,孟祥福.基于相似性推荐的电子商务Web数据库关键字近似查询方法[J].小型微型计算机系统,2015,36(7):1487-1491. 被引量：4
10刘莉,姜明华.异构集群下的任务调度算法研究[J].计算机应用研究,2014,31(1):80-84. 被引量：7

微型机与应用

2011年第6期

浏览历史

内容加载中请稍等...

基于时间衰减和密度的任意簇数据流聚类

参考文献5

二级参考文献9

共引文献7

相关作者

相关机构

相关主题

浏览历史