期刊文献+

分布式密度和中心点数据流聚类算法的研究 被引量:7

RESEARCH ON DISTRIBUTED DATA STREAM CLUSTERING BASED ON DENSITY AND CENTRES
下载PDF
导出
摘要 分析分布式数据流聚类算法的基本框架结构,针对CluStream算法对非球形聚类效果不佳提出一种基于密度和中心点的分布式数据流聚类算法DDCS-Clustering(Distributed Density and Centers Stream Clustering)。该算法应用密度、中心点与衰减时间窗口,在分布式环境下对数据流进行聚类。实验结果表明,DDCS-Clustering算法具有较高的聚类质量与较低的通信代价。 We analyse the basic structure of distributed data stream clustering algorithm, and propose a kind of distributed data stream clustering algorithm which is based on ~tensity and centre points named the DDCS-clustering aiming at that the CluStream algorithm is less effective for non-spherical cluster. The algorithm applies the density, centre points and decay time windows mechanism, clusters the data stream under the distributed environment. Experimental result shows that the DDCS-clustering algorithm has higher clustering quality and lower communication cost.
出处 《计算机应用与软件》 CSCD 北大核心 2013年第10期181-184,共4页 Computer Applications and Software
基金 广东省自然科学基金项目(S2011010003681)
关键词 密度 中心点 分布式 数据流聚类 Density Centres Distributed Data stream clustering
  • 相关文献

参考文献8

  • 1Minos Garofalakis. Distributed Data Streams. Yahoo! Research and Univ. of California, Berkely[ R/OL]. http ://www. softnet, tuc. gr/-minos/Papers/eds09 dstreams, pdf.
  • 2Charu C Aggarwal, Han Jiawei, Wang Jianyong, et al. A Framework for Clustering Evolving Data Streams [ C ]//Proceedings of the 29th VLDB Conference, Berlin, Germany, 2003.
  • 3Luo Ke, Wang Lin. Data Streams Clustering Algorithm Based on Grid and Particle Swarm Optimization [C]//2009 International Forum on Computer Science-Technology and Applications.
  • 4张晓龙,曾伟.基于衰减窗口与剪枝维度树的实时数据流聚类[J].计算机应用研究,2009,26(4):1331-1334. 被引量:4
  • 5Amineh Amini, The Ying Wah. Density Micro-Clusteirng Algorithms on Data Streams: A Review [C]//Proceedings of the International MultiConference of Engineers and Computer Scientists 2011 Vol 1, IMECS 2011, March 16 -15, 2011, Hang Kong.
  • 6Martin Ester, Hans-Peter Kriegel, Jorg Sander, et al. A Density Algorithm for Discovering Clusters in Large Spatial Databases with Noise [ C ]//Proceedings of 2^nd International Conference on Knowledge Dis- covery and Data Mining( KDD-96 ).
  • 7Aadtiya Desai, Dr. Sunil Rai. Analysis of Machine Learning Atgorithms usirig WEKA [ C ]//International Conference & Workshop on Recent Trends in Technology, (TCET) 2012, Proceedings published in International Journal of Computer Applications (IJCA).
  • 8Albert Brief, Geoff Holmes, Bernhard Pfahringer, et al. MOA: Massive Online Analysis, a Framework for Stream Classification and Clustering[ C ]//JMLR : Workshop and Conference Proceedings 2010,11 : 44 - 50.

二级参考文献11

  • 1GOLAB L. Issues in data stream management [ J ]. ACM SIGMOD Record, 2003, 32(2) : 5- 14.
  • 2BABCOCK B, BABU S, DATAR M, et al. Models and issues in data stream systems [ C ]//Proc of the 21st ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems. New York: ACM Press, 2002: 1- 16.
  • 3GABER M M, ZASLAVSKY A, KRISHNASWAMY S. Mining data streams: a review[J]. ACM SIGMOD Record, 2005, 34 (2) : 18- 26.
  • 4MUTHUKR/SHNAN S. Data streams: algorithms and applications [ M ]. Hanover: Now Publishers Inc, 2005.
  • 5GUHA S, MEYERSON A, MISHRA N, et al. Clustering data streams [ C]//Proc of the 41st Annual Symposium on Foundations of Computer Science. Washington DC : IEEE Computer Society, 2000 : 359- 366.
  • 6O'CALLAGHAN L. Streaming data algorithms for high quality clustering[C]//Proc of the 18th International Conference an Data Engineering. Massachusetts: IEEE Computer Society, 2002 : 685- 694.
  • 7AGGARWAL C C, HAN Jia-wei, WANG Jian-yong, et al. A framework for clustering evolving data streams[ C]//Proc of the 29th VLDB Conference. Berlin: [s. n. ], 2003: 81-92.
  • 8孙焕良.流数据聚类分析与孤立点检测算法的研究[D].沈阳:东北大学,2005.
  • 9AGGARWAL C C, HAN Jia-wei, WANG Jian-yong, et al. A framework for projected clustering of high dimensional data streams [ C ]// Proc of the 30th VLDB Conference. Toronto: [ s. n. ], 2004:852- 863.
  • 10CHEN Yi-xi, TU Li. Density-based clustering for real-time stream data [ C ]//Proc of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. California : [ s. n. ], 2007 : 133- 142.

共引文献3

同被引文献51

引证文献7

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部