期刊文献+

一种基于K-means的分布式聚类算法 被引量:7

A distributed clustering algorithm based on K-means algorithm
下载PDF
导出
摘要 为解决现有的分布式聚类算法效率低下和不能保护数据隐私的问题,在K-Dmeans算法的基础上,提出一种新的分布式聚类算法。该算法利用数据对象间的密度函数值来优化站点初始聚类中心,从而大大降低了聚类的迭代次数;同时各从站点只需向主站点传送其聚簇的特征信息,有效降低分布式聚类过程中的通信量,保护了各个站点的独立性。实验结果表明,该算法有效可行,且在效率和聚类质量上优于K-Dmeans。 In order to improve the privacy protection and efficiency of the existing distributed clustering algorithms.A new algorithm based on the K-Dmeans algorithm was proposed,The algorithm used the value of the density function to optimize the sets' centers of K-means clustering algorithm,significantly reducing the number of iterations of clustering,while the slaver sets only sent the information of the clustering,reducing the distributed clustering process of traffic and protecting the independence of each site.Experimental results show that the proposed clustering algorithm is effective and feasible,which is better than K-Dmeans and K-means algorithm in efficiency and quality.
出处 《桂林电子科技大学学报》 2011年第6期460-463,共4页 Journal of Guilin University of Electronic Technology
基金 国家自然科学基金(10502050)
关键词 聚类 分布式聚类算法 K-MEANS算法 K-Dmeans算法 clustering distributed clustering algorithm K-means algorithm K-Dmeans algorithm
  • 相关文献

参考文献10

  • 1JAIN A K, MURTY M N, FLYNN P J. Data clusteringz A review[J]. ACM Computing Surveys, 1999,31 (3) : 265-323.
  • 2Han Jiawei, KAMBER M. Data Mining Concepts and Tech- nique, Second Edition[M]. Beijing: China Machine Press, 2007 : 251-300.
  • 3ESTER M, KRIEGEL H P, SANDER J, et al. A density based algorithm of discovering clusters in large spatial databases with noise[C]//Proc of the 2^nd Int' 1 Conf on Knowledge Discovery and Data Mining. Portland: AAAI Press, 1996 : 226-231.
  • 4GUHA S, RAJEEV R, KYUSEOK S. CURE, An efficient clustering algorithm for large databases[C]//Proc of the ACMSIG- MOD Int'1l ConfOil of Data. New York: ACMPress, 1998: 73- 84.
  • 5郑苗苗,吉根林.DK-Means——分布式聚类算法K-Dmeans的改进[J].计算机研究与发展,2007,44(z2):84-88. 被引量:9
  • 6PRODIO H, LAWRENCE H, Scalable clustering : A distributed approach[C]//The IEEE Int'1 Conf on Fuzzy Systems. Buda- pest: Hungary, 2004 : 88-102.
  • 7KANTABUTRA S, COUCH A L. Parallel K-means clustering algorithm on rows[J]. NECTEC Technical Journal, 2000,1(6): 243-247.
  • 8Iris. [DB/OL]. (1998-07-27). http://www, ics. edu/- relearn/ databases/iris.
  • 9Kddcup99. [DB/OL]. (1999-09-15). http://www, kdd. uci. edu/databases/kddcup99/kddcup99, html.
  • 10MODHA D S, SPANGLER W S. Feature weighting in Kmeans clustering [ J ]. Machine Learning, 2003, 52 ( 3 ) : 217- 237.

二级参考文献12

  • 1[1]A K Jain,M N Murty,P J Flynn.Data clustering:A review.ACM Computing Surveys,1999,31(3):265-323
  • 2[2]H Jiawei,M Kamber.Data Mining:Concepts and Techniques.San Francisco:Morgan Kaufmann,2000.232-233
  • 3[3]M Ester,H P Kriegel,J Sander,et al.A density based algorithm of discovering clusters in large spatial databases with noise.In:Proc of the 2nd Int'lConf on Knowledge Discovery and Data Mining.Portland:AAAI Press,1996.226-231
  • 4[4]S Guha,R Rajeev,S Kyuseok.CURE:An efficient clustering algorithm for large databases.In:Proc of the ACM SIGMOD Int'lConf on Management of Data.New York:ACM Press,1998.73-84
  • 5[5]T Zhang,R Ramakrishnan,M Livny.Birch:An efficient data clustering method for very large databases.In:Proc of the ACM SIGMOD Int'lConf on Management of Data.New York:ACM Press,1996.73-84
  • 6[6]S Kantabutra,A L Couch.Parallel k-means clustering algorithm on nows.NECTEC Technical Journal,2000,1(6):243-247
  • 7[7]H Prodio,H Lawrence.Scalable clustering:A distributed approach.The IEEE Int'lConf on Fuzzy Systems,Budapest,Hungary,2004
  • 8[8]D K Tasoulis,M N Vrahatis.Unsupervised distributed clustering.The IASTED Int'lConf on the Parallel and Distributed Computing and Networks,Innsbruck,2004
  • 9[9]E Januzaj,H P Kriegel,M Pfeifle.DBDC:Density based distributed clustering.In:Proc of the 9th Int'lConf on Extending Database Technology.Berlin:Springer,2004.88-105
  • 10[10]http://www.ics.uci.edu/~mlearn/databases/iris/,1998

共引文献8

同被引文献40

  • 1段翔兮,高剑,李熠,龙呈,邹琬,何锐.基于大数据挖掘技术的电网电压越限成因诊断方法研究[J].电子测量技术,2020,43(8):81-85. 被引量:9
  • 2王金丽.农村“低电压”问题原因剖析[J].农村电气化,2012(S1):7-8. 被引量:3
  • 3柴岳军,杨杰.因地制宜、多措并举治理“低电压”[J].农村电气化,2012(S1):105-107. 被引量:4
  • 4李玉鑑.符号序列之间的归一化距离度量[J].北京工业大学学报,2005,31(4):439-442. 被引量:5
  • 5Saukh O,Sauter R,Gauger M,et al. On boundary recog- nition without location information in wireless sensor networks[C]//2008 International Conference on Infor- mation Processing in Sensor Networks. New York: As- sociation for Computing Machinery,2008.
  • 6Fang Q,Gao J,Guibas L. Locatingand bypassing rou- ting holes in sensor networks[C]//Proc Mobile Net- works and Applications. Hongkong:Institute of Electri- cal and Electronics Engineers Inc,2006:187-200.
  • 7Funke S, Klein C. Hole detection or:how much geome-try hides in connectivity? [C]//Proc 22nd ACM Sym- pos. Computational Geometry. Sedona, AZ: Association for Computing Machinery,2006:377-385.
  • 8Wang Y, Gao J, Mitchell S ]3. Boundary recognition in sensor networks by topological methods[C]//12th An- nual international Conference on Mobile Computing and Networking (MobiCom'06). Los angeles, USA: Associa- tion for Computing Machinery,2006.
  • 9Fekete S P, Kr611er A, Pfisterer D, et al. Neighborhood- based topology recognition in sensor networks [C]// Proc Algosensors, Springer LNCS. Stevenage United Kingdom: Institution of Engineering and Technology, 2004,3121 : 123-136.
  • 10KRISHNA B S V,SATHEESH P, SUNEEL KUMAR R. Comparative study of k-means and bisecting k-means techniques in wordnet based document clustering [J]. International Journal of Engineering and Advanced Technology, 2012,2249 : 8958.

引证文献7

二级引证文献26

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部