期刊文献+

一种分裂式的k-means聚类算法 被引量:1

A Split K-means Clustering Algorithm
下载PDF
导出
摘要 k-means是一种快速有效的聚类算法,但是随着数据量的增加,k-means算法的局限性日益突出。该文从数据预处理,初始聚类中心的选取,最佳聚类数的确定等几个方面优化了k-means算法。仿真实验表明,优化后的k-means算法在稳定性和准确性方面都有很大的提高,证明提出的算法有一定的价值。 The k-means algorithm is fast and effective.With increasing number of data,the limitations of k-means algorithm have become increasingly prominent.This paper presents an improved k-means algorithm from data preprocessing,initial clustering centers choosing and the best number of clusters' determination for better clustering results.The experiments demonstrate that the improved k-means algorithm has a good performance with stability and accuracy.
作者 楼佳 王小华
出处 《杭州电子科技大学学报(自然科学版)》 2009年第4期54-57,共4页 Journal of Hangzhou Dianzi University:Natural Sciences
关键词 聚类 数据预处理 初始聚类中心 clustering data processing initial clustering center
  • 相关文献

参考文献6

二级参考文献32

  • 1荆丰伟,刘冀伟,王淑盛.改进的K-均值算法在岩相识别中的应用[J].微计算机信息,2004,20(7):41-42. 被引量:5
  • 2袁方,孟增辉,于戈.对k-means聚类算法的改进[J].计算机工程与应用,2004,40(36):177-178. 被引量:48
  • 3Guha S,Rastogi R,Shim K.Cure:an efficient clustering algorithm for large database[C]//Proc of ACM-SIGMOND lnt Conf Managemerit on Data, Seattle, Washington, 1998 . 73-84.
  • 4Ester M,Kriegel H P,Sander J.A density-based algorithm tier discovering chlsters in large spatial databases with noise[C]//Proc 2nd Int Conf on Knowledge Discovery and Data Mining.Portland, 1999.20:226-231.
  • 5Treshansky A,McGraw R.An overview of clustering algorithms[A].Proceedings of SPIE,The International Society for Optical Engineering[C].2001(4367):41-51.
  • 6Clausi D A.K-means Iterative Fisher (KIF) unsupervised clustering algorithm applied to image texture segmentation[J].Pattern Recognition,2002,35:1959-1972.
  • 7Bezdek J C,Pal N R.Some new indexes of cluster validity[J].IEEE Transactions on Systems,Man,and Cybernetics _ Part B:Cybernetics,1998,28(3):301-315.
  • 8Ramze R M,Lelieveldt B P F,Reiber J H C.A new cluster validity indexes for the fuzzy c-mean[J].Pattern Recognition Letters,1998,19:237-246.
  • 9JiaweiHan MichelineKamber 范明 孟小峰 译.数据挖掘概念与技术[M].北京:机械工业出版社,2002..
  • 10E M Knorr,R T Ng,V Tucakov. Distance-Based Outliers :Algorithms and Applications[J].VLDB Journal:Very Large Databases,2000:237~253

共引文献342

同被引文献9

引证文献1

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部