期刊文献+

K-Means聚类算法的研究 被引量:134

The Research about Clustering Algorithm of K-Means
下载PDF
导出
摘要 K-Means算法是一种经典的聚类算法,有很多优点,也存在许多不足。比如初始聚类数K要事先指定,初始聚类中心选择存在随机性,算法容易生成局部最优解,受孤立点的影响很大等。文中主要针对K-Means算法初始聚类中心的选择以及孤立点问题加以改进,首先计算所有数据对象之间的距离,根据距离和的思想排除孤立点的影响,然后提出了一种新的初始聚类中心选择方法,并通过实验比较了改进算法与原算法的优劣。实验表明,改进算法受孤立点的影响明显降低,而且聚类结果更接近实际数据分布。 The algorithm of K-means is one kind of classical clustering algorithm,including both many points and also shortages.For example must choose the initial clustering number.The choose of initial clustering centre has randomness.The algorithm receives locally optimal solution easily,the effect of isolated point is serious.Mainly improved the choice of initial clustering centre and the problem of isolated point.First of all,the algorithm calculated distance between all data and eliminated the effect of isolated point.Then proposed one new method for choosing the initial clustering centre and compared the algorithm having improved and the original algorithm using the experiment.The experiments indicate that the effect of isolated point for algorithm having improved reduces obviously,the results of clustering approach the actual distribution of the data.
出处 《计算机技术与发展》 2011年第2期62-65,共4页 Computer Technology and Development
基金 安徽省教育科研重点项目(KJ2009A57)
关键词 K-MEANS算法 初始聚类中心 孤立点 K-Means initial clustering centre isolated point
  • 相关文献

参考文献11

  • 1周卫星,廖欢.基于K均值聚类和概率松弛法的图像区域分割[J].计算机技术与发展,2010,20(2):68-70. 被引量:10
  • 2Mac Q J. Some methods for classification and analysis of mult- ivariate observations [ C ]//In: Proc. 5th Berkeley Symposium in Mathematics. Berkeley, USA : Univ of California, 1967.
  • 3GUHA S, RASTOGI R, SHIM K. CURE: An efficient clustering algorithm for large databases [ C ]//Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data. New York: ACM Press, 1998: 73-84.
  • 4Ester,Martin, Hans Peter Kriegel, et al. A Density Based Algoriihm for Discovering Clusters in Large Spatial Databases with Noise [ C ]//Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining(KDD-96). Ortland,Oregon: [ s. n. ] ,.1996.
  • 5Wang W, Yang J, Muntz R. STING : A Statistical Information Grid Approach to Spatial Data Mining[ C ]//Proc. of 1997 Intl. Conf. on Very Large Databases. Athens, Greece : [ s. n. ], 1997 : 186-195.
  • 6Kohonen T. Self -- organized Formation of Topologically Correct Feature Maps [ J ]. Biological Cybernetics, 1982,43 ( 1 ) : 59 -69.
  • 7李业丽,秦臻.一种改进的k-means算法[J].北京印刷学院学报,2007,15(2):63-65. 被引量:9
  • 8苏锦旗,薛惠锋,詹海亮.基于划分的K-均值初始聚类中心优化算法[J].微电子学与计算机,2009,26(1):8-11. 被引量:33
  • 9步媛媛,关忠仁.基于K-means聚类算法的研究[J].西南民族大学学报(自然科学版),2009,35(1):198-200. 被引量:22
  • 10连凤娜,吴锦林,唐琦.一种改进的K-means聚类算法[J].电脑与信息技术,2008,16(1):38-40. 被引量:23

二级参考文献37

共引文献125

同被引文献1167

引证文献134

二级引证文献559

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部