期刊文献+

一种改进的k-means算法 被引量:9

An Improved Algorithm of k-means
下载PDF
导出
摘要 k-means(k均值)算法是聚类方法中常用的一种划分方法。该算法适合对海量数据进行聚类,对球状、凸形分布的数据具有很好的聚类效果,但该算法有其突出的局限性,少量的孤立点就会对聚类结果产生很大的影响,因此,采用聚类均值点与聚类种子相分离的思想,给出了基于该思想的对k均值算法的改进算法。实验表明,该改进算法比原k均值算法具有更高的准确性。 K-means algorithm is a widely used partition method in clustering. The algorithm is suitable for the spherical wave data. The algorithm has a good result for spherical, protruding data. However, the algorithm has its prominent limitations. A small number of isolated points would have a considerable impact on the clustering results. This paper study presents an idea to separate the clustering centroid from the clustering seed and completes an algorithm based on this idea, improving the k-means algorithm. It also provides a specific ideology based on the k-means algorithm to improve the algorithm. The paper presents the results of the experiments to prove that this algorithm is more veracious than the k-means algorithm.
作者 李业丽 秦臻
出处 《北京印刷学院学报》 2007年第2期63-65,共3页 Journal of Beijing Institute of Graphic Communication
关键词 数据挖掘 聚类算法 K-MEANS算法 data mining clustering algorithm k-means algorithm
  • 相关文献

参考文献3

二级参考文献19

  • 1(加)HanJ KamberM 范明 盂小峰 等译.数据挖掘概念与技术m[M].北京:机械工业出版社,2001.223-262.
  • 2..http://lib, slat. Cmu. Edu/datasets/places. Data,.
  • 3Forgy E. Cluster analysis of multivariate data: Efficiency vs. interpretabillty of classifications[ M]. Biometrics, 1965, 21(3) : 768.
  • 4MacQueen J. Some methods for classlfication and analysis of multivariate observations[ A]. Proceedinss of the Fifth Berkeley Symposium on Mathematical Statistics and Probability[ C]. Volume 1. Le-Cam LM, Neyman N, Ed. University of California Press, 1967.
  • 5Duda RO, Hart PE. Pattern Classification and Scene Analysis[ M].New York: John Wiley and Sons, 1973.
  • 6Selim SZ, Alsultan K. A Simulated Annealing Algorithm for the Clustering Problem[J]. Pattern Recognition, 1991, 24(10): 1003- 1008.
  • 7Fayyad U, Reina C, Bradley PS. Initialization of Iterative Refinement Clustering Algorithms[ R]. Microsoft Research Technical Report MSR-TR-98-38, June 1998.
  • 8Selim SZ, Ismail MA. K-Means-Type Algorithms: A Generalized Convergence Theorem and Charadterization of Local Optimality[ M].IEEE Trans Pattern Analysis and Machine Intelligence, 1984, PA-MI-6(1).
  • 9Kaufman L, Rouseeuw P. Finding Groups in Data: An Introduction to Cluster Analysis[ M]. New York : John Wiley and Sons, 1990.
  • 10Alsabti K, Ranks S, Singh V. An Efficient K-Means Clustering Algorithm[ A]. Proc. First Workshop on High-Performance Data Mining[C], 1997.

共引文献97

同被引文献55

引证文献9

二级引证文献184

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部