期刊文献+

一种改进的K-means算法 被引量:72

An Improved K-means Algorithm
下载PDF
导出
摘要 聚类分析在科研和商业应用中都有着非常重要的应用,K means算法是聚类方法中常用的一种划分方法。随着数据量的增加,K means算法的局限性日益突出。基于取样的划分思想,提出了一种改进的K means算法,在一定程度上避免了聚类结果陷入局部解的现象,减少了原始K means算法因采用误差平方和准则函数而出现将大的聚类簇分割开的情况,仿真实验结果表明:改进后的K means算法优于原始算法,并且稳定性更好。 Clustering analysis plays an important role in scientific research and commercial application. Kmeans algorithm is a widely used partition method in clustering. As the datasets scale increases rapidly, it is difficult to use Kmeans to deal with massive data. An improved Kmeans algorithm is presented, which can avoid getting into locally optimal solution in some degree, and reduce the probability of dividing a big cluster into two or more ones owing to the adoption of Jc. The experiments demonstrate the improved Kmeans is more stable and more accurate.
出处 《计算机应用》 CSCD 北大核心 2003年第8期31-33,60,共4页 journal of Computer Applications
关键词 聚类 K-MEANS算法 误差平方和准则函数 Clustering the K-means Algorithm J_c
  • 相关文献

参考文献13

  • 1王实,高文.数据挖掘中的聚类方法[J].计算机科学,2000,27(4):42-45. 被引量:88
  • 2(加)HanJ KamberM 范明 盂小峰 等译.数据挖掘概念与技术m[M].北京:机械工业出版社,2001.223-262.
  • 3唐立新,杨自厚,王梦光.用遗传算法改进聚类分析中的K-平均算法[J].数理统计与应用概率,1997,12(4):350-356. 被引量:23
  • 4陈恩红,王上飞,宁岩,王煦法.一种利用代表点的有效聚类算法设计与实现[J].模式识别与人工智能,2001,14(4):417-422. 被引量:10
  • 5..http://lib, slat. Cmu. Edu/datasets/places. Data,.
  • 6Forgy E. Cluster analysis of multivariate data: Efficiency vs. interpretabillty of classifications[ M]. Biometrics, 1965, 21(3) : 768.
  • 7MacQueen J. Some methods for classlfication and analysis of multivariate observations[ A]. Proceedinss of the Fifth Berkeley Symposium on Mathematical Statistics and Probability[ C]. Volume 1. Le-Cam LM, Neyman N, Ed. University of California Press, 1967.
  • 8Duda RO, Hart PE. Pattern Classification and Scene Analysis[ M].New York: John Wiley and Sons, 1973.
  • 9Selim SZ, Alsultan K. A Simulated Annealing Algorithm for the Clustering Problem[J]. Pattern Recognition, 1991, 24(10): 1003- 1008.
  • 10Fayyad U, Reina C, Bradley PS. Initialization of Iterative Refinement Clustering Algorithms[ R]. Microsoft Research Technical Report MSR-TR-98-38, June 1998.

二级参考文献2

  • 1Cheng C H,Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,1999年,84页
  • 2Zhang T,Proceedings of the ACM SIGMOD International Confernece on Management of Data,1996年,103页

共引文献117

同被引文献479

引证文献72

二级引证文献839

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部