Study and Application of k Value Optimization Based on the k-means Clustering Algorithm (cited by: 6)
Abstract: The k-means algorithm is a widely used clustering algorithm, but it is sensitive to the number of clusters k, and its performance depends primarily on how k is optimized. This paper therefore summarizes the state of research and recent progress on the k-means algorithm. First, representative k-means algorithms that optimize k are analyzed and outlined in terms of their ideas and key techniques. Second, well-known data sets are used to test several typical algorithms, with the comparison focused on how different k-optimization approaches behave on the same data set. The work is intended as a useful reference for research in cluster analysis and data mining.
Author: 顾洪博 (Gu Hongbo)
Source: Natural Science Journal of Hainan University (《海南大学学报(自然科学版)》), CAS, 2009, No. 4, pp. 386-389 (4 pages)
Funding: Natural Science Foundation of Heilongjiang Province (F200603)
Keywords: k-means algorithm; validity measure; optimization of k
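
As a rough illustration of what choosing k with a validity measure can look like, the sketch below runs a plain k-means for several candidate values of k and keeps the one with the highest mean silhouette score. This is not taken from the paper: the silhouette index is only one common validity measure and is an assumption here, the toy data set is synthetic, and the names `kmeans`, `mean_silhouette`, `scores`, and `best_k` are hypothetical.

```python
import math


def kmeans(points, k, iters=100):
    """Plain Lloyd-style k-means with deterministic farthest-point seeding."""
    centers = [points[0]]
    while len(centers) < k:
        # Next seed: the point farthest from its nearest existing seed.
        centers.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centers))
        )
    labels = None
    for _ in range(iters):
        # Assignment step: each point goes to its nearest center.
        new_labels = [
            min(range(k), key=lambda j: math.dist(p, centers[j])) for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the run has converged
        labels = new_labels
        # Update step: move each center to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old center if a cluster becomes empty
                centers[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels


def mean_silhouette(points, labels):
    """Mean silhouette width; assumes at least two non-empty clusters."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    total = 0.0
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            continue  # a singleton cluster contributes a silhouette of 0
        # a: mean distance to the other members of the point's own cluster.
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(math.dist(p, q) for q in other) / len(other)
            for m, other in clusters.items()
            if m != lab
        )
        total += (b - a) / max(a, b)
    return total / len(points)


# Synthetic toy data (an assumption): three well-separated 3x3 grids of points.
offsets = (-0.5, 0.0, 0.5)
points = [
    (cx + dx, cy + dy)
    for cx, cy in [(0.0, 0.0), (10.0, 0.0), (5.0, 8.0)]
    for dx in offsets
    for dy in offsets
]

# Score each candidate k and keep the one with the highest validity score.
scores = {k: mean_silhouette(points, kmeans(points, k)) for k in range(2, 7)}
best_k = max(scores, key=scores.get)
print(best_k)
```

On data this cleanly separated the score peaks at the true number of groups; on real data sets the curve is usually flatter, which is one reason different validity measures can disagree about the best k.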