期刊文献+

对k-means聚类算法的改进 被引量:47

Improved k-means Clustering Algorithm
下载PDF
导出
摘要 提出了一种k-means聚类算法中寻找初始聚类中心的新方法。算法首先计算样本间的距离,然后根据样本点之间的距离寻找有可能是一类的数据,依据这些样本点形成初始聚类中心,从而得到较好的聚类结果。实验表明,改进后的方法相对于随机选取初始聚类中心具有较高的准确率。 This paper investigates the standard k-means clustering algorithm and gives an improved algorithm by selecting better initial centers that the algorithm begins with.First the paper computes distances between data points;then tries to find out the data points that are similar;finally constructs initial centers according to these data points.In the experiment,authors find that different data points lead to different results.If people can find initial centers that are consistent with the distribution of data,people could get good clusterings.According to the experiment,the improved k-means Clustering Algorithm can get higher accuracy.
出处 《计算机工程与应用》 CSCD 北大核心 2004年第36期177-178,232,共3页 Computer Engineering and Applications
基金 河北省自然科学基金(编号:603137) 河北省教育厅科研计划(编号:2001206 2002154)资助
关键词 K-MEANS聚类算法 聚类 模式识别 k-means clustering algorithm,clustering,pattern recognition
  • 相关文献

参考文献4

  • 1La Jolla. Alternatives to the k-means algorithm that find better clustering. Department of Computer Science and Engineering,University of California,San Diego,CA92093
  • 2Kaufan L,Rousseeuw PJ.Finding Groups in Data:an Introduction to Cluster Analysis[M].New York:John Wiley & Sons, 1990
  • 3Guha S,Rastogi R,Shim K.CURE:an efficient clustering algorithm for large databases[C].In:Haas LM,Tiwary A eds.Proceedings of the ACM SIGMOD International Conference on Management of Data,Seattle: ACM Press, 1998: 73~84
  • 4Agrawal R,Gehrke J,Gunopolos D et al. Automatic subspace clustering of high dimensional data for data mining application[C].In:Haas LM,Tiwary A eds. Proceedings of the ACM SIGMOD International Conference on Management of Data, Seattle: ACM Press, 1998: 94~105

同被引文献310

引证文献47

二级引证文献500

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部