期刊文献+

基于密度的K-means聚类中心选取的优化算法 被引量:48

Optimization algorithm of K-means clustering center of selection based on density
下载PDF
导出
摘要 针对传统的K-means算法对于初始聚类中心点和聚类数的敏感问题,提出了一种优化初始聚类中心选取的算法。该算法针对数据对象的分布密度以及计算最近两点的垂直中点方法来确定k个初始聚类中心,再结合均衡化函数对聚类个数进行优化,以获得最优聚类。采用标准的UCI数据集进行实验对比,发现改进后的算法相比传统的算法有较高的准确率和稳定性。 Aiming at the problem of traditional K-means algorithm which is sensitive to initial clustering center and the number of cluster,this paper proposed a kind of optimization algorithm of initial clustering center selection.The algorithm was accor-ding to the distribution density of data and calculated the two vertical halfway points recently to determine the initial clustering center,then combined the equalization function to optimize the cluster number and got the optimal cluster.Used the standard UCI data sets as the contrast experiment objects,and found that the improved algorithm has the high accuracy and relative stability compared with traditional algorithm.
出处 《计算机应用研究》 CSCD 北大核心 2012年第5期1726-1728,共3页 Application Research of Computers
基金 湖南省教育厅创新平台开放基金资助项目(11K069) 湖南省自然科学基金资助项目(07JJ6115) 智能制造湖南省高校重点实验室资助项目(2009IM06)
关键词 K-均值 数据挖掘 聚类中心 垂直中点 密度 K-means data mining clustering center vertical halfway point density
  • 相关文献

参考文献11

二级参考文献58

  • 1杨善林,李永森,胡笑旋,潘若愚.K-MEANS算法中的K值优化问题研究[J].系统工程理论与实践,2006,26(2):97-101. 被引量:187
  • 2李洁,高新波,焦李成.基于特征加权的模糊聚类新算法[J].电子学报,2006,34(1):89-92. 被引量:113
  • 3李永森,杨善林,马溪骏,胡笑旋,陈增明.空间聚类算法中的K值优化问题研究[J].系统仿真学报,2006,18(3):573-576. 被引量:39
  • 4冯征.一种基于粗糙集的K-Means聚类算法[J].计算机工程与应用,2006,42(20):141-142. 被引量:16
  • 5钱线,黄萱菁,吴立德.初始化K-means的谱方法[J].自动化学报,2007,33(4):342-346. 被引量:32
  • 6Han J, Kamber M. Data Mining Concepts and Techniques. Orlando, USA: Morgan Kaufmann Publishers, 2001
  • 7Huang J Z, Ng M K, Rang Hongqiang, et al. Automated Variable Weighting in K-means Type Clustering. IEEE Trans on Pattern Analysis and Machine Intelligence, 2005, 27 (5) : 657 - 668
  • 8Dhillon I S, Guan Yuqiang, Kogan J. Refining Clusters in High Dimensional Text Data//Proc of the 2nd SIAM Workshop on Clustering High Dimensional Data. Arlington, USA, 2002 : 59 - 66
  • 9Zhang B. Generalized K-Harmonic Means: Dynamic Weighting of Data in Unsupervised Learning//Proc of the 1 st SIAM International Conference on Data Mining. Chicago, USA, 2001 : 1 - 13
  • 10Sarafis I, Zalzala A M S, Trinder P W. A Genetic Rule-Based Data Clustering Toolkit//Proc of the Congress on Evolutionary Computation. Honolulu, USA, 2002 : 1238 - 1243

共引文献1634

同被引文献372

引证文献48

二级引证文献320

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部