期刊文献+

半监督的改进K-均值聚类算法 被引量:13

Semi-supervised improved K-means clustering algorithm
下载PDF
导出
摘要 K-均值聚类算法必须事先获取聚类数目,并且随机地选取聚类初始中心会造成聚类结果不稳定,容易在获得一个局部最优值时终止。提出了一种基于半监督学习理论的改进K-均值聚类算法,利用少量标签数据建立图的最小生成树并迭代分裂获取K-均值聚类算法所需要的聚类数和初始聚类中心。在IRIS数据集上的实验表明,尽管随机样本构造的生成树不同,聚类中心也不同,但聚类是一致且稳定的,迭代的次数较少,验证了该文算法的有效性。 K-means clustering algorithm acquires the number of clusters in advance.The random selection of the initial cluster centers will result in the instability and K-means clustering algorithm will be terminated in access to a local optimum value.In order to solve the problem,the improved K-means clustering algorithm based on semi-supervised learning theory obtains the number of clustering and initial clustering centers after building minimum spanning tree used by few label samples and splitting it iteratively.Although minimum spanning tree making up of random samples and initial clustering centers are different,the clustering is consistent and stable;the iteration is less than traditional K-means algorithm.It proves that the semi-supervised improved K-means algorithm is effective.
出处 《计算机工程与应用》 CSCD 北大核心 2009年第28期137-139,共3页 Computer Engineering and Applications
基金 国家自然科学基金专项基金No10826098~~
关键词 半监督学习 K-均值聚类 标签样本 最小生成树 semi-supervised learning K-means clustering labeled sample minimum spanning tree
  • 相关文献

参考文献7

  • 1Jain A K,Dubes R C.Algorithms for clustering data[M].Englewood Cliffs,NJ:Prentiee Hall, 1988.
  • 2Han J,Kamber M.Data mining:Concepts and techniques[M].San Francisco,US:Morgan Kaufmann, 2001.
  • 3MacQueen J.Some methods for classification and analysis of multivariate observations[C]//Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability,Berkeley,CA,1967,1: 281-297.
  • 4Pedrycz W,Vukovich G.Fuzzy clustering with supervision[J].Pattern Recognition, 2004,37(7 ) : 1339-1349.
  • 5Pedrycz W,Waletzky J.Fuzzy clustering with partial supervision[J]. IEEE Trans on Systems, Man, and Cybernetics-PartB, 1997,27 (5) : 787-795.
  • 6Zhang D Q,Tan K R,Chen S C.Semi-supervised kernel-based fuzzy c-rneans[C]//LNCS 3316:Proe of the Int'l Conf on Neural Information Processing(ICONIP'04).Berlin:Springer,2004:1229-1234.
  • 7Li Y J.A clustering algorithm based on maximal θ-distant subtrees[J].Pattern Recognition,2007,40(5) : 1425-1431.

同被引文献110

引证文献13

二级引证文献116

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部