期刊文献+

基于半监督的K-means聚类改进算法 被引量:1

Clustering Algorithm Based on Semi-Supervised K-means
下载PDF
导出
摘要 针对原始K-means算法的一系列问题,提出一种基于半监督的K-means聚类改进算法,能够自动进行聚类,找出最优K值,并且最大限度地找出孤立点。首先根据样本集自身的特点,按照"类内尽可能相似"原则一步一步形成数据集,然后对数据集进行"去噪"与合并相似簇,最后,利用少量的标记信息指导和修正聚类结果。在UCI的多个数据集上测试,结果表明改进的算法较原始算法在准确率上有较大提高,并且具有更好的稳定性。 Original k-means algorithm for a range of issues,which is proposed on the basis of semi-supervised k-means Clustering Algorithm,can automatically cluster,finding the optimal k value,and the maximum outliers.First,according to the own characteristics of sample and the principle of category as similar as possible,data set is formed step by step,then "denoised" or merged into similar clusters,and finally,the resultant clustering is guided and corrected by using a small amount of tag information.Multiple data sets in the UCI test results show that the improved algorithm is of better accuracy and better stability than the original algorithm.
作者 李小展
出处 《东莞理工学院学报》 2011年第1期29-32,共4页 Journal of Dongguan University of Technology
关键词 半监督 K-MEANS算法 聚类改进算法 semi-supervised k-means algorithm clustering algorithm
  • 相关文献

参考文献6

二级参考文献35

  • 1Dzeroski S. Multi-Relational data mining: An introduction. ACM SIGKDD Explorations Newsletter, 2003,5(1):1-16.
  • 2Dzeroski S, Lavrac N. Relational Data Mining. Berlin: Springer-Verlag, 2001. 339-364.
  • 3Domingos P. Prospects and challenges for multi-relational data mining. ACM SIGKDD Explorations Newsletter, 2003,5(1):80-83.
  • 4Bouchachia A. Learning with partly labeled data. Neural Computing and Applications, 2007,16(3):267-293.
  • 5Zhu XJ. Semi-Supervised learning literature survey. Technical Report, Computer Sciences TR 1530, University of Wisconsin- Madison, 2007. 1-42.
  • 6Chapelle O, Seholkopf B, Zien A. Semi-Supervised Learning. Cambridge: MIT Press, 2006. 3-14.
  • 7Long B, Zhang F, Wu XY, Yu PS. Spectral clustering for multi-type relational data. In: Cohen WW, Moore A, eds. Proc. of the 23rd Int'l Conf. on Machine Learning. New York: ACM Press, 2006. 585-592.
  • 8Marques de Sa JP, Wrote; Wu YF, Trans. Pattern Recognition Concepts, Methods and Applications. 2nd ed., Beijing: Tsinghua University Press, 2002.51-74 (in Chinese).
  • 9http://archive.ics.uci.edu/ml/datasets.html
  • 10Yin XX, Han JW, Yu PS. CrossClus: User-Guided multi-relational clustering. Data Mining Knowledge Discovery, 2007,15(3): 321-348.

共引文献50

同被引文献10

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部