期刊文献+

一种主动半监督K-means聚类算法的改进策略 被引量:3

An Improved Strategy of Active Semi-supervision K-means Clustering Algorithm
下载PDF
导出
摘要 经典的APCKmeans(active pairwise constrained K-means)算法通过主动学习的方式构造must-link约束集和cannot-link约束集作为监督信息进行半监督聚类,提高了结果的准确性.但该算法在样本指派的过程中可能出现指派不是当前最优的问题.提出一种优先指派标签样本的方法,应用于APCKmeans算法,使用改进后的APCKmeans_I算法实现了使用较少的监督信息取得更好的聚类结果.将改进策略应用于PCKmeans(pairwise constrained K-means)算法,提出改进后的PCKmeans_I算法.通过在UCI基准数据集的实验表明,改进后算法的性能得到明显提升. The classic APCKmeans (active pairwise constrained K-means )algorithm constructs the must-link constraint set and the cannot-link constraint set as the supervised information by Semi-Supervised Clustering through the active learning method to improve the accuracy of the results. However, the algorithm may not be assigned to the current optimal problem during the sample assignment process. This paper proposes a method of assigning label samples to APCKmeans algorithm, and proposes an improved APCKmeans_I algorithm to aehieve better clustering results with less supervisory information. The improved strategy is applied to PCKmeans(pairwise constrained K-means)algorithm, and PCKmeans_I algorithm is proposed. Experiments on the UCI reference data set show that the performance of the improved algorithm is obviously improved.
作者 吕峰 柴变芳 李文斌 王垚 Lü Feng;Chai Bianfang;Li Wenbin;Wang Yao(School of Information Engineering,Hebei GEO University,Shijiazhuang 050031,China)
出处 《南京师范大学学报(工程技术版)》 CAS 2018年第2期56-62,共7页 Journal of Nanjing Normal University(Engineering and Technology Edition)
基金 国家自然科学基金(61503260) 河北省研究生创新资助项目(CXZZSS2017131) 河北地质大学教改项目(2017J04)
关键词 主动半监督聚类 成对约束聚类 改进算法 active semi-supervised clustering pairwise constrained clustering improved algorithm
  • 相关文献

参考文献6

二级参考文献86

  • 1阮园,马建华.2型糖尿病与骨质疏松相关因素研究进展[J].现代生物医学进展,2011,11(S1):4778-4780. 被引量:13
  • 2邢岩,赵晋华,张黎铭,车文军.老年2型糖尿病患者骨密度变化及其影响因素分析[J].中国骨质疏松杂志,2006,12(5):482-484. 被引量:6
  • 3Basu S, Banerjee A, Mooney RJ. A probabilistic framework for semi-supervised clustering. In: Boulicaut JF, Esposito F, Giannotti F, Pedreschi D, eds. Proc. of the 10th ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining. New York: ACM Press, 2004.59-68.
  • 4Bilenko M, Basu S, Mooney RJ. Integrating constraints and metric learning in semi-supervised clustering. In: Brodley CE, ed. Proc. of the 21st Int'l Conf. on Machine Learning. New York: ACM Press, 2004. 81-88.
  • 5Tang W, Xiong H, Zhong S, Wu J. Enhancing semi-supervised clustering: a feature projection perspective. In: Berkhin P, Caruana R, Wu XD, eds. Proc. of the 13th ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining. New York: ACM Press, 2007. 707-716.
  • 6Basu S, Banerjee A, Mooney RJ. Active semi-supervision for pairwise constrained clustering. In: Jonker W, Petkovic M, eds. Proc. of the SIAM Int'l Conf. on Data Mining. Cambridge: MIT Press, 2004. 333-344.
  • 7Yan B, Domeniconi C. An adaptive kernel method for semi-supervised clustering. In: Fiirnkranz J, Scheffer T, Spiliopoulou M, eds. Proc. of the 17th European Conf. on Machine Learning. Berlin: Sigma Press, 2006. 18-22.
  • 8Yeung DY, Chang H. Extending the relevant component analysis algorithm for metric learning using both positive and negative equivalence constraints. Pattern Recognition, 2006,39(5):1007-1010.
  • 9Beyer K, Goldstein J, Ramakrishnan R, Shaft U. When is "Nearest Neighbors Meaningful"? In: Beeri C, Buneman P, eds. Proc. of the Int'l Conf. on Database Theory. New York: ACM Press, 1999.217-235.
  • 10Ding CH, Li T. Adaptive dimension reduction using discriminant analysis and K-means clustering. In: Ghahramani Z, ed. Proc. of the 19th Int'l Conf. on Machine Learning. New York: ACM Press, 2007.521-528.

共引文献199

同被引文献32

引证文献3

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部