期刊文献+

一种基于Seeds集和成对约束的主动半监督聚类算法 被引量:2

An Active Semi-supervised Clustering Algorithm Based on Seeds Set and Pairwise Constraints
下载PDF
导出
摘要 针对半监督聚类算法中监督信息使用不充分,监督信息中信息含有量低的问题,提出一种结合主动学习的半监督聚类算法.首先结合使用数据的类别标记和成对约束信息,指导Kmeans聚类过程,设计出一种基于Seeds集和成对约束的半监督聚类算法SC-Kmeans;其次将主动学习算法引入到SC-Kmeans中,以尽量小的代价选取信息含有量更高的监督信息,提高SC-Kmeans算法的聚类精度;最后在UCI标准数据集上进行仿真实验.实验结果表明,该算法取得了较好的聚类效果,有效提高了聚类准确率. Aiming at the problem that the supervised information was not sufficient and the information content of supervision information was low in semi-supervised clustering algorithm, we proposed a semi-supervised clustering algorithm based on active learning. Firstly, we designed a semi- supervised clustering algorithm based on Seeds set and pairwise constraints (SC-Kmeans) to guide the clustering process of the Kmeans algorithm by using the labeled data and pairwise constraints. Secondly, we introduced the active learning algorithm into SC-Kmeans, in order to select a higher amount of supervision information with a small cost and improve the clustering accuracy of SC-Kmeans algorithm. Finally, the simulation experiments were performed on machine learning repository (UCI) standard data sets. The experimental results show that the proposed algorithm can achieve better clustering effect, and effectively improve the clustering accuracy.
出处 《吉林大学学报(理学版)》 CAS CSCD 北大核心 2017年第3期664-672,共9页 Journal of Jilin University:Science Edition
基金 国家自然科学基金重点项目(批准号:61133011) 吉林省科技发展计划重点科技攻关项目(批准号:20150204005GX) 长春市科技计划重大科技攻关项目(批准号:14KG082)
关键词 半监督聚类 Kmeans算法 成对约束 Seeds集 主动学习 semi-supervised clustering Kmeans algorithm pairwise constraint Seeds set active learning
  • 相关文献

参考文献7

二级参考文献107

  • 1Basu S, Banerjee A, Mooney RJ. A probabilistic framework for semi-supervised clustering. In: Boulicaut JF, Esposito F, Giannotti F, Pedreschi D, eds. Proc. of the 10th ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining. New York: ACM Press, 2004.59-68.
  • 2Bilenko M, Basu S, Mooney RJ. Integrating constraints and metric learning in semi-supervised clustering. In: Brodley CE, ed. Proc. of the 21st Int'l Conf. on Machine Learning. New York: ACM Press, 2004. 81-88.
  • 3Tang W, Xiong H, Zhong S, Wu J. Enhancing semi-supervised clustering: a feature projection perspective. In: Berkhin P, Caruana R, Wu XD, eds. Proc. of the 13th ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining. New York: ACM Press, 2007. 707-716.
  • 4Basu S, Banerjee A, Mooney RJ. Active semi-supervision for pairwise constrained clustering. In: Jonker W, Petkovic M, eds. Proc. of the SIAM Int'l Conf. on Data Mining. Cambridge: MIT Press, 2004. 333-344.
  • 5Yan B, Domeniconi C. An adaptive kernel method for semi-supervised clustering. In: Fiirnkranz J, Scheffer T, Spiliopoulou M, eds. Proc. of the 17th European Conf. on Machine Learning. Berlin: Sigma Press, 2006. 18-22.
  • 6Yeung DY, Chang H. Extending the relevant component analysis algorithm for metric learning using both positive and negative equivalence constraints. Pattern Recognition, 2006,39(5):1007-1010.
  • 7Beyer K, Goldstein J, Ramakrishnan R, Shaft U. When is "Nearest Neighbors Meaningful"? In: Beeri C, Buneman P, eds. Proc. of the Int'l Conf. on Database Theory. New York: ACM Press, 1999.217-235.
  • 8Ding CH, Li T. Adaptive dimension reduction using discriminant analysis and K-means clustering. In: Ghahramani Z, ed. Proc. of the 19th Int'l Conf. on Machine Learning. New York: ACM Press, 2007.521-528.
  • 9Zhang DQ, Zhou ZH, Chen SC. Semi-Supervised dimensionality reduction. In: Mandoiu I, Zelikovsky A, eds. Proc. of the 7th SIAM Int'l Conf. on Data Mining. Cambridge: MIT Press, 2007. 629-634.
  • 10Ye JP, Zhao Z, Liu H. Adaptive distance metric learning for clustering. In: Bishop CM, Frey B, eds. Proc. of IEEE Computer Society Conf. on Computer Vision and Pattern Recognition. Madison: IEEE Computer Society Press, 2007. 1-7.

共引文献196

同被引文献36

引证文献2

二级引证文献35

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部