
Related Theoretical Analysis of Diversity-Based Semi-supervised Learning
Abstract: Diversity-based semi-supervised learning combines semi-supervised learning with ensemble learning and has become a research focus in machine learning in recent years. However, related theoretical work is scarce, and none of it takes distribution noise into account. This paper first defines a hybrid of classification and distribution noise (HCAD), tailored to the characteristics of diversity-based semi-supervised learning. It then gives a probably approximately correct (PAC) analysis of diversity-based semi-supervised learning under HCAD noise, together with an application example of the theorem. Finally, based on the voting margin, an upper bound on the generalization error of multi-classifier systems under HCAD noise is derived and proved. The theoretical results can be used to design diversity-based semi-supervised learning algorithms and to evaluate their generalization ability, and they have broad application prospects.
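As background for the quantity the abstract's generalization bound is built on, the following is a minimal, generic sketch of the voting margin of a multi-classifier system on a single example: the fraction of votes for the true label minus the largest vote fraction received by any other label. The function name and example data are illustrative only, not from the paper.

```python
# Generic voting-margin sketch (illustrative; not the paper's derivation).
from collections import Counter

def voting_margin(predictions, true_label):
    """Margin of a majority-vote ensemble on one example.

    predictions: list of labels output by the individual classifiers.
    Returns (votes for true_label - max votes for any other label) / total votes,
    a value in [-1, 1]; positive means the ensemble classifies correctly.
    """
    counts = Counter(predictions)
    true_votes = counts.get(true_label, 0)
    other_votes = max(
        (c for lbl, c in counts.items() if lbl != true_label), default=0
    )
    return (true_votes - other_votes) / len(predictions)

# Example: 5 classifiers vote on one example whose true label is "A".
print(voting_margin(["A", "A", "B", "A", "C"], "A"))  # → 0.4
```

Margin-based bounds of the kind described in the abstract relate the distribution of this quantity over the training set to the ensemble's generalization error.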
Authors: Jiang Zhen (姜震), Zhan Yongzhao (詹永照)
Source: Pattern Recognition and Artificial Intelligence (《模式识别与人工智能》), 2014, No. 10, pp. 865-872 (8 pages); indexed in EI, CSCD, and the Peking University Core list.
Funding: National Natural Science Foundation of China (No. 61170126); Jiangsu University Senior Talent Start-up Fund (No. 1291170022).
Keywords: Diversity-Based Semi-supervised Learning; Noise; Probably Approximately Correct (PAC) Analysis; Generalization Error
