Online Boosting Algorithms Based on Exponential and 0-1 Loss (cited by: 2)

Abstract: This paper derives rigorous online forms of Boosting algorithms that use the exponential loss and the 0-1 loss, and proves that both online Boosting algorithms maximize the expected sample margin and minimize the margin variance. By incrementally estimating the mean and variance of the sample margin, Boosting can be applied to online learning problems without loss of classification accuracy. Experiments on UCI machine learning datasets show that online Boosting with the exponential loss achieves classification accuracy close to that of batch adaptive Boosting (AdaBoost) and significantly outperforms traditional online Boosting. Online Boosting with the 0-1 loss minimizes the classification errors on positive and negative samples separately, which makes it suitable for imbalanced data, and its classification performance is more stable on noisy data.
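The incremental estimation of the margin mean and variance mentioned in the abstract can be illustrated with a minimal sketch. It assumes the usual margin definition y * h(x) for a label y in {-1, +1} and a real-valued classifier score h(x), and it uses Welford's running-statistics update as a stand-in; the abstract does not specify the paper's actual update rule, and the names `RunningMargin` and `margin` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class RunningMargin:
    """Running mean and variance of sample margins (hypothetical helper).

    Uses Welford's update, an assumption made for illustration; it is not
    taken from the paper.
    """
    count: int = 0
    mean: float = 0.0
    m2: float = 0.0  # running sum of squared deviations from the mean

    def update(self, margin_value: float) -> None:
        # Fold one new margin into the statistics without storing old samples.
        self.count += 1
        delta = margin_value - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (margin_value - self.mean)

    @property
    def variance(self) -> float:
        # Population variance; 0.0 before any sample has been seen.
        return self.m2 / self.count if self.count else 0.0


def margin(label: int, score: float) -> float:
    """Margin y * h(x), assuming label in {-1, +1} and a real-valued score."""
    return label * score


# Samples arrive one at a time, as in an online learning setting.
stats = RunningMargin()
for label, score in [(1, 0.5), (-1, -1.0), (1, -0.5), (-1, -1.0)]:
    stats.update(margin(label, score))

print(stats.mean, stats.variance)
```

Each update costs constant time and memory, which is what makes statistics of this kind usable when samples cannot be revisited.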

Authors: Hou Jie (侯杰), Mao Yaobin (茅耀斌), Sun Jinsheng (孙金生)

Affiliation: School of Automation, Nanjing University of Science and Technology

Source: Acta Automatica Sinica (《自动化学报》; indexed in EI, CSCD, PKU Core), 2014, No. 4, pp. 635-642 (8 pages)

Funding: Supported by the National Natural Science Foundation of China (60974129)

Keywords: AdaBoost, online learning, feature selection, imbalanced data

Classification code: TP391.4 [Automation and Computer Technology: Computer Application Technology]

