结合类别信念的AdaBoost算法(英文)

AdaBoost Algorithm with Classification Belief

下载PDF

导出

摘要集成学习是一种受到广泛认可和使用的机器学习算法.为此提出一种新的多类集成学习算法,即AdaBoost belief.此算法改进多类集成学习算法AdaBoost·SAMME,使每个基分类器对于每个类别都有权重信息.这种类别上的权重被称为类别信念,可通过计算每次迭代中各个类别的正确率得到.将所提出的算法与原有的AdaBoost·SAMME算法从预测准确率、泛化能力以及理论支持等方面进行比较发现:在高斯数据集、多种UCI数据集以及基于日志的多类别入侵检测应用中,该算法不但具有更高的预测准确率和泛化能力,而且当类别数目增加,即类别更难以预测时,其分类错误率较原有AdaBoost·SAMME算法上升得更缓慢. Ensemble learning is widely accepted and used in machine learning. This paper proposes a multi-class ensemble learning algorithm named AdaBoost belief. The algorithm improves AdaBoost.SAMME by attaching weights to classes in every weak classifier. These weights, called class beliefs, are computed based on class accuracy collected in each round of the iteration. We compare the algorithm with AdaBoost.SAMME in many aspects in- cluding learning accuracy, generalization ability, and theory support. Experimental results indicate that the proposed method has a competitive learning ability and high prediction accuracy in Gaussian sets, several UCI sets, anda number of log-based intrusion detec- tion applications. When the class number increases so that prediction of classes becomes more difficult, the prediction error rate of the proposed algorithm increases slower than AdaBoost.SAMME.

作者严超吴悦岳晓冬

机构地区上海大学计算机工程与科学学院

出处《应用科学学报》 CAS CSCD 北大核心 2015年第2期203-214,共12页 Journal of Applied Sciences

基金 Supported by the National Science Foundation of China(No.61103067)

关键词集成学习多类别类别信念类别权重 AdaBoost·SAMME ensemble learning, multi-class, class belief, class weight, AdaBoost.SAMME

分类号 TN911.73 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献20

1Ensemble learningscholarpediahttp://www.scholarpedia.org/article/Ensemble_learning.
2WANG X, MATWIN S, JAPKOWICZ N, LIU X. Cost-sensitive boosting algorithms for imbalanced multi-instance datasets [J]. Advances in Artificial Intelligence, 2013, 7884: 174-186.
3SUN Y, KAMEL MS, WONG AK, WANG Y. Cost-sensitive boosting for classification of imbalanced data [J]. Pattern Recognition, 2007, 40: 3358-3378.
4YUAN B, MA XL. Sampling + reweighting: boosting the performance of AdaBoost on imbalanced datasets [C]// Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN), 2012: 2680-2685.
5SCHAPIRE R, SINGERY. (1999). Improved boosting algorithmsusing confidence-rated prediction. Machine Learning, 1999: 37297-336.
6SCHAPIRE R. Using output codes to boost multiclasslearning problems [C]//Proceedings of the Fourteenth International Conference on Machine Learning. Morgan Kauffman,1997.
7FRIEDMAN J, HASTIE T, TIBSHIRANI R. Additive logistic regression: a statistical view of boosting [J]. Annals of Statistics, 2000, 28: 337-407.
8ROSSET S, ZHU Ji, ZOU Hui, HASTIE T. Multi-class AdaBoost [J]. Statistics and Its Interface, 2009, 2: 349-360.
9CHAWLA N V, LAZAREVIC A, HALL L O, BOWYERK W. SMOTE Boost: improving prediction of the minority class in boosting [C]//Proceedings of Principles of Knowledge Discovery in Databases, 2003: 107-119.
10LIU XY, WU J, ZHOU ZH. Exploratory under-sampling forclass-imbalance learning [J]. IEEE Transactions on Systems, Man andCybernetics - Part B, 2009, 39(2): 539 -550.

1李剑,牛少彰.学习机制在电子商务中的应用[J].北京邮电大学学报,2009,32(3):1-4. 被引量：7
2钱博,唐振民,李燕萍,徐利敏.基于分层采样的集成k近邻说话人识别算法[J].计算机工程与应用,2007,43(35):226-229.
3钱博,金林.基于神经网络集成的SAR图像目标识别[J].现代雷达,2010,32(4):31-34. 被引量：3
4李志亮,黄丹.基于假设间隔的弱随机特征子空间生成算法[J].绵阳师范学院学报,2012,31(11):98-110.
5孙勋,黄平平,涂尚坦,杨祥立.利用多特征融合和集成学习的极化SAR图像分类[J].雷达学报（中英文）,2016,5(6):692-700. 被引量：11
6胡海峰,杨震.无线传感器网络中基于网格的目标跟踪算法[J].南京邮电大学学报（自然科学版）,2007,27(6):1-6.
7张莉芳,王华彬.无源雷达识别数据库设计[J].雷达与电子战,2010,0(3):7-10.
8罗会兰,杜连平.一种SVM集成的图像分类方法研究[J].电视技术,2012,36(23):39-42. 被引量：6
9李汶虹,王建国.结合Bi-2DPCA和PNN集成的SAR图像目标识别[J].中国电子科学研究院学报,2014,9(4):401-407. 被引量：5
10万鑫,王英男,任哲.LDPC码译码算法及调度方案的分析[J].电气应用,2013,0(S1):284-287.

应用科学学报

2015年第2期

浏览历史

内容加载中请稍等...

结合类别信念的AdaBoost算法(英文)

参考文献20

相关作者

相关机构

相关主题

浏览历史