基于二重LOF与逆交叉验证的稳健AdaBoost回归模型

Robust AdaBoost Regression Model Based on Double LOF and Inverse-Cross-Validation

下载PDF

导出

摘要【目的】传统AdaBoost回归模型的稳健性不足,改进的AdaBoost.RT+、AdaBoost.RS算法仍然存在对异常数据抑制效果不显著和识别正确率较低等问题,增强AdaBoost方法的稳健性具有重要的实际应用价值。【方法】给出的AdaBoost.R_LOF模型,首先提出二重LOF和逆交叉验证算法,并将两种方法结合,以概率刻画数据的异常程度。然后在AdaBoost.R2算法的基础上,根据数据的异常程度,对数据设置恰当的权重系数,在不影响正常数据迭代的同时抑制异常数据的影响。【结果】使得新模型具有更好的稳健性,并且得到更小的预测均方误差。【局限】该方法需要调节的超参数有所增加,需要根据数据集分布特征进行调整。【结论】模拟和真实案例结果显示,相比于AdaBoost.R2、AdaBoost.RT+和AdaBoost.RS算法,在不同比例异常值的数据集下,该方法都具有更好的稳健性和估计效果。 [Objective]The robustness of the traditional AdaBoost regression model is insufficient.The improved AdaBoost.RT+and AdaBoost.RS algorithms hold insignificant suppression on abnormal data and low identification accuracy of abnormal data.It is meaningful to enhance the robustness of AdaBoost algorithms.[Methods]First,dual LOF and inverse cross validation algorithms are proposed,the abnormal degree of data is characterized by probability based on these two algorithms.Then,appropriate weight coefficients are given according to the abnormal degree of the data to suppress its influence and keep no effect on the normal data.[Results]This AdaBoost.R_LOF model holds better robustness and less mean squared error on prediction.[Limitations]However,more hyperparameters are needed.[Conclusions]Simulations and real applications show that the new model has better robustness and estimation under the different proportions of outliers compared with AdaBoost.R2,AdaBoost.RT+and AdaBoost.RS algorithms.

作者曾凡倍杨联强 ZENG Fanbei;YANG Lianqiang(School of Big Data and Statistics,Anhui University,Hefei,Anhui 230601,China;School of Artificial Intelligence,Anhui University,Hefei,Anhui 230601,China)

机构地区安徽大学安徽大学

出处《数据与计算发展前沿（中英文）》 CSCD 2024年第5期126-138,共13页 Frontiers of Data & Computing

基金安徽高校自然科学基金(KJ2021A0049) 安徽省自然科学基金(2208085MA06)。

关键词 ADABOOST算法二重LOF算法逆交叉验证 AdaBoost.R_LOF算法 oAdaBoost double LOF Inverse-Cross-Validation AdaBoost.R_LOF

分类号 TP3 [自动化与计算机技术—计算机科学与技术]

引文网络
相关文献

1Fanyue Qian,Yingjun Ruan,Huiming Lu,Hua Meng,Tingting Xu.Enhancing source domain availability through data and feature transfer learning for building power load forecasting[J].Building Simulation,2024,17(4):625-638.
2贺锋涛,王乐莹,王晓波,杨祎,李碧丽.基于改进的AdaBoost无线光通信信号检测算法[J].激光技术,2023,47(5):659-665. 被引量：1
3王振潜,吴勇,任伟,龙淑芬.基于5G通信的城市智慧交通拥堵预测分析[J].移动信息,2024,46(5):30-32.
4李轩涛,张敬剑,孙洁,郏琴.基于Adaboost的阻塞性睡眠呼吸暂停诊断预测模型研究[J].建模与仿真,2024,13(5):5033-5043.
5王兴隆,尹昊,贺敏.基于LSTM的机场飞行区活动目标潜在冲突预测[J].北京航空航天大学学报,2024,50(6):1850-1860. 被引量：1
6方博扬,赵国彦,马举,陈立强,简筝.Adaboost集成学习优化的巷道围岩松动圈预测研究[J].黄金科学技术,2023,31(3):497-506. 被引量：2
7陈立潮,王冠男.基于面部多特征的驾驶员疲劳状态检测[J].计算机与数字工程,2023,51(3):721-726. 被引量：1
8闫秀英,肖桂波,王鑫洋,吉星星.基于ISSA-LSTM的热舒适短期预测模型[J].计算机测量与控制,2024,32(5):230-237.
9赵海涛,李红烨.基于改进孤立森林算法的Linux日志异常检测方法[J].指挥控制与仿真,2024,46(5):114-118.
10李守俊,李江,严佳杰,金波,徐哲.基于LOF模型的输水管网异常数据检测及校正[J].杭州电子科技大学学报（自然科学版）,2024,44(4):51-59.

数据与计算发展前沿（中英文）

2024年第5期

浏览历史

内容加载中请稍等...

基于二重LOF与逆交叉验证的稳健AdaBoost回归模型

相关作者

相关机构

相关主题

浏览历史