基于准确性和多样性的在线动态选择集成建模方法

Ensemble Modeling Method of Online Dynamic Selection Based on Accuracy and Diversity

下载PDF

导出

摘要为了解决复杂工业过程中的概念漂移问题,提高集成学习模型的泛化性能,在保证集成学习模型精度的基础上,提出了一种用于优化多样性的基学习器在线动态选择集成建模方法.该方法以在线极限学习机作为基学习器,按照基学习器在滑动窗口上的分类精度对其进行逆序排序,将基学习器在滑动窗口上的其他性能指标作为特征属性,依次利用近似线性依靠条件挑选出准确且多样的基学习器用于集成输出,提高了集成学习模型在处理概念漂移数据流时的分类精度.最后,使用合成数据集和公开数据集验证了所提算法的合理性与有效性. To solve the problem of concept drift in complex industrial process and to improve the generalization performance of ensemble learning model,an ensemble modeling method of online dynamic selection for optimizing the diversity of the base learners was proposed,on the basis of ensuring the accuracy of the ensemble learning model.Online sequential extreme learning machine was used as the base learner,and the base learners were sorted in reverse order according to their classification accuracy on the sliding window.The other performance indexes of the basic learners on the sliding window were used as the feature attributes,and the approximate linear dependence condition was used to select accurate and diverse base learners for ensemble output,which improves the classification accuracy of the ensemble algorithm in dealing with the concept drift data stream.Finally,the rationality and effectiveness of the proposed algorithm were verified by using the synthetic data sets and real-world data sets.

作者陈双叶赵荣符寒光高建琛 CHEN Shuangye;ZHAO Rong;FU Hanguang;GAO Jianchen(Faculty of Information Technology,Beijing University of Technology,Beijing 100124,China;College of Materials Science and Engineering,Beijing University of Technology,Beijing 100124,China)

机构地区北京工业大学信息学部北京工业大学材料科学与工程学院

出处《北京工业大学学报》 CAS CSCD 北大核心 2021年第11期1211-1218,共8页 Journal of Beijing University of Technology

基金国家重点研发计划资助项目(2017YFB0306404)。

关键词概念漂移集成学习近似线性依靠在线极限学习机准确性多样性 concept drift ensemble learning approximate linear dependence online sequential extreme learning machine accuracy diversity

分类号 U461 [机械工程—车辆工程] TP308 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献4

1高慧云,陆慧娟,严珂,叶敏超.基于差异性和准确性的加权调和平均度量的基因表达数据选择性集成算法[J].计算机应用,2018,38(5):1512-1516. 被引量：7
2张春霞,张讲社.选择性集成学习算法综述[J].计算机学报,2011,34(8):1399-1410. 被引量：136
3傅强,胡上序,赵胜颖.Clustering-based selective neural network ensemble[J].Journal of Zhejiang University-Science A(Applied Physics & Engineering),2005,6(5):387-392. 被引量：2
4汤健,柴天佑,刘卓,余文,周晓杰.基于更新样本智能识别算法的自适应集成建模[J].自动化学报,2016,42(7):1040-1052. 被引量：17

二级参考文献112

1王丽丽,苏德富.基于群体智能的选择性决策树分类器集成[J].计算机技术与发展,2006,16(12):55-57. 被引量：3
2Thompson S. Pruning boosted classifiers with a real valued genetic algorithm. Knowledge-Based Systems, 1999, 12(5-6): 277-284.
3Zhou Z H, Tang W. Selective ensemble of decision trees// Proceedings of the 9th International Conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing. Chongqing, China, 2003:476-483.
4Hernandez-Lobato D, Hernandez-Lobato J M, Ruiz-Torrubiano R, Valle A. Pruning adaptive boosting ensembles by means of a genetic algorithm//Corchado et al. International Conference on Intelligent Data Engineering and Automated Learning. Berlin Heidelberg: Springer-Verlag, 2006: 322- 329.
5Zhang Y, Burer S, Street W N. Ensemble pruning via semidefinite programming. Journal of Machine Learning Research, 2006, 7: 1315-1338.
6Chen H H, Tino P, Yao X. Predictive ensemble pruning by expectation propagation. IEEE Transactions on Knowledge and Data Engineering, 2009, 21(7): 999-1013.
7Dos Santos E M, Sahourin R, Maupin P. Overfitting cautious selection of classifier ensembles with genetic algorithms. Information Fusion, 2009, 10(2): 150-162.
8Li N, Zhou Z H. Selective ensemble under regularization framework//Benediksson J A, Kittler J, Roll F. Multiple Classifier Systems. Berlin Heidelberg: Springer-Verlag, 2009:293-303.
9Reid S, Grudic G. Regularized linear models in stacked generalization//Benediksson J A, Kittler J, Roli F. Multiple Classifier Systems. Berlin Heidelberg: Springer-Verlag, 2009:112-121.
10Zhang L, Zhou W D. Sparse ensembles using weighted combination methods based on linear programming. Pattern Recognition, 2011, 44(1): 97-106.

共引文献156

1王茂光,冀昊悦,王天明.一种基于层次聚类和模拟退火的选择性集成算法的风控模型研究[J].计算机科学,2022,49(S02):201-207. 被引量：1
2崔宇,侯慧娟,苏磊,钱涛,盛戈皞,江秀臣.考虑不平衡案例样本的电力变压器故障诊断方法[J].高电压技术,2020,46(1):33-41. 被引量：28
3郭亚琴,秦燕.改进的模糊聚类在分类器设计中的应用[J].软件导刊,2012,11(3):32-33.
4侯勇,郑雪峰.集成学习算法的研究与应用[J].计算机工程与应用,2012,48(34):17-22. 被引量：8
5邱诚,王大海,任伟家,邹权.基于集成学习的音乐识别方法研究[J].计算机科学,2012,39(12):184-187. 被引量：4
6陈康,向勇,喻超.大数据时代机器学习的新趋势[J].电信科学,2012,28(12):88-95. 被引量：37
7陆慧娟,安春霖,马小平,郑恩辉,杨小兵.基于输出不一致测度的极限学习机集成的基因表达数据分类[J].计算机学报,2013,36(2):341-348. 被引量：41
8汤健,柴天佑,余文,赵立杰.在线KPLS建模方法及在磨机负荷参数集成建模中的应用[J].自动化学报,2013,39(5):471-486. 被引量：21
9周涛,陆惠玲,陈志强,马苗.基于两阶段集成支持向量机的前列腺肿瘤识别[J].光学精密工程,2013,21(8):2137-2145. 被引量：5
10张东波,黄坤鑫.一种适用于多类问题的神经网络集成模型[J].信息与控制,2013,42(5):583-588. 被引量：1

1王瀛,刘哲甫,肖威.相关度排序的知识库检索排序方法研究[J].管理观察,2018(7):72-73.
2韩嵩,李晓俊.融合互联网文本大数据的上市企业信用评价[J].统计学报,2021,2(5):72-81. 被引量：3
3杨国栋.煤矿用阀门智能电动执行器的设计[J].煤矿机械,2021,42(9):10-13. 被引量：4
4常玉清,孙雪婷,钟林生,王福利,刘英娇.基于改进随机森林算法的工业过程运行状态评价[J].自动化学报,2021,47(9):2214-2225. 被引量：13
5汤健,夏恒,乔俊飞,郭子豪.深度集成森林回归建模方法及应用[J].北京工业大学学报,2021,47(11):1219-1229. 被引量：11

北京工业大学学报

2021年第11期

浏览历史

内容加载中请稍等...

基于准确性和多样性的在线动态选择集成建模方法

参考文献4

二级参考文献112

共引文献156

相关作者

相关机构

相关主题

浏览历史