Abstract
Traditional classifiers trained on class-imbalanced data tend to favor the majority class, which degrades classification performance. To address this, the paper approaches the problem from two directions, data sampling and model selection, and proposes a cost-sensitive neural network ensemble (CSNN_Ensemble) model. First, random under-sampling produces multiple training sets. Second, a back-propagation (BP) neural network is trained on each training set and combined with a cost matrix to form a cost-sensitive neural network. Finally, these cost-sensitive networks serve as base learners in a parallel ensemble whose final decision is made by voting. Experimental results show that the model performs well on three measures, F1 score, AUC, and expected total cost, and has a degree of robustness.
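The three steps in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The abstract does not say how the cost matrix enters each network, so the sketch assumes it is applied at decision time by choosing the class with the lowest expected cost. The network size, learning rate, epoch count, binary labels (1 = minority), and all function names are likewise assumptions made for illustration.

```python
import numpy as np

def undersample(X, y, rng):
    """Keep every minority sample (label 1) plus an equal-sized random subset of the majority (label 0)."""
    minority = np.flatnonzero(y == 1)
    majority = np.flatnonzero(y == 0)
    keep = rng.choice(majority, size=len(minority), replace=False)
    idx = np.concatenate([minority, keep])
    return X[idx], y[idx]

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_bp_net(X, y, rng, hidden=8, lr=0.5, epochs=300):
    """One-hidden-layer network trained with plain batch backpropagation (assumed architecture)."""
    W1 = rng.normal(0.0, 0.5, (X.shape[1], hidden))
    b1 = np.zeros(hidden)
    W2 = rng.normal(0.0, 0.5, hidden)
    b2 = 0.0
    for _ in range(epochs):
        H = np.tanh(X @ W1 + b1)                      # hidden activations
        p = sigmoid(H @ W2 + b2)                      # predicted P(y = 1)
        g_out = (p - y) / len(y)                      # mean cross-entropy gradient w.r.t. the output logit
        g_hid = np.outer(g_out, W2) * (1.0 - H ** 2)  # gradient propagated back through tanh
        W2 -= lr * (H.T @ g_out)
        b2 -= lr * g_out.sum()
        W1 -= lr * (X.T @ g_hid)
        b1 -= lr * g_hid.sum(axis=0)
    return W1, b1, W2, b2

def cost_sensitive_predict(net, X, cost):
    """Pick the class with minimum expected cost; cost[i, j] = cost of predicting j when the truth is i."""
    W1, b1, W2, b2 = net
    p1 = sigmoid(np.tanh(X @ W1 + b1) @ W2 + b2)
    probs = np.column_stack([1.0 - p1, p1])
    expected_cost = probs @ cost                      # column j holds sum_i P(i) * cost[i, j]
    return expected_cost.argmin(axis=1)

def fit_ensemble(X, y, n_members=5, seed=0):
    """Train one network per independently under-sampled training set."""
    rng = np.random.default_rng(seed)
    nets = []
    for _ in range(n_members):
        Xs, ys = undersample(X, y, rng)
        nets.append(train_bp_net(Xs, ys, rng))
    return nets

def predict_ensemble(nets, X, cost):
    """Majority vote over the members' cost-sensitive decisions (use an odd member count to avoid ties)."""
    votes = np.array([cost_sensitive_predict(net, X, cost) for net in nets])
    return (votes.sum(axis=0) * 2 > len(nets)).astype(int)
```

A call such as `predict_ensemble(fit_ensemble(X, y), X, np.array([[0.0, 1.0], [5.0, 0.0]]))` would penalize a missed minority sample five times as heavily as a false alarm. The 5:1 ratio is an arbitrary illustration, not a value taken from the paper.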
Authors
ZHANG Junjie (张俊杰)
CAO Li (曹丽)
School of Mathematics, Hefei University of Technology, Hefei 230601, China
Source
Journal of Hefei University of Technology: Natural Science (《合肥工业大学学报(自然科学版)》)
CAS
Peking University Core Journals (北大核心)
2023, No. 11, pp. 1573-1579 (7 pages)
Funding
National Natural Science Foundation of China (Grant No. 41972304).