期刊文献+

基于样本依赖代价矩阵的小微企业信用评估方法 被引量:14

Credit Scoring of Small and Micro Enterprises Based on Sample-Dependent Cost Matrix
下载PDF
导出
摘要 针对小微企业信用历史数据规模较小,而且类别不平衡问题较为严重,提出基于样本依赖代价矩阵的Smote XGboost-Bayes Minimum Risk(SXG-BMR)模型,对整体样本进行低倍率过采样,以弱化类别不平衡问题,降低模型过拟合的风险;模型将集成学习模型与最小风险贝叶斯决策相结合,以实现代价敏感。同时,模型中引入了样本依赖的代价矩阵,该代价矩阵不仅与类别有关,而且与样本自身属性有关,可以更为准确地表征代价。使用标准信用数据集和上海市小微企业信用数据集,进行多种算法的对比分析,结果表明,该模型性能优良。 Because the credit history data of small and micro enterprises are small and the problem of class imbalance is more serious,this paper proposes a Smote XGboost-Bayes Minimum Risk(SXG-BMR)model based on the sample-dependent cost matrix. The whole sample is oversampled at a low rate to weaken the problem of class imbalance and reduce the risk of model overfitting. The model combines the integrated learning model with the minimum risk Bayes decision to realize the cost sensitivity. At the same time,this paper introduces the sample-dependent cost matrix into the model. The cost matrix is related not only to the category,but also to the attributes of the sample.Therefore,it can characterize the cost more accurately. In the empirical study,this paper uses a standard credit dataset and a real credit dataset of small and micro enterprises in Shanghai. Besides,it compares and analzes of various algorithms. The results show that the SXG-BMR model proposed in this paper has a good performance.
作者 张涛 汪御寒 李凯 张玥杰 ZHANG Tao;WANG Yuhan;LI Kai;ZHANG Yuejie(School of Information Management and Engineering,Shanghai University of Finance and Economics,Shanghai 200433,China;Shanghai Key Laboratory of Financial Information Technology,Shanghai University of Finance and Economics,Shanghai 200433,China;School of Computer Science,Fudan University,Shanghai 200433,China;Shanghai Key Laboratory of Intelligent Information Processing,Fudan University,Shanghai 200433,China)
出处 《同济大学学报(自然科学版)》 EI CAS CSCD 北大核心 2020年第1期149-158,共10页 Journal of Tongji University:Natural Science
基金 国家自然科学基金(61976057,61572140) 上海市自然科学基金(19ZR1417200) 教育部人文社会科学研究规划基金(19YJA630116)
关键词 信用评估 样本依赖 最小风险贝叶斯 XGBoost模型 代价敏感学习 credit scoring sample-dependent minimum Bayes risk XGBoost model cost sensitive learning
  • 相关文献

参考文献11

二级参考文献175

共引文献298

同被引文献185

引证文献14

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部