Research on Bank Credit Card Default Prediction Based on Machine Learning

Original title: 基于机器学习的银行信用卡违约预测研究

Abstract: Credit cards are a core business for banks, and major commercial banks issue them to capture market share and acquire customers. Although the credit card business is highly profitable, loosely controlled card management has led to high default rates among cardholders, which exposes banks to considerable risk. Managing credit card risk effectively has therefore become one of the banking industry's pressing concerns. This paper uses machine learning algorithms to build a bank credit card default prediction model that predicts whether a cardholder will default in the following month, in order to support banks' risk management. Specifically, it builds default prediction models with five algorithms (logistic regression, decision tree, random forest, adaptive boosting (AdaBoost), and gradient boosting decision tree) and uses evaluation metrics such as accuracy to compare the five models' predictive performance under different feature selection methods. Experiments on cardholder data from a bank show that the choice of feature selection method affects model performance more than the choice of algorithm, and that filter-based feature selection is more adaptable.
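To make the idea of filter-based feature selection concrete, here is a minimal sketch in plain Python. It is not the paper's implementation: the abstract does not state which scoring criterion, dataset fields, or library the author used. The absolute Pearson correlation score, the synthetic data from `make_toy_data`, and the threshold rule are all assumptions chosen for illustration. A filter method scores each feature against the label independently of any classifier and keeps the top-ranked ones; the accuracy metric named in the abstract is included for completeness.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / math.sqrt(var_x * var_y)


def filter_select(rows, labels, k):
    """Return the indices of the k features with the largest |correlation| to the label.

    The score is computed per feature, without training any model,
    which is what makes this a filter (rather than wrapper) method.
    """
    n_features = len(rows[0])
    scores = []
    for j in range(n_features):
        column = [row[j] for row in rows]
        scores.append((abs(pearson(column, labels)), j))
    scores.sort(key=lambda item: (-item[0], item[1]))
    return sorted(j for _, j in scores[:k])


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def make_toy_data(n=200, seed=0):
    """Synthetic stand-in for cardholder data (hypothetical, not the paper's dataset)."""
    rng = random.Random(seed)
    rows, labels = [], []
    for _ in range(n):
        label = rng.randint(0, 1)                 # 1 = default next month
        informative = label + rng.gauss(0, 0.3)   # strongly tied to the label
        weak = 0.3 * label + rng.gauss(0, 1.0)    # weakly tied to the label
        noise = rng.gauss(0, 1.0)                 # unrelated to the label
        rows.append([noise, informative, weak])
        labels.append(label)
    return rows, labels


rows, labels = make_toy_data()
best = filter_select(rows, labels, k=1)

# A deliberately simple threshold rule on the selected feature,
# used only to show how accuracy would be computed.
predictions = [1 if row[best[0]] > 0.5 else 0 for row in rows]
print("selected feature index:", best)
print("accuracy:", accuracy(labels, predictions))
```

In a study like the one described, the selected columns would then be passed to each of the five classifiers, so that the same feature subset can be compared across algorithms.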

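Author: 单华玮

Affiliation: University of International Business and Economics (对外经济贸易大学)

Source: Hans Journal of Data Mining (《数据挖掘》), 2019, No. 4, pp. 145-152 (8 pages)

Keywords: machine learning; credit card default; feature selection

Classification: F83 [Economics and Management: Finance]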


