摘要
信用卡业务是银行的核心业务,各大商业银行通过发行信用卡来抢占市场和发展客户。虽然信用卡业务给银行带来了高额利润,但信用卡的粗放式管理导致信用卡客户存在较高的违约率,给银行带来了极大的风险。因此,如何有效针对信用卡业务进行风险管理已经成为银行业的热点关注问题之一。本文采用机器学习的相关算法构建银行信用卡违约预测模型,预测信用卡用户次月的违约情况,辅助银行进行风险管理。具体地,本文通过逻辑回归、决策树、随机森林、自适应增强和梯度提升树这五类算法来构建信用卡违约预测模型并通过准确率等模型评价指标对比不同特征选择方式下五种模型的预测效果。本文使用某银行信用卡持卡人的相关数据进行实验,实验结果表明,相比于算法选择,不同的特征选择方式对于模型性能有更大的影响,其中,过滤式特征选择的适应性更强。
Credit card business is the core business of Banks. Commercial Banks seize the market and develop customers by issuing credit cards. Although credit card business brings high profits to banks, extensive credit card management leads to high default rate of credit card customers, which brings great risks to banks. Therefore, how to effectively manage the risk of credit card business has become one of the hot issues in the banking industry. This paper uses machine learning related algorithms to construct a bank credit card default prediction model, predicts credit card users’ defaults in the next month, and assists banks in risk management. Specifically, this paper con-structs credit card default prediction models through logistic regression, decision tree, random forest, adaboost and gradient boosting decision tree, and compares the prediction effects of five models under different feature selection methods through evaluation indexes such as accuracy. In this paper, relevant data of credit card holders of a bank are used for experiments. The experi-mental results show that different feature selection methods have a greater impact on model per-formance than algorithm selection. Among them, the filter feature selection is more adaptable.
出处
《数据挖掘》
2019年第4期145-152,共8页
Hans Journal of Data Mining