摘要
文章以互联网金融机构客户信贷数据为基础,使用7种不同的数据挖掘方法建立个人信用评估模型,从预测准确率、模型外推性、第二类错误率、预期错误分类成本4个方面评价模型的综合信用评估能力。评价结果表明,使用分类树和K近邻分类算法建立的个人信用评估模型的综合信用评估能力最高;同时发现使用线性和非线性方法建立的模型各有特点,线性分类模型能够对违约客户进行有效识别,而非线性分类模型的预测精度较高。
Based on the customer credit data of internet financial institutions,this paper uses seven different data mining methods to establish personal credit evaluation model,and evaluates the comprehensive credit evaluation ability of the model from four aspects:prediction accuracy,model extrapolation,the rate of typeⅡerror,expected cost of error classification.The evaluation results show that the comprehensive credit evaluation ability of the personal credit evaluation model based on classification tree and K-nearest neighbor classification algorithm is the highest.At the same time,it is found that the models based on linear and non-linear methods have their own characteristics.Linear classification model can effectively identify defaulted customers,while nonlinear classification model has higher prediction accuracy.
作者
李坤
Li Kun(School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan 430073, China)
出处
《江苏科技信息》
2018年第32期40-43,共4页
Jiangsu Science and Technology Information
关键词
信用评估模型
预测精度
预期错误分类成本
综合信用评估能力
credit evaluation model
prediction accuracy
expected cost of error classification
comprehensive creditevaluation ability