期刊文献+

基于局部采样样本均衡的P2P借贷违约预警模型

P2P Lending Default Warning Model Based on Local Sampling Sample Equilibrium
下载PDF
导出
摘要 随着互联网金融的不断发展,P2P网络借贷的借贷人违约风险识别引起金融机构的重点关注,且随着互联网金融整改措施的实施,借贷违约量不断减少,因此在这P2P网络借贷历史违约数据不断减少的环境下,基于不均衡数据的违约预警分析显得尤为重要。本文在BSL不均衡样本抽样算法的基础上,通过Kmeans聚类算法降低抽样时间复杂度,并使用随机森林与其他机器学习分类算法进行对比实验,同时加入借款描述与借款标题的文本分析,最终建立了基于随机森林的P2P网络借贷违约预警模型来实现对于数据不均衡的P2P借贷违约风险识别。在满足高效率、高识别率的同时,满足了增量学习的现实需求,为P2P网络借贷平台提供一定的监管指导意见。 With the continuous development of Internet finance, the identification of borrowers’ default risk of peer-to-peer (P2P) lending has attracted the attention of financial institutions, and with the imple-mentation of Internet finance rectification measures, the amount of loan defaults has been de-creasing. Therefore, under the environment of decreasing historical default data in P2P lending, default warning analysis based on unbalanced data is particularly important. In this paper, based on BSL unbalanced sample sampling algorithm, K-means clustering algorithm is used to reduce the complexity of sampling time, and random forest is used to compare with other machine learning classification algorithms. At the same time, text analysis of loan description and loan title is added. Finally, a peer-to-peer lending default warning model based on random forest is established to identify P2P loan default risk of unbalanced data. It not only meets the needs of high efficiency and high recognition rate, but also meets the practical needs of incremental learning, and provides cer-tain supervision guidance for peer-to-peer lending platform.
作者 张雪飞
出处 《金融》 2020年第5期455-464,共10页 Finance
关键词 P2P网络借贷 违约预警 随机森林 样本均衡 P2P Lending Default Warning Random Forest Sample Equilibrium
  • 相关文献

参考文献9

二级参考文献105

  • 1曹凤岐.互联网金融对传统金融的挑战[J].金融论坛,2015,20(1):3-6. 被引量:189
  • 2李建平,徐伟宣,刘京礼,石勇.消费者信用评估中支持向量机方法研究[J].系统工程,2004,22(10):35-39. 被引量:22
  • 3迟国泰,许文,孙秀峰.个人信用卡信用风险评价体系与模型研究[J].同济大学学报(自然科学版),2006,34(4):557-563. 被引量:28
  • 4王圆,孙铁利,李杨.Web文本挖掘中的特征表示和特征提取[J].电脑知识与技术,2006,1(5):67-68. 被引量:2
  • 5Fung B C M,Wang K,Ester M.Hierarchical document clustering//Wang John ed.The Encyclopedia of Data Warehousing and Mining,idea Group.2005:970-975.
  • 6Salton G.The SMART Retrieval System-Experiments in Automatic Document Processing.Englewood Cliffs,New Jersey:Prentice Hall Inc,1971.
  • 7Wang Y,Julia H.Document clustering with semantic analysis//Proceedings of the 39th Hawaii International Conferences on System Sciences.Hawaii,US,2006:54-63.
  • 8Hotho A,Staab S,Stumme G.Wordnet improves text document clustering//Proceedings of the Semantic Web Workshop at SIGIR-2003,26th Annual International ACM SIGIR Conference.Toronto,Canada,2003:541-550.
  • 9Hall P,Dowling G.Approximate string matching.Computing Survey,1980,12(4):381-402.
  • 10Coelho T,Calado P,Souza L,Ribeiro-Neto B,Muntz R.Image retrieval using multiple evidence ranking.IEEETransactions on Knowledge and Data Engineering,2004,16(4):408-417.

共引文献456

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部