期刊文献+

随机森林方法及其在客户流失预测中的应用研究 被引量:20

The Research on Random Forests and the Application in Customer Churn Prediction
原文传递
导出
摘要 在全球化的市场竞争中,企业如何利用现有资源,提高客户满意度,保住现有客户,已成为企业面临的主要问题,客户流失预测越来越受到企业关注。本文针对实际客户流失数据中正负样本数量不平衡而且数据量大的特点,提出一种改进的平衡随机森林算法,并将其应用于某商业银行的客户流失预测。实际数据集测试结果表明,与传统的预测算法比较,这种算法集成了抽样技术和代价敏感学习的优点,适合解决大数据集和不平衡数据,具有更高的精确度。 Facing the competition in the global market,enterprises are increasingly keener to explore how to hold the existing customers and improve their satisfaction by making use of the existing resources.The customer churn prediction has aroused more and more attention from enterprises.Given the unbalance and size of actual customer churn data,the paper puts forward an improved balanced-random forest algorithm and applies it to predict the customer churn of a commercial bank.The actual data set test result shows that the algorithm,on the strength of both sampling technique and cost-sensitive learning,has a higher accuracy in solving a large data set and unbalance data than the traditional prediction algorithms.
作者 应维云
出处 《管理评论》 CSSCI 北大核心 2012年第2期140-145,共6页 Management Review
基金 国家自然科学基金项目(71050003/G03 71171121)
关键词 流失预测 不平衡数据 随机森林 churn prediction imbalanced data random forests
  • 相关文献

参考文献18

  • 1Chandar,M.Lahal,A.Krishna,P.Modeling Churn Behavior of Bank Customers Using Predictive Data Mining Techniques.Na-tional Conference on Soft Computing Techniques for Engineering Applications[C].(SCT–2006),March24-26,2006.
  • 2Mozer,M.C.,Wolniewicz,R.,Grimes,D.B.,et al.Churn Reduction in the Wireless Industry[J].Advances in Neural InformationProcessing Systems,2000,(12):935-941.
  • 3Lemmens,A.,Croux,C.Bagging and Boosting Classification Trees to Predict Churn[J].Journal of Marketing Research,2006,43(2):276-286.
  • 4Chiang,D.,Wang,Y.,Lee,S.,Lin,C.Goal-Oriented Sequential Pattern for Network Banking Churn Analysis[J].Expert Systemswith Applications,2003,25(3):293-302.
  • 5Eiben,A.E.,Koudijs,A.E.,Slisser,F.Genetic Modeling of Customer Retention[C].Lecture Notes in Computer Science,178-186,1998.
  • 6Zhao,Y.,Li,B.,Li,X.,Liu,W.,Ren,S.J.Customer Churn Prediction Using Improved One-Class Support Vector Machine[C].Lecture Notes in Computer Science,2005,(3584):300-306.
  • 7赵宇,李兵,李秀,刘文煌,任守榘.基于改进支持向量机的客户流失分析研究[J].计算机集成制造系统,2007,13(1):202-207. 被引量:41
  • 8J.R.Quinlan.Decision Trees as Probabilistic Classifiers[C].Proc.4th Int.Workshop Machine Learning.Irvine,CA,1987,25:31-37.
  • 9Wai-Ho Au,Keith,C.C.Chan,Xin Yao.A Novel Evolutionary Data Mining Algorithm with Applications to Churn Prediction[J].IEEE Transactions on Evolutionary Computation,2003,20(7):532-545.
  • 10郭明,郑惠莉,卢毓伟.基于贝叶斯网络的客户流失分析[J].南京邮电学院学报(自然科学版),2005,25(5):79-83. 被引量:14

二级参考文献53

  • 1钱锋,徐麟文.知识发现中的聚类分析及其应用[J].杭州师范大学学报(自然科学版),2001,5(1):34-37. 被引量:16
  • 2许建华,张学工,李衍达.支持向量机的新发展[J].控制与决策,2004,19(5):481-484. 被引量:132
  • 3李建平,徐伟宣,刘京礼,石勇.消费者信用评估中支持向量机方法研究[J].系统工程,2004,22(10):35-39. 被引量:22
  • 4杨树莲.数据挖掘在电信行业客户流失分析中的应用[J].计算机与现代化,2005(2):109-111. 被引量:8
  • 5CHICKERING D, HECKERMAN D. Efficient Approximations for the Marginal Likelihood of Bayesian Networks with Hidden Variables [ A]. Proceeding of the 12th conference on Uncertainty in Artificial Intelligence[C]. 1996. 158 ~ 168.
  • 6HECKERMAN D. Bayesian networks for data mining[J]. Data Mining and Knowledge Discovery, 1997,1 ( 1 ) :79 ~ 119.
  • 7COOPER G, HERSKOVITS E. A Bayesian method for the induction of probabilistic networks from data[J]. Machine Learning , 1992,9(4) :309 ~347.
  • 8CHICKERING D, HECKERMAN D, GEIGER D. Learning Bayesian networks: search methods and experimental results [ A ]. proceedings of fifth conference on artificial intelligence and statistics[C]. 1995.
  • 9MADIGAN D, YORK J. Bayesian graphical models for discrete data[J]. International Statistical Review, 1995,63(2): 307 ~347.
  • 10GIUDICI P, CASTELO R. Improving markov chain monte carlo model search for data mining [ J ]. Machine Learning, 2003,50 ( 1-2) :215 ~232.

共引文献164

同被引文献203

引证文献20

二级引证文献99

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部