期刊文献+

针对不平衡数据集的客户流失预测算法 被引量:5

The Unbalance Dataset Analysis Algorithm in Customer Churn Prediction
下载PDF
导出
摘要 针对客户关系管理中的客户流失预测问题进行探讨,通过对客户流失数据特点的分析,以及现有预测算法的比较,将数据挖掘方法中的随机森林算法引入客户流失预测,建立预测模型,并在实际的银行业贷款客户数据集上进行实验,得到了较好的效果。 This paper focuses on the customer churn prediction in the field of customer relationship management. Based on the characteristics of customer churn data and the comparison of the current prediction algorithms, we introduce random forests algorithm, a new data mining method, into the customer churn prediction and build a prediction model. Applied to a credit debt customer database of a commercial bank, the model is proved to be effective in classifying the churn customers from the loan data.
出处 《系统工程》 CSCD 北大核心 2008年第11期99-104,共6页 Systems Engineering
基金 国家自然科学基金资且项目(70671059)
关键词 客户流失 数据挖掘 客户关系管理 预测 Customer Churn Data Mining Customer Relationship Management Prediction
  • 相关文献

参考文献12

  • 1Chandar M, Lahal A, Krishna P. Modeling churn behavior of bank customers using predictive data mining techniques[A]. National Conference on Soft Computing Techniques for Engineering Applications (SCT-2006)[C]. 2006.
  • 2Breiman L. Random forests[J].Machine Learning, 2001,45(1) :5-32.
  • 3Breiman L. Manual on setting up, using, and understanding random forests v4 [Z]. http//: PPoz. Berkeley. eduPusersPbreimanPUsing-random-forestsV4.0. pdf.
  • 4Berry M J A,Linoff G S.数据挖掘技术——市场营销、销售与客户关系管理领域应用[M].别蓉芳,尹静,邓六爱译.北京:机械工业出版社,2006.
  • 5Larivie’re B, Van den Poel D. Predicting customer retention and profitability by using random forests and regression forests techniques[J]. Expert Systems with Applications, 2005,29 : 472-484.
  • 6张华伟,王明文,甘丽新.基于随机森林的文本分类模型研究[J].山东大学学报(理学版),2006,41(3):5-9. 被引量:58
  • 7Duda R,Hart P E,Stock D G.模式分类(第二版)[M].李宏东,姚天翔译.北京:机械工业出版社,2003.
  • 8胡庆林,叶念渝,朱明富.数据挖掘中聚类算法的综述[J].计算机与数字工程,2007,35(2):17-20. 被引量:36
  • 9Chen C, Liaw A, Breiman L. Using random forest to learn imbalaneed data[Z]. 2004.
  • 10应维云,覃正,赵宇,李兵,李秀.SVM方法及其在客户流失预测中的应用研究[J].系统工程理论与实践,2007,27(7):105-110. 被引量:30

二级参考文献31

  • 1钱锋,徐麟文.知识发现中的聚类分析及其应用[J].杭州师范大学学报(自然科学版),2001,5(1):34-37. 被引量:16
  • 2许建华,张学工,李衍达.支持向量机的新发展[J].控制与决策,2004,19(5):481-484. 被引量:132
  • 3TomMMitchell.机器学习[M].北京:机械工业出版社,2003..
  • 4Margaret H.Dunham.数据挖掘教程[M].北京:清华大学出版社,2005.
  • 5T Joachims. Text categorization with support vector machines:learning with many relevant features[A]. In: The 10th European Conference on Machine Learning[C]. New York: Springer, 1998. 137-142.
  • 6E Wiener. A neural network approach to topic spotting[A].The 4th Annual Symposium on Document Analysis and Information Retrieval[C]. Las Vegas: ACM Press,1995. 317-332.
  • 7Quanlan J R. C4.5: Programs for machine learning[M]. San Francisco: Morgan Kaufmann, 1993.
  • 8Breiman L. Random forests[J]. Machine Learning, 2001,45(1):5-32.
  • 9Breiman L. Manual on setting up, using, and understanding random forests v4.0 [EB/OL]. http://oz. Berkeley. edu/users/breiman/Using-random-forests-V4.0. pdf.
  • 10Remlinger K. Introduction and application of random forest on high though put screening data from drug discovery[EB/OL].http://www4.ncsu. edu/ksremlin.

共引文献119

同被引文献151

引证文献5

二级引证文献63

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部