期刊文献+

一种基于Fisher比率和预测风险准则的电信客户流失预测分步特征选择方法 被引量:4

A two-stage feature selection method based on Fisher's ratio and prediction risk for telecom customer churn prediction
下载PDF
导出
摘要 电信客户流失预测是电信运营商客户关系管理系统的一个重要问题,其目的是预测具有较高流失风险的客户.电信客户流失预测模型的构建过程包括数据预处理、不均衡处理、特征选择和分类器的训练与评估.针对电信数据集中存在的特征维度过高问题,结合过滤式特征选择和嵌入式特征选择方法的优点,提出了一种基于Fisher比率和预测风险准则的分步特征提取方法.结合真实数据集的实验结果表明,该方法能够减少特征维度,提高分类器的预测效果. Telecom customer churn prediction is crucial to the customer relationship management systems of telecom operators. It aims to predict a particular customer who is at a high risk of churning. The predicting process includes the steps of data pre-processing, imbalance processing, feature selection, classifier training and evaluation. A two-stage feature selection method based on fisher's ratio and prediction risk was proposed, which took advantage of the filter feature selection method and wrapper feature selection method to solve the high dimensionality problem of telecom customer churn prediction. The method was evaluated on a real-world dataset, and the experimental results verify that it is able to reduce feature dimensionality and improve the performance of classifiers.
出处 《中国科学技术大学学报》 CAS CSCD 北大核心 2017年第8期686-694,共9页 JUSTC
基金 Supported by the National Natural Science Foundation of China(61375079)
关键词 大数据 流失预测 分步特征选择 SPARK big data churn prediction two stage feature selection Spark
  • 相关文献

同被引文献32

引证文献4

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部