期刊文献+

基于特征选择和SVM的电信客户离网预测 被引量:5

Prediction on customer leaving the telecom network based on feature selection and SVM
下载PDF
导出
摘要 针对数据挖掘算法在预测电信客户离网时存在的过拟合问题,提出一种基于特征选择和支持向量机的电信客户离网预测算法。将原始的电信数据分别进行数据缺失值填充、数据冗余识别、数据结构化和数据归一化等预处理,得到利于分析处理的规范性数据;利用信息增益完成特征选择,提取影响客户离网的主要因素,降低数据维度,防止出现过拟合现象。将经过特征选择后的数据作为支持向量机算法的输入数据对客户是否离网进行分类,预测客户是否存在离网行为。测试结果表明,该算法预测离网客户的正确率为86%,提升了离网客户预测准确率。 To solve the overfitting problem on predicting the customer s leaving the telecom network for data mining algorithms, a new algorithm based on feature selection and support vector machine is proposed in this paper. Original telecommunication data are processed through data loss, data redundancy identification and data structure to obtain the normalized data. Using information gain for feature selection, the main factors affecting customer out of network are extracted to remove irrelevant or redundant features and then to reduce the data dimension and prevent overfitting. The data after feature selection is then used as the input data of the SVM algorithm to classify whether the customer is out of network, to predict whether the customer has behaviours of potentially leaving the telecom network. Prediction results using this algorithm show that the accuracy rate of leaving the telecom network is 86%, and thus show this algorithm can improve prediction accuracy on the customer leaving the telecom network.
作者 卢光跃 张宏建 闫真光 吴洋 LU Guangyue;ZHANG Hongjian;YAN Zhenguang;WU Yang(Shaanxi Key Laboratory of Information Communication Network and Security,Xi'an University of Posts and Telecommunications, Xi'an 710121, China)
出处 《西安邮电大学学报》 2019年第2期21-25,共5页 Journal of Xi’an University of Posts and Telecommunications
基金 陕西省工业科技攻关计划资助项目(2015GY-013,2016GY-113)
关键词 电信客户 离网预测 特征选择 支持向量机 telecommunications customer leaving the telecom network prediction feature selection support vector machine
  • 相关文献

参考文献13

二级参考文献113

共引文献239

同被引文献56

引证文献5

二级引证文献23

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部