期刊文献+

基于数据挖掘的手机用户换机行为预测研究 被引量:4

Prediction of Mobile Users for Updating Terminal Research Based on Data Mining
原文传递
导出
摘要 首先对用户数据进行特征分析,变量选择,然后又采集了大量与手机性能相关的数据来扩充数据集,最后利用现代数据挖掘手段对用户的换机行为进行预测,讨论并比较了各种方法对换机预测的准确性.通过对用户数据集进行测试实验,表明变量选择与补充能够有效地提高移动用户换机的预测结果,并且Xgboost方法在各种分析工具中的表现更为优越. This paper, first of all, makes an analysis of the characteristics of user data, variable selectioa, and then collects a large number of new data to expand data sets. At last, this paper makes use of modern data mining methods to predict mobile users for updating terminal behavior, discusses and compares the various methods' accuracy. Based on user data set for testing experiment, it shows that variable selection and adding variables can effectively improve the accuracy of prediction, and Xgboost is more superior performance in various analysis tools.
出处 《数学的实践与认识》 北大核心 2017年第16期71-80,共10页 Mathematics in Practice and Theory
基金 国家自然科学基金 高维数据变量间非线性交互作用的研究(11571009)
关键词 添加变量 变量选择 换机预测 Xgboost adding variables variable selection users for updating terminal prediction Xgboost
  • 相关文献

参考文献3

二级参考文献145

  • 1张启蕊,张凌,董守斌,谭景华.训练集类别分布对文本分类的影响[J].清华大学学报(自然科学版),2005,45(S1):1802-1805. 被引量:26
  • 2HanJiawei MichelineKambe.数据挖掘概念与技术[M].北京:机械工业出版社,2001..
  • 3Mehta M, Agrawal R, Rissanen J. SLIQ: A Fast Scalable Classifier for Data Mining[A]. Lecture Notes in Computer Sci. Proc. of the 5th Int.Conf. on Extending Database Tech. [C], 1996:18-33
  • 4Shafer J C, Agrawal R, Mehta M. SPRINT: A Scalable Parallel Classifier for Data Mining[A]. Mumbai(Bombay), India: Proc. of the 22nd Int. Conf. on Very Large Databases[C], 1996
  • 5Friedman N, Geiger D, Goldszmidt M. Bayesian Network Classifier[J].Machine Learning, 1997, 29( 1 ):131 - 163
  • 6Liu B, Hsu W, Ma Y. Integrating Classification and Association Rule Mining[A]. Agrawal R. Proc. of the 4th Int. Conf. on Knowledge Discovery and DataMining[C], NY, USA: AAAI Press, 1998:80-86
  • 7Probost F.Machine learning from imbalanced data sets 101.In:the AAAI'2000 Workshop on Imbalanced Data Sets.2000.
  • 8Pednault E P D,Rosen B K,Apte C.Handling Imbalanced Data Sets in Insurance Risk Modeling:[Technical Report RC-21731].IBM Research Re port,March 2000.
  • 9Stolfo S J,Fan D W S,Lee W,et al.Prodromids,Chan P K.Credit Card Fraud Detection Using Meta-Learning:Issuesand Initial Results.In:AAAI-97 Workshop on Al Methods in Fraud and Risk M anagement,1997.
  • 10Batista G E A P A,Bazzan A L C,Monard M C.Balancing Training Data For Automated Annotation of Keywords:a Case Study.In:Proc.of the Second Brazilian Workshop on Bioinformatics.SBC,2003.

共引文献126

同被引文献30

引证文献4

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部