期刊文献+

基于指导性正则化随机森林SMOTEBoost的算法与应用 被引量:5

Algorithm and Application of SMOTEBoost Based on Guidance Regularization
下载PDF
导出
摘要 针对企业财务风险研究指标数据中常常存在不平衡现象造成模型算法预测精度较差的问题,文章通过构造基于指导性正则化随机森林的SMOTEBoost算法分析企业财务风险因素,以此提升企业财务风险预测准确性,同时筛选出显著影响企业发生财务风险的特征变量。数值模拟结果发现SMOTEBoost-GRRF算法预测精度好于其他算法,且具有较优的特征变量筛选能力。实证研究结果发现每股资本公积金、营运资金、营业总成本/营业总收入、每股收益增长率四种指标是最能显著影响企业发生财务风险的因素。 Aiming at the problem that the prediction accuracy of the model algorithm is poor due to the unbalanced phenomenon in the index data of enterprise financial risk research,this paper analyzes the enterprise financial risk factors by constructing SMOTEBoost algorithm based on instructional regularized random forest,by which to improve the accuracy of enterprise financial risk forecasting.Meanwhile,the paper screens the characteristic variables that significantly affect the financial risk of enterprises.Numerical simulation results show that the SMOTEBoost-GRRF algorithm has better prediction accuracy than other algorithms,and it has better feature variable screening capabilities.The empirical results further confirms that capital accumulation fund per share,working capital,total operating cost/total operating revenue and growth rate of earnings per share are the four indexes that can most significantly affect the financial risk factors of enterprises.
作者 赵浩 鲁亚军 高洁 张汝飞 Zhao Hao;Lu Yajun;Gao Jie;Zhang Rufei(Postdoctoral Workstation of Credit Information Center of the People's Bank of China,Beijing 100031,China;Credit Information Center of the People's Bank of China,Beijing 100031,China;College of Economics and Trade,Hebei University of Geosciences,Shijiazhuang 050031,China)
出处 《统计与决策》 CSSCI 北大核心 2020年第4期9-14,共6页 Statistics & Decision
基金 河北省社会科学基金资助项目(HB17YJ024)。
关键词 企业财务风险 随机森林 SMOTEBoost算法 corporate financial risk random forest SMOTEBoost algorithm
  • 相关文献

参考文献2

二级参考文献59

  • 1石晓军,肖远文,任若恩.Logistic违约率模型的最优样本配比与分界点研究[J].财经研究,2005,31(9):38-48. 被引量:49
  • 2鲜文铎,向锐.基于混合Logit模型的财务困境预测研究[J].数量经济技术经济研究,2007,24(9):68-76. 被引量:38
  • 3WU Xin-dong,KUMAR V,QUINLAN J R,et al.Top 10 algorithms in data mining[J].Knowledge and Information Systems,2008,14(1):1-37.
  • 4CHAWLA N V,JAPKOWICZ N,KOTCZ A.Editorial:special issue on learning from imbalanced data sets[J].ACM SIGKDD Explorations Newsletter,2004,6(1):1-6.
  • 5HE Hai-bo,GARCIA E A.Learning from imbalanced data[J].IEEE Trans on Knowledge and Data Engineering,2009,21(9):1263-1284.
  • 6TING K M.A comparative study of cost-sensitive boosting algorithms[C]//Proc of the 17th International Conference on Machine Learning.2000:983-990.
  • 7FAN Wei,STOLFO S J,ZHANG Jun-xin,et al.AdaCost:misclassification cost-sensitive boosting[C]//Proc of the 16th International Conference on Machine Learning.1999:97-105.
  • 8SUN Yan-min,KAMEL M S,WONG A K C,et al.Cost-sensitive boosting for classification of imbalanced data[J].Pattern Recognition,2007,40(12):3358-3378.
  • 9GALAR M,FERNNDEZ A,BARRENCHEA E,et al.EUSBoost:enhancing ensembles for highly imbalanced data-sets by evolutionary undersampling[J].Pattern Recognition,2013,46(12):3460-3471.
  • 10JOSHI M V,KUMAR V,AGARWAL R C.Evaluating boosting algorithms to classify rare classes:comparison and improvements[C]//Proc of IEEE International Conference on Data Mining.Washington DC:IEEE Computer Society,2001:257-264.

共引文献130

同被引文献37

引证文献5

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部