期刊文献+

基于分类精度和相关性的随机森林算法改进 被引量:14

Improvement of Random Forests Algorithm Based on Classification Accuracy and Correlation
下载PDF
导出
摘要 为了提升传统随机森林算法的分类精度,首先对传统随机森林模型中的决策树根据分类性能评价指标AUC(area under curve)值进行降序排列,从中选取出AUC值高的决策树,计算这些决策树之间的相似度,并生成相似度矩阵;然后根据相似度矩阵对这些决策树进行聚类。从每一类中选出一棵AUC最大的决策树组成新的随机森林模型,从而达到提升传统随机森林算法分类精度的目的。通过UCI(university of Californialrvine)数据集的实验表明,改进后的随机森林算法在分类精度上最大提高了2.91%。 In order to improve the classification accuracy of random forests algorithm, the decision trees in the random forest model are first sorted according to the AUC value of the classification performance evaluation index.And then the trees with high AUC value is selected to calculate the similarity matrix.Finally the decision tree is clustered according to the similarity matrix.So a new random forest model is generated by selecting the tree with the highest AUC value from each category and to achieve the goal of improving the accuracy of random forests algorithm.Experiments on UCI datasets show that the improved random forest algorithm has improved the highest classification accuracy of 2.91%.
出处 《科学技术与工程》 北大核心 2017年第20期67-72,共6页 Science Technology and Engineering
基金 国家"863"计划(2014AA015204) 山西省国际科技合作项目(2014081018-2)资助
关键词 随机森林 分类精度 决策树相似度 相似度矩阵 random forest classification accuracy the similarity among decision trees similarity matrix
  • 相关文献

参考文献3

二级参考文献62

  • 1刘微,罗林开,王华珍.基于随机森林的基金重仓股预测[J].福州大学学报(自然科学版),2008,36(S1):134-139. 被引量:8
  • 2林成德,彭国兰.随机森林在企业信用评估指标体系确定中的应用[J].厦门大学学报(自然科学版),2007,46(2):199-203. 被引量:36
  • 3Breiman L. Bagging Preditors [J].Machine Learning, 1996,24(2).
  • 4Dietterich T. An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting and Randomization [J].Machine Learning, 2000,40(2).
  • 5Ho T K. The Random Subspace Method for Constructing Decision Forests [J].Trans. on Pattern Analysis and Machine Intelligence, 1998,20 (8).
  • 6Amit Y, Gernan D. Shape Quantization and Recognition with Randomized Trees[J]. Neural Computation, 1997,9(7). Breiman L Random Forest[J]. Machine Learning, 2001,45(1).
  • 7Breiman L. Random Forests[J]. Machine Learning, 2001,45(1).
  • 8Tibshirani tL Bias, Variance, and Prediction Error for Classification Rules[C]. Technical Report, Statistics Department, University of Toronto, 1996.
  • 9Wolpert D H, Macready W G. An Efficient Method to Estimate Bagging's Generalization Error[J]. Machine Learning, 1999,35(1).
  • 10Breiman L. Out-of-bag Estimation[EB/OL]. [2010- 06- 30]. http//stat, berkeley, edu/ pub/ users/ breiman / OOB estimation, ps.

共引文献670

同被引文献91

引证文献14

二级引证文献197

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部