期刊文献+

改进随机森林算法在Android恶意软件检测中的应用 被引量:3

Application of Improved Random Forest Algorithm in Android Malware Detection
下载PDF
导出
摘要 Random Forest作为一种常见的机器学习算法,不仅具备较高的分类回归性能,而且快速高效.传统的Random Forest算法并未在决策树的生成和选择上做深入研究,在本文中笔者提出一种降序去冗的寻优方式对机器学习中监督学习算法Random Forest进行改进,在保证准确率的同时减少随机森林的冗余度,并应用于Android系统的恶意软件检测.经过五折交叉验证法验证,改进的Random Forest算法能够在较低的冗余度下保证较高的准确率,同时改进的算法准确率在与同条件下的原算法的准确率以及OOB模型下的准确率相差在1%以内,在与单模型分类算法KNN和集成式学习算法Adaboost M1的对比试验中改进的Random Forest算法要优于以上两者. As a common machine learning algorithm, Random Forest not only has high classification and regression performance, but also fast and efficient. In this paper, the authors attempt to use a method called descending remove redundancy to improve the supervised learning random forest algorithm, and improved algorithm can reduce the redundancy while guaranteeing the accuracy, thus applied to malware detection in Android system. Five-fold cross validation method was used in the experiment, and the experimental data show that the improved random forest algorithm can guarantee the higher accuracy at the lowest redundancy,the error among the accuracy of the improved algorithm, the original algorithm and the OOB model under the same conditions are less than 1%, compared with the single model classification algorithm KNN and the ensemble learning algorithm Adaboost M1, the improved algorithm is better.
出处 《新疆大学学报(自然科学版)》 CAS 北大核心 2017年第3期322-327,共6页 Journal of Xinjiang University(Natural Science Edition)
基金 国家自然科学基金(61303231)
关键词 RANDOM FOREST ANDROID 降序去冗 寻优 Random Forest Android descending remove redundancy optimizing
  • 相关文献

参考文献4

二级参考文献56

  • 1王莉.自适应遗传算法在PID控制器设计中的应用[J].西安科技大学学报,2005,25(1):93-95. 被引量:4
  • 2鲍文胜,刘晓刚.基于自适应遗传算法的RBF神经网络优化算法[J].山东师范大学学报(自然科学版),2007,22(3):37-39. 被引量:13
  • 3VapnikVN.统计学习理论的本质[M].北京:清华大学出版社,2000..
  • 4周明,孙树栋.遗传算法原理及应用[M].北京:国防工业出版社,2001.
  • 5HASTIE T,TIBSHRANI R,FRIEDMAN J. The elements of statistical learning:data mining,inference,and prediction[M].New York:Springer,2001.
  • 6WASSEMAN L. Bayesian model selection and model averaging[J].Journal of Mathematical Psychology,2000,44(1):92-107.
  • 7SPIEGELHALTER D J,BES N G,CARLIN B P,et al. Bayesian measures of model complexity and fit[J].Statistical Methodology,2002,64(4):583-639.
  • 8KADANE J B,LAZAR N A. Methods and criteria for model selection[J].Journal of the American Statistical Association,2004,99(465):279-290.
  • 9FRONMONT M. Model selection by bootstrap penalization for classification[J].Machine Learning,2007,66(2-3):165-207.
  • 10BENGIO Y,GRANDVALET Y. No unbiased estimator of variance of K-fold cross validation[J].Machine Learning,2004,5:1089-1105.

共引文献134

同被引文献24

引证文献3

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部