摘要
近年来,Powershell由于其易用性强、隐蔽性高的特点被广泛应用于APT攻击中,传统的基于人工特征提取和机器学习方法的恶意代码检测技术在Powershell恶意代码检测中越来越难以有效。本文提出了一种基于随机森林特征组合和深度学习的Powershell恶意代码检测方法。该方法使用随机森林生成更好表征原始数据的新特征组合,随后使用深度学习神经网络训练并进行分类识别。该方法可以弥补人工特征工程经验不足的问题,更好表征原始数据从而提高检测效果。本文实验结果显示,利用本文提出方法构建的Powershell恶意代码检测系统性能良好,在真实数据集中的召回率、准确率均在99%以上,可以对Powershell恶意代码进行有效的检测识别。
In recent years,powershell is widely used in APT attack due to its ease of use and high concealment.Traditional malicious code detection technology based on artificial feature extraction and machine learning method is more and more difficult to be effective in the detection of malicious code in PowerShell.For this reason,this paper proposes a malicious Powershell code detection method based on random forest features combination and deep learning.This method uses random forest to generate new features which better characterize the original data,and uses deep neural network to build classifiers for classification and recognition.This method can make up for the lack of experience in artificial feature engineering,and characterize the original data better,so as to improve the detection effect.The experimental results in this article show that this method has a good performance,high recall rate and accuracy rate,which can effectively detect and identify malicious Powershell code.
作者
刘岳
刘宝旭
赵子豪
刘潮歌
王晓茜
吴贤达
LIU Yue;LIU Baoxu;ZHAO Zihao;LIU Chaoge;WANG Xiaoxi;WU Xianda(Institute of Information Engineering,Chinese Academy of Sciences,Beijing 100093,China;School of Cyber Security,University of Chinese Academy of Sciences,Beijing 100049,China)
出处
《信息安全学报》
CSCD
2021年第1期40-53,共14页
Journal of Cyber Security
基金
国家自然科学基金项目(No.61902396)
中国科学院战略性先导科技专项项目(No.XDC02040100)
中国科学院网络测评技术重点实验室和网络安全防护技术北京市重点实验室资助。