Android malware has exploded in popularity in recent years,due to the platform’s dominance of the mobile market.With the advancement of deep learning technology,numerous deep learning-based works have been proposed f...Android malware has exploded in popularity in recent years,due to the platform’s dominance of the mobile market.With the advancement of deep learning technology,numerous deep learning-based works have been proposed for the classification of Android malware.Deep learning technology is designed to handle a large amount of raw and continuous data,such as image content data.However,it is incompatible with discrete features,i.e.,features gathered from multiple sources.Furthermore,if the feature set is already well-extracted and sparsely distributed,this technology is less effective than traditional machine learning.On the other hand,a wide learning model can expand the feature set to enhance the classification accuracy.To maximize the benefits of both methods,this study proposes combining the components of deep learning based on multi-branch CNNs(Convolutional Network Neural)with wide learning method.The feature set is evaluated and dynamically partitioned according to its meaning and generalizability to subsets when used as input to the model’s wide or deep component.The proposed model,partition,and feature set quality are all evaluated using the K-fold cross validation method on a composite dataset with three types of features:API,permission,and raw image.The accuracy with Wide and Deep CNN(WDCNN)model is 98.64%,improved by 1.38%compared to RNN(Recurrent Neural Network)model.展开更多
文摘Android malware has exploded in popularity in recent years,due to the platform’s dominance of the mobile market.With the advancement of deep learning technology,numerous deep learning-based works have been proposed for the classification of Android malware.Deep learning technology is designed to handle a large amount of raw and continuous data,such as image content data.However,it is incompatible with discrete features,i.e.,features gathered from multiple sources.Furthermore,if the feature set is already well-extracted and sparsely distributed,this technology is less effective than traditional machine learning.On the other hand,a wide learning model can expand the feature set to enhance the classification accuracy.To maximize the benefits of both methods,this study proposes combining the components of deep learning based on multi-branch CNNs(Convolutional Network Neural)with wide learning method.The feature set is evaluated and dynamically partitioned according to its meaning and generalizability to subsets when used as input to the model’s wide or deep component.The proposed model,partition,and feature set quality are all evaluated using the K-fold cross validation method on a composite dataset with three types of features:API,permission,and raw image.The accuracy with Wide and Deep CNN(WDCNN)model is 98.64%,improved by 1.38%compared to RNN(Recurrent Neural Network)model.