S-DCNN: stacked deep convolutional neural networks for malware classification

被引:9
作者
Parihar, Anil Singh [1 ]
Kumar, Shashank [1 ]
Khosla, Savya [1 ]
机构
[1] Delhi Technol Univ, Dept Comp Sci & Engn, Machine Learning Res Lab, Delhi, India
关键词
Deep convolutional neural networks; Ensemble model; Malware classification; Pattern recognition; Security; Transfer learning;
D O I
10.1007/s11042-022-12615-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Malware classification continues to be exceedingly difficult due to the exponential growth in the number and variants of malicious files. It is crucial to classify malicious files based on their intent, activity, and threat to have a robust malware protection and post-attack recovery system in place. This paper proposes a novel deep learning-based model, S-DCNN, to classify malware binary files into their respective malware families efficiently. S-DCNN uses the image-based representation of the malware binaries and leverages the concepts of transfer learning and ensemble learning. The model incorporates three deep convolutional neural networks, namely ResNet50, Xception, and EfficientNet-B4. The ensemble technique is used to combine these component models' predictions and a multilayered perceptron is used as a meta classifier. The ensemble technique fuses the diverse knowledge of the component models, resulting in high generalizability and low variance of the S-DCNN. Further, it eliminates the use of feature engineering, reverse engineering, disassembly, and other domain-specific techniques earlier used for malware classification. To establish S-DCNN's robustness and generalizability, the performance of proposed model is evaluated on the Malimg dataset, a dataset collected from VirusShare, and packed malware dataset counterparts of both Malimg and VirusShare datasets. The proposed method achieves a state-of-the-art 10-fold accuracy of 99.43% on the Malimg dataset and an accuracy of 99.65% on the VirusShare dataset.
引用
收藏
页码:30997 / 31015
页数:19
相关论文
共 45 条
[1]  
Alsulami B, 2018, PROCEEDINGS OF THE 2018 13TH INTERNATIONAL CONFERENCE ON MALICIOUS AND UNWANTED SOFTWARE (MALWARE 2018), P103, DOI 10.1109/MALWARE.2018.8659358
[2]  
Beek C, 2019, MCAFEE LABS THREATS
[3]   Evolution of automatic visual description techniques-a methodological survey [J].
Bhowmik, Arka ;
Kumar, Sanjay ;
Bhat, Neeraj .
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (18) :28015-28059
[4]   Eye Disease Prediction from Optical Coherence Tomography Images with Transfer Learning [J].
Bhowmik, Arka ;
Kumar, Sanjay ;
Bhat, Neeraj .
ENGINEERING APPLICATIONS OF NEURAL NETWORKSX, 2019, 1000 :104-114
[5]   Random CapsNet for est model for imbalanced malware type classification task [J].
Cayir, Aykut ;
Unal, Ugur ;
Dag, Hasan .
COMPUTERS & SECURITY, 2021, 102
[6]  
Chaudhary P, 2021, ADV COMMUNICATION CO, P1085, DOI DOI 10.1007/978-981-15-5341-7_82
[7]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[8]   Detection of Malicious Code Variants Based on Deep Learning [J].
Cui, Zhihua ;
Xue, Fei ;
Cai, Xingjuan ;
Cao, Yang ;
Wang, Gai-ge ;
Chen, Jinjun .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (07) :3187-3196
[9]   Association rule-based malware classification using common subsequences of API calls [J].
D'Angelo, Gianni ;
Ficco, Massimo ;
Palmieri, Francesco .
APPLIED SOFT COMPUTING, 2021, 105
[10]   Malware classification for the cloud via semi-supervised transfer learning [J].
Gao, Xianwei ;
Hu, Changzhen ;
Shan, Chun ;
Liu, Baoxu ;
Niu, Zequn ;
Xie, Hui .
JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2020, 55