Image-based malware representation approach with EfficientNet convolutional neural networks for effective malware classification

被引:36
作者
Chaganti, Rajasekhar [1 ]
Ravi, Vinayakumar [2 ]
Pham, Tuan D. [2 ]
机构
[1] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
[2] Prince Mohammad Bin Fahd Univ, Ctr Artificial Intelligence, Khobar, Saudi Arabia
关键词
CyberSecurity; Malware classification; Machine learning; Deep learning; CNN architectures; Transfer learning; EfficientNet; Malware image representation; Malware visualization; ARCHITECTURES; VISUALIZATION;
D O I
10.1016/j.jisa.2022.103306
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The targeted malware attacks are usually created by few crime groups. They may essentially use their existing malware sample malicious code to rebuild the variants for sophistication and evade the malware detection. This trend emphasizes the importance of performing the malware family classification for applying the effective malware mitigation and prevention strategies. In this paper, we propose an efficient neural network model EfficientNetB1 to perform the malware family classification using the malware byte level image representation technique. To alleviate the computation resource consumption caused by deep learning (DL) models training and testing the various Convolutional Neural Network (CNN) based models, we have performed the performance and computational efficiency evaluation of the various CNN pretrained models to select the best CNN network architecture for malware classification. Additionally, the CNN pretrained models are evaluated against the different types of malware image representation methods, which are distinguished based on selection of the image width size. Our evaluation of the proposed model EfficientNetB1 shows that it has achieved an accuracy of 99% to classify the Microsoft Malware Classification Challenge (MMCC) malware classes using the malware image representation with fixed image width and also require fewer network parameters compared to other pretrained models to achieve the performance accuracy. Furthermore, various visualization techniques were used to compare the performances of the various CNN pretrained models.
引用
收藏
页数:19
相关论文
共 36 条
[1]  
Angelo G, 2021, APPL SOFT COMPUT, V105
[2]   A Comprehensive Review on Malware Detection Approaches [J].
Aslan, Omer ;
Samet, Refik .
IEEE ACCESS, 2020, 8 :6249-6271
[3]   Parallel-CNN network for malware detection [J].
Bakhshinejad, Nazanin ;
Hamzeh, Ali .
IET INFORMATION SECURITY, 2020, 14 (02) :210-219
[4]  
Constantin Lucian, 2020, CSO Online
[5]  
Conti G, 2008, LECT NOTES COMPUT SC, V5210, P1, DOI 10.1007/978-3-540-85933-8_1
[6]  
Cook S., 2021, MALWARE STAT 2021 FR
[7]   Detection of Malicious Code Variants Based on Deep Learning [J].
Cui, Zhihua ;
Xue, Fei ;
Cai, Xingjuan ;
Cao, Yang ;
Wang, Gai-ge ;
Chen, Jinjun .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (07) :3187-3196
[8]   Malware detection in mobile environments based on Autoencoders and API-images [J].
D'Angelo, Gianni ;
Ficco, Massimo ;
Palmieri, Francesco .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 137 :26-33
[9]   Windows Malware Detector Using Convolutional Neural Network Based on Visualization Images [J].
Darshan, Shiva S. L. ;
Jaidhar, C. D. .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2021, 9 (02) :1057-1069
[10]   Malware Visualization for Fine-Grained Classification [J].
Fu, Jianwen ;
Xue, Jingfeng ;
Wang, Yong ;
Liu, Zhenyan ;
Shan, Chun .
IEEE ACCESS, 2018, 6 :14510-14523