Visualized Malware Multi-Classification Framework Using Fine-Tuned CNN-Based Transfer Learning Models

被引:37
作者
El-Shafai, Walid [1 ,2 ]
Almomani, Iman [1 ,3 ]
AlKhayer, Aala [1 ]
机构
[1] Prince Sultan Univ, Secur Engn Lab, Dept Comp Sci, Riyadh 11586, Saudi Arabia
[2] Menoufia Univ, Fac Elect Engn, Dept Elect & Elect Commun Engn, Menoufia 32952, Egypt
[3] Univ Jordan, King Abdullah Sch Informat Technol 2, Dept Comp Sci, Amman 11942, Jordan
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 14期
关键词
cybersecurity threats; malware visualization; detection; classification; deep learning; machine learning; CNN; transfer learning; fine-tuning; VGG16; CONVOLUTIONAL NEURAL-NETWORKS;
D O I
10.3390/app11146446
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
There is a massive growth in malicious software (Malware) development, which causes substantial security threats to individuals and organizations. Cybersecurity researchers makes continuous efforts to defend against these malware risks. This research aims to exploit the significant advantages of Transfer Learning (TL) and Fine-Tuning (FT) methods to introduce efficient malware detection in the context of imbalanced families without the need to apply complex features extraction or data augmentation processes. Therefore, this paper proposes a visualized malware multi-classification framework to avoid false positives and imbalanced datasets' challenges through using the fine-tuned convolutional neural network (CNN)-based TL models. The proposed framework comprises eight different FT CNN models including VGG16, AlexNet, DarkNet-53, DenseNet-201, Inception-V3, Places365-GoogleNet, ResNet-50, and MobileNet-V2. First, the binary files of different malware families were transformed into 2D images and then forwarded to the FT CNN models to detect and classify the malware families. The detection and classification performance was examined on a benchmark Malimg imbalanced dataset using different, comprehensive evaluation metrics. The evaluation results prove the FT CNN models' significance in detecting malware types with high accuracy that reached 99.97% which also outperforms the performance of related machine learning (ML) and deep learning (DL)-based malware multi-classification approaches tested on the same malware dataset.
引用
收藏
页数:21
相关论文
共 56 条
[1]  
Abijah Roseline S., 2020, Computer Vision and Image Processing: 4th International Conference, CVIP 2019. Communications in Computer and Information Science (1148), P62, DOI 10.1007/978-981-15-4018-9_6
[2]  
Al Khayer Aala, 2020, 2020 First International Conference of Smart Systems and Emerging Technologies (SMARTTECH), P197, DOI 10.1109/SMART-TECH49988.2020.00053
[3]  
Almomani Iman, 2021, 2021 1st International Conference on Artificial Intelligence and Data Analytics (CAIDA), P240, DOI 10.1109/CAIDA51941.2021.9425059
[4]   Android Ransomware Detection Based on a Hybrid Evolutionary Approach in the Context of Highly Imbalanced Data [J].
Almomani, Iman ;
Qaddoura, Raneeem ;
Habib, Maria ;
Alsoghyer, Samah ;
Al Khayer, Alaa ;
Aljarah, Ibrahim ;
Faris, Hossam .
IEEE ACCESS, 2021, 9 :57674-57691
[5]   Android Applications Scanning: The Guide [J].
Almomani, Iman ;
Khayer, Aala .
2019 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCIS), 2019, :357-361
[6]   A Comprehensive Analysis of the Android Permissions System [J].
Almomani, Iman M. ;
Al Khayer, Aala .
IEEE ACCESS, 2020, 8 :216671-216688
[7]  
Alsoghyer Samah, 2020, 2020 6th Conference on Data Science and Machine Learning Applications (CDMA), P94, DOI 10.1109/CDMA47397.2020.00022
[8]   Ransomware Detection System for Android Applications [J].
Alsoghyer, Samah ;
Almomani, Iman .
ELECTRONICS, 2019, 8 (08)
[9]   A dynamic Windows malware detection and prediction method based on contextual understanding of API call sequence [J].
Amer, Eslam ;
Zelinka, Ivan .
COMPUTERS & SECURITY, 2020, 92
[10]   AutoTune: Automatically Tuning Convolutional Neural Networks for Improved Transfer Learning [J].
Basha, S. H. Shabbeer ;
Vinakota, Sravan Kumar ;
Pulabaigari, Viswanath ;
Mukherjee, Snehasis ;
Dubey, Shiv Ram .
NEURAL NETWORKS, 2021, 133 :112-122