Visualized Malware Multi-Classification Framework Using Fine-Tuned CNN-Based Transfer Learning Models

被引：37

作者：

El-Shafai, Walid ^{[1
,2
]}

Almomani, Iman ^{[1
,3
]}

AlKhayer, Aala ^{[1
]}

机构：

[1] Prince Sultan Univ, Secur Engn Lab, Dept Comp Sci, Riyadh 11586, Saudi Arabia

[2] Menoufia Univ, Fac Elect Engn, Dept Elect & Elect Commun Engn, Menoufia 32952, Egypt

[3] Univ Jordan, King Abdullah Sch Informat Technol 2, Dept Comp Sci, Amman 11942, Jordan

来源：

APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 14期

关键词：

cybersecurity threats; malware visualization; detection; classification; deep learning; machine learning; CNN; transfer learning; fine-tuning; VGG16; CONVOLUTIONAL NEURAL-NETWORKS;

D O I：

10.3390/app11146446

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

There is a massive growth in malicious software (Malware) development, which causes substantial security threats to individuals and organizations. Cybersecurity researchers makes continuous efforts to defend against these malware risks. This research aims to exploit the significant advantages of Transfer Learning (TL) and Fine-Tuning (FT) methods to introduce efficient malware detection in the context of imbalanced families without the need to apply complex features extraction or data augmentation processes. Therefore, this paper proposes a visualized malware multi-classification framework to avoid false positives and imbalanced datasets' challenges through using the fine-tuned convolutional neural network (CNN)-based TL models. The proposed framework comprises eight different FT CNN models including VGG16, AlexNet, DarkNet-53, DenseNet-201, Inception-V3, Places365-GoogleNet, ResNet-50, and MobileNet-V2. First, the binary files of different malware families were transformed into 2D images and then forwarded to the FT CNN models to detect and classify the malware families. The detection and classification performance was examined on a benchmark Malimg imbalanced dataset using different, comprehensive evaluation metrics. The evaluation results prove the FT CNN models' significance in detecting malware types with high accuracy that reached 99.97% which also outperforms the performance of related machine learning (ML) and deep learning (DL)-based malware multi-classification approaches tested on the same malware dataset.

引用

页数：21

共 56 条

[1]

Abijah Roseline S., 2020, Computer Vision and Image Processing: 4th International Conference, CVIP 2019. Communications in Computer and Information Science (1148), P62, DOI 10.1007/978-981-15-4018-9_6

[2]

Al Khayer Aala, 2020, 2020 First International Conference of Smart Systems and Emerging Technologies (SMARTTECH), P197, DOI 10.1109/SMART-TECH49988.2020.00053

[3]

Almomani Iman, 2021, 2021 1st International Conference on Artificial Intelligence and Data Analytics (CAIDA), P240, DOI 10.1109/CAIDA51941.2021.9425059

[4] Android Ransomware Detection Based on a Hybrid Evolutionary Approach in the Context of Highly Imbalanced Data [J].