Multimodal malware classification using proposed ensemble deep neural network framework

被引：0

作者：

Sadia Nazim ^{[1
]}

Muhammad Mansoor Alam ^{[6
]}

Safdar Rizvi ^{[2
]}

Jawahir Che Mustapha ^{[3
]}

Syed Shujaa Hussain ^{[1
]}

Mazliham Mohd Su’ud ^{[4
]}

机构：

[1] Universiti Kuala Lumpur,Malaysian Institute of Information Technology

[2] Riphah International University,Faculty of Computing

[3] Bahria University Karachi Campus,Department of Computer Science

[4] Sir Syed CASE Institute of Technology,Department of Computer Science

[5] Multimedia University,Department of Computer Science

[6] Bahria University Islamabad Campus,undefined

来源：

Scientific Reports | / 15卷 / 1期

关键词：

Malware detection; Multimodal analysis; Deep learning; Late fusion; Android malware; Artificial intelligence; Cyberattacks;

D O I：

10.1038/s41598-025-96203-3

中图分类号：

学科分类号：

摘要：

In the contemporary technological world, fortifying cybersecurity defense against dynamic threat landscapes is imperative. Malware detectors play a critical role in this endeavor, utilizing various techniques such as statistical analysis, static and dynamic analysis, and machine learning (ML) to compare signatures and identify threats. Deep learning (DL) aids in accurately classifying complex malware features. The cross-domain research in data fusion strives to integrate information from multiple sources to augment reliability and minimize errors in detecting sophisticated cyber threats. This collaborative approach is the least addressed and pivotal for protecting against the advancing environment of modern malware attacks. This study presents a state-of-the-art malware analysis framework that employs a multimodal approach by integrating malware images and numeric features for effective malware classification. The experiments are performed sequentially, encompassing data preprocessing, feature selection using Neighbourhood Component Analysis (NCA), and dataset balancing with Synthetic Minority Over-sampling Technique (SMOTE). Subsequently, the late fusion technique is utilized for multimodal classification by employing Random Under Sampling and Boosting (RUSBoost) and the proposed ensemble deep neural network. The RUSBoost technique involves random undersampling and adaptive boosting to moderate bias toward majority classes while improving minority class (malware) detection. Multimodal Late fusion experimental results (95.36%) of RUSBoost (numeric) and the proposed model (imagery) outperform the standalone prevailing results for imagery (95.02%) and numeric (93.36%) data. The effectiveness of the proposed model is verified through the evaluation metrics such as Recall (86.5%), F1-score (85.0%), and Precision (79.9%). The multimodal late fusion of numeric and visual data makes the model more robust in detecting diverse malware variants. The experimental outcomes demonstrate that multimodal analysis may efficiently increase the identification strength and accuracy, especially when majority vote and bagging are employed for late fusion.

引用