Transfer Learning for Image-Based Malware Detection for IoT

被引:10
作者
Panda, Pratyush [1 ]
Om Kumar, C. U. [1 ]
Marappan, Suguna [1 ]
Ma, Suresh [2 ]
Manimurugan, S. [3 ]
Nandi, Deeksha Veesani [4 ]
机构
[1] Vellore Inst Technol, Sch Comp Sci & Engn, Chennai 600127, India
[2] Amrita Vishwa Vidyapeetham, Amrita Sch Business, Coimbatore 641112, India
[3] Univ Tabuk, Fac Comp & Informat Technol, Tabuk 71491, Saudi Arabia
[4] Virtusa Consulting Serv, Tech Lead, Chennai 603103, India
关键词
malware detection; CNN; transfer learning; ensemble; autoencoder; GRU; MLP; MalImg;
D O I
10.3390/s23063253
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The tremendous growth in online activity and the Internet of Things (IoT) led to an increase in cyberattacks. Malware infiltrated at least one device in almost every household. Various malware detection methods that use shallow or deep IoT techniques were discovered in recent years. Deep learning models with a visualization method are the most commonly and popularly used strategy in most works. This method has the benefit of automatically extracting features, requiring less technical expertise, and using fewer resources during data processing. Training deep learning models that generalize effectively without overfitting is not feasible or appropriate with large datasets and complex architectures. In this paper, a novel ensemble model, Stacked Ensemble-autoencoder, GRU, and MLP or SE-AGM, composed of three light-weight neural network models-autoencoder, GRU, and MLP-that is trained on the 25 essential and encoded extracted features of the benchmark MalImg dataset for classification was proposed. The GRU model was tested for its suitability in malware detection due to its lesser usage in this domain. The proposed model used a concise set of malware features for training and classifying the malware classes, which reduced the time and resource consumption in comparison to other existing models. The novelty lies in the stacked ensemble method where the output of one intermediate model works as input for the next model, thereby refining the features as compared to the general notion of an ensemble approach. Inspiration was drawn from earlier image-based malware detection works and transfer learning ideas. To extract features from the MalImg dataset, a CNN-based transfer learning model that was trained from scratch on domain data was used. Data augmentation was an important step in the image processing stage to investigate its effect on classifying grayscale malware images in the MalImg dataset. SE-AGM outperformed existing approaches on the benchmark MalImg dataset with an average accuracy of 99.43%, demonstrating that our method was on par with or even surpassed them.
引用
收藏
页数:30
相关论文
共 75 条
[61]  
Statista, OUR RES CONT PHIL
[62]   Lightweight Classification of IoT Malware Based on Image Recognition [J].
Su, Jiawei ;
Vargas, Danilo Vasconcellos ;
Prasad, Sanjiva ;
Sgandurra, Daniele ;
Feng, Yaokai ;
Sakurai, Kouichi .
2018 IEEE 42ND ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC 2018), VOL 2, 2018, :664-669
[63]   MCFT-CNN: Malware classification with fine-tune convolution neural networks using traditional and transfer learning in Internet of Things [J].
Sudhakar ;
Kumar, Sushil .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 125 :334-351
[64]  
Tang Adrian, 2014, Research in Attacks, Intrusions and Defenses. 17th International Symposium (RAID 2014). Proceedings: LNCS 8688, P109, DOI 10.1007/978-3-319-11379-1_6
[65]  
Tang A., 2013, ACM SIGARCH computer architecture news, V41, P559
[66]  
towardsdatascience, APPL DEEP LEARNING 3
[67]  
Understanding GRU Networks, THIS ART I WILL TRY
[68]   Android malware detection based on image-based features and machine learning techniques [J].
Unver, Halil Murat ;
Bakour, Khaled .
SN APPLIED SCIENCES, 2020, 2 (07)
[69]   IMCFN: Image-based malware classification using fine-tuned convolutional neural network architecture [J].
Vasan, Danish ;
Alazab, Mamoun ;
Wassan, Sobia ;
Naeem, Hamad ;
Safaei, Babak ;
Zheng, Qin .
COMPUTER NETWORKS, 2020, 171
[70]   Image-Based malware classification using ensemble of CNN architectures (IMCEC) [J].
Vasan, Danish ;
Alazab, Mamoun ;
Wassan, Sobia ;
Safaei, Babak ;
Zheng, Qin .
COMPUTERS & SECURITY, 2020, 92