ViT4Mal: Lightweight Vision Transformer for Malware Detection on Edge Devices

Cited by: 5
Authors
Ravi, Akshara [1]
Chaturvedi, Vivek [1]
Shafique, Muhammad [2]
Affiliations
[1] Indian Institute of Technology Palakkad, Palakkad, Kerala, India
[2] NYU Abu Dhabi (NYUAD), Abu Dhabi, United Arab Emirates
Keywords
IoT; malware; vision transformer (ViT); FPGA; inference latency; hardware optimization; matrix multiplication; resource-constrained
DOI
10.1145/3609112
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
There has been a tremendous growth of edge devices connected to the network in recent years. Although these devices make our lives simpler and smarter, they need to perform computations under severe resource and energy constraints, while being vulnerable to malware attacks. Once compromised, these devices are further exploited as attack vectors targeting critical infrastructure. Most existing malware detection solutions are resource- and compute-intensive and hence perform poorly in protecting edge devices. In this paper, we propose a novel approach, ViT4Mal, that utilizes a lightweight vision transformer (ViT) for malware detection on an edge device. ViT4Mal first converts executable byte-code into images to learn malware features and later uses a customized lightweight ViT to detect malware with high accuracy. We have performed extensive experiments to compare our model with state-of-the-art CNNs in the malware detection domain. Experimental results corroborate that ViTs do not need deeper networks to achieve an accuracy of around 97%, comparable to that of heavily structured CNN models. We have also performed hardware deployment of our proposed lightweight ViT4Mal model on the Xilinx PYNQ Z1 FPGA board by applying specialized hardware optimizations such as quantization, loop pipelining, and array partitioning. ViT4Mal achieved an accuracy of approximately 94% and a 41x speedup compared to the original ViT model.
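Illustration (not from the paper): the abstract says that ViT4Mal converts executable byte-code into images before a ViT classifies them. A minimal sketch of that general idea, assuming one byte maps to one grayscale pixel and that the image is then cut into square patches as in a standard ViT, might look like the following. The function names, the fixed row width, and the zero padding are hypothetical choices made here for illustration; the record does not describe the authors' actual preprocessing.

```python
import math


def bytes_to_grayscale(data: bytes, width: int = 16) -> list[list[int]]:
    """Lay raw bytes out row by row as a 2-D grid of 0-255 pixel values.

    Assumption: one byte per pixel, last row zero-padded to full width.
    """
    if width <= 0:
        raise ValueError("width must be positive")
    height = max(1, math.ceil(len(data) / width))
    padded = data + bytes(height * width - len(data))  # zero-pad the tail
    return [list(padded[r * width:(r + 1) * width]) for r in range(height)]


def split_into_patches(image: list[list[int]], patch: int) -> list[list[int]]:
    """Cut the image into non-overlapping patch x patch tiles.

    Each tile is flattened row-major, which is the usual input form
    for the patch-embedding step of a vision transformer.
    """
    height, width = len(image), len(image[0])
    if height % patch or width % patch:
        raise ValueError("image dimensions must be multiples of patch size")
    patches = []
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            patches.append([image[top + dy][left + dx]
                            for dy in range(patch)
                            for dx in range(patch)])
    return patches


if __name__ == "__main__":
    # Eight stand-in bytes; a real input would be the bytes of an executable.
    image = bytes_to_grayscale(bytes(range(8)), width=4)
    print(image)
    print(split_into_patches(image, patch=2))
```

In practice the row width and patch size would be chosen to match the input resolution the model expects; the tiny values above only keep the example readable.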
Pages: 26