ViT4Mal: Lightweight Vision Transformer for Malware Detection on Edge Devices

被引：5

作者：

Ravi, Akshara ^{[1
]}

Chaturvedi, Vivek ^{[1
]}

Shafique, Muhammad ^{[2
]}

机构：

[1] Indian Inst Technol Palakkad, Palakkad, Kerala, India

[2] NYU Abu Dhabi NYUAD, Abu Dhabi, U Arab Emirates

来源：

ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS | 2023年 / 22卷 / 05期

关键词：

IoT; malware; vision transformer (ViT); FPGA; inference latency; hardware optimization; matrix multiplication; resource-constrained;

D O I：

10.1145/3609112

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

There has been a tremendous growth of edge devices connected to the network in recent years. Although these devices make our life simpler and smarter, they need to perform computations under severe resource and energy constraints, while being vulnerable to malware attacks. Once compromised, these devices are further exploited as attack vectors targeting critical infrastructure. Most existing malware detection solutions are resource and compute-intensive and hence perform poorly in protecting edge devices. In this paper, we propose a novel approach ViT4Mal that utilizes a lightweight vision transformer (ViT) for malware detection on an edge device. ViT4Mal first converts executable byte-code into images to learn malware features and later uses a customized lightweight ViT to detect malware with high accuracy. We have performed extensive experiments to compare our model with state-of-the-art CNNs in the malware detection domain. Experimental results corroborate that ViTs don't demand deeper networks to achieve comparable accuracy of around 97% corresponding to heavily structured CNN models. We have also performed hardware deployment of our proposed lightweight ViT4Mal model on the Xilinx PYNQ Z1 FPGA board by applying specialized hardware optimizations such as quantization, loop pipelining, and array partitioning. ViT4Mal achieved an accuracy of similar to 94% and a 41x speedup compared to the original ViT model.

引用

页数：26

共 58 条

[1] Abnar S, 2020, Arxiv, DOI arXiv:2005.00928
[2] Alhanahnah M, 2018, IEEE CONF COMM NETW
[3] Anjali Raja K., 2022, Vision Transformers Shaping the Architecture of Computer Vision
[4] [Anonymous], 2016, Hacked Cameras, DVRs Powered Today's Massive Internet Outage
[5] Aslan O., 2021, EUROPEAN J ENG TECHN, V6, P1, DOI [10.24018/ejeng.2021.6.3.2372, DOI 10.24018/EJENG.2021.6.3.2372]
[6] Av-TEST, 2022, Malware
[7] Robust Malware Detection for Internet of (Battlefield) Things Devices Using Deep Eigenspace Learning
Azmoodeh, Amin
Dehghantanha, Ali
Choo, Kim-Kwang Raymond
[J]. IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2019, 4 (01): : 88 - 95
[8] Parallel-CNN network for malware detection
Bakhshinejad, Nazanin
Hamzeh, Ali
[J]. IET INFORMATION SECURITY, 2020, 14 (02) : 210 - 219
[9] Vision Transformers for Remote Sensing Image Classification
Bazi, Yakoub
Bashmal, Laila
Rahhal, Mohamad M. Al
Dayil, Reham Al
Ajlan, Naif Al
[J]. REMOTE SENSING, 2021, 13 (03) : 1 - 20
[10] Bekerman D, 2015, IEEE CONF COMM NETW, P134, DOI 10.1109/CNS.2015.7346821

← 1 2 3 4 5 6 →