Global-Local Attention-Based Butterfly Vision Transformer for Visualization-Based Malware Classification

被引:6
|
作者
Belal, Mohamad Mulham [1 ]
Sundaram, Divya Meena [1 ]
机构
[1] VIT AP Univ, Sch Comp Sci & Engn, Amaravati 522237, India
关键词
Malware; Feature extraction; Transformers; Codes; Visualization; Convolutional neural networks; Behavioral sciences; Parallel processing; Vision transformer; INDEX TERMS; global attention; local attention; malware classification; visualization-based malware classifier; parallel processing;
D O I
10.1109/ACCESS.2023.3293530
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent studies, convolutional neural networks (CNNs) are mostly used as dynamic techniques for visualization-based malware classification and detection. Though vision transformer (ViT) proved its efficiency in image classification, a few of the earlier studies developed a ViT-based malware classifier. This paper proposes a butterfly construction-based vision transformer (B_ViT) model for visualization-based malware classification and detection. B_ViT has four phases: (1) image partitioning and patches embeddings; (2) local attention; (3) global attention; and (4) training and malware classification. B_ViT is an enhanced ViT architecture that supports the parallel processing of image patches and captures local and global spatial representations of malware images. B_ViT is a transfer learning-based model that uses a pre-trained ViT model on the ImageNet dataset to initialize the training parameters of transformers. Four B_ViT variants are experimented and evaluated on grayscale malware images collected from MalImg, Microsoft BIG datasets or converted from portable executable imports. The experiments show that B_ViT variants outperform the Input Enhanced vision transformer (IEViT) and ViT variants, achieving an accuracy equal to 99.49% and 99.99% for malware classification and detection respectively. The experiments also show that B_ViT is time effective for malware classification and detection where the average speed-up of B_ViT variants over IEViT and ViT variants are equal to 2.42 and 1.81 respectively. The analysis proves the efficiency of texture-based malware detection as well as the resilience of B_ViT to polymorphic obfuscation. Finally, the proposed B_ViT-based malware classifier outperforms the CNN-based malware classification methods in well.
引用
收藏
页码:69337 / 69355
页数:19
相关论文
共 50 条
  • [1] Performance comparison of visualization-based malware detection and classification techniques
    Shah, Syed Shakir Hameed
    Jamil, Norziana
    Khan, Atta Ur Rehman
    2022 17TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET'22), 2022, : 200 - 205
  • [2] PAFE: A lightweight visualization-based fast malware classification method
    Li, Sicong
    Wang, Jian
    Wang, Shuo
    Song, Yafei
    HELIYON, 2024, 10 (16)
  • [3] Malware Family Classification Based on Vision Transformer
    Li, Jing
    Luo, Xueping
    Journal of Computers (Taiwan), 2023, 34 (01) : 87 - 99
  • [4] Global-Local Self-Attention Based Transformer for Speaker Verification
    Xie, Fei
    Zhang, Dalong
    Liu, Chengming
    APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [5] BViT: Broad Attention-Based Vision Transformer
    Li, Nannan
    Chen, Yaran
    Li, Weifan
    Ding, Zixiang
    Zhao, Dongbin
    Nie, Shuai
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12772 - 12783
  • [6] Global and Local Attention-Based Transformer for Hyperspectral Image Change Detection
    Wang, Ziyi
    Gao, Feng
    Dong, Junyu
    Du, Qian
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [7] VMCTE: Visualization-Based Malware Classification Using Transfer and Ensemble Learning
    Chen, Zhiguo
    Cao, Jiabing
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (02): : 4445 - 4465
  • [8] Dynamic Global-Local Attention Network Based On Capsules for Text Classification
    Wang, Ji
    Chen, Qiaohong
    Pei, Haolei
    Sun, Qi
    Jia, Yubo
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [9] High Resolution SAR Image Classification Using Global-Local Network Structure Based on Vision Transformer and CNN
    Liu, Xingyu
    Wu, Yan
    Liang, Wenkai
    Cao, Yice
    Li, Ming
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [10] SoK: Visualization-based Malware Detection Techniques
    Brosolo, Matteo
    Vinod, P.
    Asmitha, K. A.
    Rehiman, Rafidha K. A.
    Conti, Mauro
    19TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY, AND SECURITY, ARES 2024, 2024,