FastText-Based Local Feature Visualization Algorithm for Merged Image-Based Malware Classification Framework for Cyber Security and Cyber Defense

被引:20
|
作者
Jang, Sejun [1 ]
Li, Shuyu [1 ]
Sung, Yunsick [1 ]
机构
[1] Dongguk Univ Seoul, Dept Multimedia Engn, Seoul 04620, South Korea
关键词
cyber security; deep learning; malware classification; malware visualization; GENERATION ALGORITHM;
D O I
10.3390/math8030460
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The importance of cybersecurity has recently been increasing. A malware coder writes malware into normal executable files. A computer is more likely to be infected by malware when users have easy access to various executables. Malware is considered as the starting point for cyber-attacks; thus, the timely detection, classification and blocking of malware are important. Malware visualization is a method for detecting or classifying malware. A global image is visualized through binaries extracted from malware. The overall structure and behavior of malware are considered when global images are utilized. However, the visualization of obfuscated malware is tough, owing to the difficulties encountered when extracting local features. This paper proposes a merged image-based malware classification framework that includes local feature visualization, global image-based local feature visualization, and global and local image merging methods. This study introduces a fastText-based local feature visualization method: First, local features such as opcodes and API function names are extracted from the malware; second, important local features in each malware family are selected via the term frequency inverse document frequency algorithm; third, the fastText model embeds the selected local features; finally, the embedded local features are visualized through a normalization process. Malware classification based on the proposed method using the Microsoft Malware Classification Challenge dataset was experimentally verified. The accuracy of the proposed method was approximately 99.65%, which is 2.18% higher than that of another contemporary global image-based approach.
引用
收藏
页数:13
相关论文
共 13 条
  • [1] Image-based malware classification hybrid framework based on space-filling curves
    O'Shaughnessy, Stephen
    Sheridan, Stephen
    COMPUTERS & SECURITY, 2022, 116
  • [2] Generative Adversarial Network for Global Image-Based Local Image to Improve Malware Classification Using Convolutional Neural Network
    Jang, Sejun
    Li, Shuyu
    Sung, Yunsick
    APPLIED SCIENCES-BASEL, 2020, 10 (21): : 1 - 14
  • [3] A Fully Streaming Big Data Framework for Cyber Security Based on Optimized Deep Learning Algorithm
    Hussen, Noha
    Elghamrawy, Sally M.
    Salem, Mofreh
    El-Desouky, Ali I.
    IEEE ACCESS, 2023, 11 : 65675 - 65688
  • [4] Image-based Malware Family Detection: An Assessment between Feature Extraction and Classification Techniques
    Iadarola, Giacomo
    Martinelli, Fabio
    Mercaldo, Francesco
    Santone, Antonella
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY (IOTBDS), 2020, : 499 - 506
  • [5] A two-stage deep learning framework for image-based android malware detection and variant classification
    Yadav, Pooja
    Menon, Neeraj
    Ravi, Vinayakumar
    Vishvanathan, Sowmya
    Pham, Tuan D.
    COMPUTATIONAL INTELLIGENCE, 2022, 38 (05) : 1748 - 1771
  • [6] Enhancing Malware Detection Resilience: A U-Net GAN Denoising Framework for Image-Based Classification
    Dong, Huiyao
    Kotenko, Igor
    CMC-COMPUTERS MATERIALS & CONTINUA, 2025, 82 (03): : 4263 - 4285
  • [7] Feature Image-Based Automatic Modulation Classification Method Using CNN Algorithm
    Lee, Jung Ho
    Kim, Kwang-Yul
    Shin, Yoan
    2019 1ST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (ICAIIC 2019), 2019, : 560 - 563
  • [8] Healthcare data analysis by feature extraction and classification using deep learning with cloud based cyber security
    Qamar, Shamimul
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [9] Approximation-based energy-efficient cyber-secured image classification framework
    Rahman, M. A.
    Tunny, Salma Sultana
    Kayes, A. S. M.
    Cheng, Peng
    Huq, Aminul
    Rana, M. S.
    Islam, Md. Rashidul
    Tusher, Animesh Sarkar
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, 133
  • [10] MalSort: Lightweight and efficient image-based malware classification using masked self-supervised framework with Swin Transformer
    Wang, Fangwei
    Shi, Xipeng
    Yang, Fang
    Song, Ruixin
    Li, Qingru
    Tan, Zhiyuan
    Wang, Changguang
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2024, 83