Building Visual Malware Dataset using VirusShare Data and Comparing Machine Learning Baseline Model to CoAtNet for Malware Classification

被引:0
|
作者
Bruzzese, Roberto R. [1 ]
机构
[1] Sapienza Univ Rome, Rome, Italy
来源
2024 16TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, ICMLC 2024 | 2024年
关键词
Malware; Machine Learning; Visual Images; CoAtNet;
D O I
10.1145/3651671.3651735
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The present work takes inspiration from the work of Zihang Dai, Hanxiao Liu, Quoc V. Le, Mingxing Tan at Google Research, Brain Team about CoAtNet. In that work it was showed that it is possible to combine the strengths from both convolution and transformer architectures, by unifying convnets and self-attention into a machine learning model. We want to apply the CoAtNet to a visual dataset of malware images and compare its performances to a baseline CNN model. For this reason we need a data set of appropriate size and format. From these needs triggers the requirement to find or generate a visual dataset of the malware images capable to measure the accuracy of the constructed model. As will be seen, the creation of a new dataset will be preferred to the search for an existing dataset. Although the visual approach has already been extensively tested in recent years, there is still a need for more customised data for the model under examination. The work described in this paper can serve as a guide to a balanced and dimensioned construction of an optimal malware visual image dataset.
引用
收藏
页码:185 / 193
页数:9
相关论文
共 50 条
  • [21] Android malware classification using optimum feature selection and ensemble machine learning
    Islam R.
    Sayed M.I.
    Saha S.
    Hossain M.J.
    Masud M.A.
    Internet of Things and Cyber-Physical Systems, 2023, 3 : 100 - 111
  • [22] Machine learning based fileless malware traffic classification using image visualization
    Demmese, Fikirte Ayalke
    Neupane, Ajaya
    Khorsandroo, Sajad
    Wang, May
    Roy, Kaushik
    Fu, Yu
    CYBERSECURITY, 2023, 6 (01)
  • [23] Combining traditional machine learning and anomaly detection for several imbalanced Android malware dataset's classification
    Gan, Yiwei
    Han, Qian
    Gao, Yumeng
    PROCEEDINGS OF 2022 7TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES, ICMLT 2022, 2022, : 74 - 80
  • [24] Evaluating the effort of building a Machine Learning model for malware detection from ground zero
    Anton, Dan-Gabriel
    2022 24TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, SYNASC, 2022, : 365 - 370
  • [25] Comparison of Machine Learning Algorithms for Malware Detection Using EDGE-IIoTSET Dataset in IoT
    Alshehri, Jawaher
    Alhamed, Almaha
    Frikha, Mounir
    Rahman, M. M. Hafizur
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (01) : 1225 - 1238
  • [26] Malware Detection and Classification in Android Application Using Simhash-Based Feature Extraction and Machine Learning
    Al-Kahla, Wafaa
    Taqieddin, Eyad
    Shatnawi, Ahmed S.
    Al-Ouran, Rami
    IEEE ACCESS, 2024, 12 : 174255 - 174273
  • [27] Malware Classification of Portable Executables using Tree-Based Ensemble Machine Learning
    Atluri, Venkata
    2019 IEEE SOUTHEASTCON, 2019,
  • [28] Android Malware Classification Using Machine Learning and Bio-Inspired Optimisation Algorithms
    Pye, Jack
    Issac, Biju
    Aslam, Nauman
    Rafiq, Husnain
    2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 1777 - 1782
  • [29] Machine learning based Malware Classification for Android Applications using Multimodal Image Representations
    Kumar, Ajit
    Sagar, Pramod K.
    Kuppusamy, K. S.
    Aghila, G.
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO'16), 2016,
  • [30] A Novel Image-Based Malware Classification Model Using Deep Learning
    Jiang, Yongkang
    Li, Shenghong
    Wu, Yue
    Zou, Futai
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 150 - 161