Building Visual Malware Dataset using VirusShare Data and Comparing Machine Learning Baseline Model to CoAtNet for Malware Classification

Author
Bruzzese, Roberto R. [1]
Affiliation
[1] Sapienza Univ Rome, Rome, Italy
Source
2024 16TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, ICMLC 2024 | 2024
Keywords
Malware; Machine Learning; Visual Images; CoAtNet
DOI
10.1145/3651671.3651735
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This work takes inspiration from the CoAtNet paper by Zihang Dai, Hanxiao Liu, Quoc V. Le, and Mingxing Tan of the Google Research Brain Team. That work showed that the strengths of convolutional and transformer architectures can be combined by unifying convnets and self-attention in a single machine learning model. We apply CoAtNet to a visual dataset of malware images and compare its performance with that of a baseline CNN model. This requires a dataset of appropriate size and format, which in turn means finding or generating a visual dataset of malware images suitable for measuring the accuracy of the constructed model. As will be seen, creating a new dataset is preferred over searching for an existing one. Although the visual approach has been extensively tested in recent years, there is still a need for data customised to the model under examination. The work described in this paper can serve as a guide to constructing a balanced, appropriately sized visual dataset of malware images.
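The abstract does not say how binaries are turned into images, so the following is only a minimal sketch of one commonly used approach, not the paper's method: each byte of a sample is treated as one grayscale pixel (0 to 255) and the byte stream is wrapped into rows of a fixed width. The function name `bytes_to_grayscale`, the default width of 256, and the zero padding of the final row are all assumptions made for illustration.

```python
from typing import List


def bytes_to_grayscale(data: bytes, width: int = 256) -> List[List[int]]:
    """Wrap a raw byte stream into rows of grayscale pixel values.

    Hypothetical helper: one byte becomes one pixel intensity (0-255).
    The last row is padded with zeros so every row has `width` pixels.
    """
    if width <= 0:
        raise ValueError("width must be positive")
    if not data:
        return []

    # Pad with zero bytes so the length is an exact multiple of the width.
    remainder = len(data) % width
    if remainder:
        data = data + bytes(width - remainder)

    # Slice the padded stream into consecutive rows of `width` pixels.
    return [list(data[i:i + width]) for i in range(0, len(data), width)]


if __name__ == "__main__":
    # Tiny stand-in for a binary sample; a real sample would be read from disk.
    rows = bytes_to_grayscale(b"\x00\xff\x10", width=2)
    print(rows)
```

The resulting rows can be handed to any image library to save a picture. Fixing the width (and later resizing to a common resolution) is one way to give every sample the same input shape for a CNN or CoAtNet classifier, though the width actually used in the paper is not stated here.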
Pages: 185-193
Number of pages: 9