HYDRA: A multimodal deep learning framework for malware classification

被引:78
|
作者
Gibert, Daniel [1 ]
Mateu, Carles [1 ]
Planes, Jordi [1 ]
机构
[1] Univ Lleida, Jaume II 69, Lleida, Spain
关键词
Malware classification; Machine learning; Deep learning; Feature fusion; Multimodal learning; ENTROPY;
D O I
10.1016/j.cose.2020.101873
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While traditional machine learning methods for malware detection largely depend on hand-designed features, which are based on experts' knowledge of the domain, end-to-end learning approaches take the raw executable as input, and try to learn a set of descriptive features from it. Although the latter might behave badly in problems where there are not many data available or where the dataset is imbalanced. In this paper we present HYDRA, a novel framework to address the task of malware detection and classification by combining various types of features to discover the relationships between distinct modalities. Our approach learns from various sources to maximize the benefits of multiple feature types to reflect the characteristics of malware executables. We propose a baseline system that consists of both hand-engineered and end-to-end components to combine the benefits of feature engineering and deep learning so that malware characteristics are effectively represented. An extensive analysis of state-of-the-art methods on the Microsoft Malware Classification Challenge benchmark shows that the proposed solution achieves comparable results to gradient boosting methods in the literature and higher yield in comparison with deep learning approaches. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] A Deep Learning Approach to the Malware Classification Problem using Autoencoders
    Pinto, Dhiego Ramos
    Duarte, Julio Cesar
    Sant'Ana, Ricardo
    PROCEEDINGS OF THE XV BRAZILIAN SYMPOSIUM ON INFORMATION SYSTEMS, SBSI 2019: Complexity on Modern Information Systems, 2019,
  • [42] End-to-end Multimodel Deep Learning for Malware Classification
    Snow, Elijah
    Alam, Mahbubul
    Glandon, Alexander
    Iftekharuddin, Khan
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [43] Deep-Hook: A trusted deep learning-based framework for unknown malware detection and classification in Linux cloud environments
    Landman, Tom
    Nissim, Nir
    NEURAL NETWORKS, 2021, 144 : 648 - 685
  • [44] A Multimodal Deep Learning Method for Android Malware Detection Using Various Features
    Kim, TaeGuen
    Kang, BooJoong
    Rho, Mina
    Sezer, Sakir
    Im, Eul Gyu
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (03) : 773 - 788
  • [45] A two-stage deep learning framework for image-based android malware detection and variant classification
    Yadav, Pooja
    Menon, Neeraj
    Ravi, Vinayakumar
    Vishvanathan, Sowmya
    Pham, Tuan D.
    COMPUTATIONAL INTELLIGENCE, 2022, 38 (05) : 1748 - 1771
  • [46] DeAF: A multimodal deep learning framework for disease prediction
    Li, Kangshun
    Chen, Can
    Cao, Wuteng
    Wang, Hui
    Han, Shuai
    Wang, Renjie
    Ye, Zaisheng
    Wu, Zhijie
    Wang, Wenxiang
    Cai, Leng
    Ding, Deyu
    Yuan, Zixu
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 156
  • [47] Multimodal skin lesion classification using deep learning
    Yap, Jordan
    Yolland, William
    Tschandl, Philipp
    EXPERIMENTAL DERMATOLOGY, 2018, 27 (11) : 1261 - 1267
  • [48] Multimodal Deep Learning Framework for Mental Disorder Recognition
    Zhang, Ziheng
    Lin, Weizhe
    Liu, Mingyu
    Mahmoud, Marwa
    2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 344 - 350
  • [49] Ensemble Deep Learning for Sustainable Multimodal UAV Classification
    McCoy, James
    Rawal, Atul
    Rawat, Danda B.
    Sadler, Brian M.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (12) : 15425 - 15434
  • [50] Multimodal deep learning for solar radio burst classification
    Ma, Lin
    Chen, Zhuo
    Xu, Long
    Yan, Yihua
    PATTERN RECOGNITION, 2017, 61 : 573 - 582