HYDRA: A multimodal deep learning framework for malware classification

Cited by: 78
Authors
Gibert, Daniel [1]
Mateu, Carles [1]
Planes, Jordi [1]
Affiliations
[1] Univ Lleida, Jaume II 69, Lleida, Spain
Keywords
Malware classification; Machine learning; Deep learning; Feature fusion; Multimodal learning; ENTROPY;
DOI
10.1016/j.cose.2020.101873
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Traditional machine learning methods for malware detection largely depend on hand-designed features based on experts' domain knowledge, whereas end-to-end learning approaches take the raw executable as input and try to learn a set of descriptive features from it. The latter, however, may perform poorly on problems where little data is available or where the dataset is imbalanced. In this paper we present HYDRA, a novel framework that addresses the task of malware detection and classification by combining various types of features to discover the relationships between distinct modalities. Our approach learns from various sources to maximize the benefits of multiple feature types in reflecting the characteristics of malware executables. We propose a baseline system that consists of both hand-engineered and end-to-end components, combining the benefits of feature engineering and deep learning so that malware characteristics are effectively represented. An extensive analysis of state-of-the-art methods on the Microsoft Malware Classification Challenge benchmark shows that the proposed solution achieves results comparable to gradient boosting methods in the literature and a higher yield than deep learning approaches. (C) 2020 Elsevier Ltd. All rights reserved.
Pages: 16
Related papers
50 in total
  • [21] Parallel Deep Learning with a hybrid BP-PSO framework for feature extraction and malware classification
    Al-Andoli, Mohammed Nasser
    Tan, Shing Chiang
    Sim, Kok Swee
    Lim, Chee Peng
    Goh, Pey Yun
    APPLIED SOFT COMPUTING, 2022, 131
  • [22] DTMIC: Deep transfer learning for malware image classification
    Kumar, Sanjeev
    Janet, B.
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2022, 64
  • [23] MCSMGS: Malware Classification Model Based on Deep Learning
    Meng, Xi
    Shan, Zhen
    Liu, Fudong
    Zhao, Bingling
    Han, Jin
    Wang, Jing
    Wang, Hongyan
    2017 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC), 2017, : 272 - 275
  • [24] An Effective Ensemble Deep Learning Framework for Malware Detection
    Dinh Viet Sang
    Dang Manh Cuong
    Le Tran Bao Cuong
    PROCEEDINGS OF THE NINTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2018), 2018, : 192 - 199
  • [25] Deep Learning Applied to Imbalanced Malware Datasets Classification
    Salas, Marcelo Palma
    de Geus, Paulo Licio
    JOURNAL OF INTERNET SERVICES AND APPLICATIONS, 2024, 15 (01) : 342 - 359
  • [26] Deep Learning Model with Sequential Features for Malware Classification
    Wu, Xuan
    Song, Yafei
    Hou, Xiaoyi
    Ma, Zexuan
    Chen, Chen
    APPLIED SCIENCES-BASEL, 2022, 12 (19)
  • [27] Efficient Characterization and Classification of Malware Using Deep Learning
    De la Rosa, Leonardo
    Kilgallon, Sean
    Vanderbruggen, Tristan
    Cavazos, John
    2018 RESILIENCE WEEK (RWS), 2018, : 77 - 83
  • [28] MEMTD: Encrypted Malware Traffic Detection Using Multimodal Deep Learning
    Zhang, Xiaotian
    Lu, Jintian
    Sun, Jiakun
    Xiao, Ruizhi
    Jin, Shuyuan
    WEB ENGINEERING (ICWE 2022), 2022, 13362 : 357 - 372
  • [29] A Novel Malware Analysis Framework for Malware Detection and Classification using Machine Learning Approach
    Sethi, Kamalakanta
    Chaudhary, Shankar Kumar
    Tripathy, Bata Krishan
    Bera, Padmalochan
    ICDCN'18: PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING, 2018
  • [30] A deep semantic framework for multimodal representation learning
    Wang, Cheng
    Yang, Haojin
    Meinel, Christoph
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (15) : 9255 - 9276