PEPC: A Deep Parallel Convolutional Neural Network Model with Pre-trained Embeddings for DGA Detection

Cited by: 3
Authors
Huang, Weiqing [1 ,2 ]
Zong, Yangyang [1 ,2 ]
Shi, Zhixin [1 ]
Wang, Leiqi [1 ,2 ]
Liu, Pengcheng [1 ,2 ]
Affiliations
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
Source
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2022
Keywords
cybersecurity; domain generation algorithm; pre-trained embeddings; convolution neural network; deep learning;
DOI
10.1109/IJCNN55064.2022.9892081
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Discovering the domain generation algorithms (DGAs) used to build the command and control (C&C) infrastructure of botnets is crucial for recognizing botnets. Recent studies in DGA detection benefit from deep learning models such as convolutional neural networks (CNNs) and long short-term memory networks (LSTMs). However, these studies need massive amounts of supervised data to train their models, and obtaining enough labeled samples is consistently time-consuming and labor-intensive. In this paper, we propose a deep learning model, called PEPC, to detect and classify DGA domain names with only a small dataset. PEPC consists of two modules: (1) the pre-trained embeddings (PTE) module, which converts domain names into numeric vectors; and (2) the deep parallel convolutional neural networks (DPCNN) module, which extracts features from those vectors for prediction. Compared with five common deep learning-based DGA detection approaches, our model yields an average improvement of 10 F1 points while requiring just 30 training samples per class. Notably, PTE helps models achieve better detection and classification performance on small training sets.
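The two-stage shape described in the abstract (embed each character of a domain name, then run several convolutions in parallel over the embedded sequence) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no kernel widths, embedding dimension, network depth, or pre-training procedure, so the alphabet, `max_len`, `dim`, the kernel widths (2, 3, 4), and the randomly initialised table standing in for pre-trained embeddings are all assumptions made for illustration. The sketch uses a single convolutional layer per branch, not a deep stack.

```python
import random

# Hypothetical vocabulary; id 0 is reserved for padding and unknown characters.
ALPHABET = "abcdefghijklmnopqrstuvwxyz0123456789-."
CHAR_TO_ID = {ch: i + 1 for i, ch in enumerate(ALPHABET)}


def encode(domain, max_len=32):
    """Map a domain name to a fixed-length list of character ids."""
    ids = [CHAR_TO_ID.get(ch, 0) for ch in domain.lower()[:max_len]]
    return ids + [0] * (max_len - len(ids))


def make_embeddings(dim=8, seed=0):
    """Stand-in for a pre-trained table (assumption): random vectors, zero padding row."""
    rng = random.Random(seed)
    table = [[0.0] * dim]
    table += [[rng.uniform(-1, 1) for _ in range(dim)] for _ in ALPHABET]
    return table


def make_branches(widths=(2, 3, 4), filters_per_branch=4, dim=8, seed=1):
    """One branch per kernel width; each filter is a (width x dim) weight matrix."""
    rng = random.Random(seed)
    return [
        [[[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(w)]
         for _ in range(filters_per_branch)]
        for w in widths
    ]


def conv_max_pool(vectors, kernel):
    """Slide one 1D filter over the sequence, apply ReLU, then global max pooling."""
    width = len(kernel)
    best = 0.0  # ReLU floor: the pooled activation is never negative
    for start in range(len(vectors) - width + 1):
        total = sum(
            w * x
            for row, vec in zip(kernel, vectors[start:start + width])
            for w, x in zip(row, vec)
        )
        best = max(best, total)
    return best


def parallel_features(domain, table, branches):
    """Feed the same embedded input to every branch and concatenate the pooled outputs."""
    vectors = [table[i] for i in encode(domain)]
    return [conv_max_pool(vectors, k) for filters in branches for k in filters]


table = make_embeddings()
branches = make_branches()
features = parallel_features("example.com", table, branches)
print(len(features))  # 3 branches x 4 filters = 12
```

In a trained model the concatenated feature vector would feed a classifier layer, and both the embedding table and the filters would be learned, not sampled at random.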
Pages: 8
Related papers
50 in total
  • [31] CONVOLUTIONAL NEURAL NETWORK PRE-TRAINED WITH PROJECTION MATRICES ON LINEAR DISCRIMINANT ANALYSIS
    Fukuda, Takashi
    Ichikawa, Osamu
    Tachibana, Ryuki
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5345 - 5349
  • [32] SAR Image Despeckling Using Pre-trained Convolutional Neural Network Models
    Yang, Xiangli
    Denis, Loic
    Tupin, Florence
    Yang, Wen
    2019 JOINT URBAN REMOTE SENSING EVENT (JURSE), 2019,
  • [33] Food Detection by Fine-Tuning Pre-trained Convolutional Neural Network Using Noisy Labels
    Alshomrani, Shroog
    Aljoudi, Lina
    Aljabri, Banan
    Al-Shareef, Sarah
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (07): : 182 - 190
  • [34] Object Recognition using Template Matching and Pre-trained convolutional neural network
    Abbas, Qaisar
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2020, 20 (08): : 69 - 79
  • [35] A pre-trained convolutional neural network based method for thyroid nodule diagnosis
    Ma, Jinlian
    Wu, Fa
    Zhu, Jiang
    Xu, Dong
    Kong, Dexing
    ULTRASONICS, 2017, 73 : 221 - 230
  • [36] Deep Convolutional Neural Networks for DGA Detection
    Catania, Carlos
    Garcia, Sebastian
    Torres, Pablo
    COMPUTER SCIENCE - CACIC 2018, 2019, 995 : 327 - 340
  • [37] Diagnosis of Tomato Plant Diseases Using Pre-trained Architectures and A Proposed Convolutional Neural Network Model
    Koc, Dilara Gerdan
    Koc, Caner
    Vatandas, Mustafa
    JOURNAL OF AGRICULTURAL SCIENCES-TARIM BILIMLERI DERGISI, 2023, 29 (02): : 627 - 638
  • [38] Exploiting Pre-Trained Network Embeddings for Recommendations in Social Networks
    Guo, Lei
    Wen, Yu-Fei
    Wang, Xin-Hua
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2018, 33 (04) : 682 - 696
  • [39] Exploiting Pre-Trained Network Embeddings for Recommendations in Social Networks
    Lei Guo
    Yu-Fei Wen
    Xin-Hua Wang
    Journal of Computer Science and Technology, 2018, 33 : 682 - 696
  • [40] Pre-Trained Deep Convolutional Neural Network for Clostridioides Difficile Bacteria Cytotoxicity Classification Based on Fluorescence Images
    Brodzicki, Andrzej
    Jaworek-Korjakowska, Joanna
    Kleczek, Pawel
    Garland, Megan
    Bogyo, Matthew
    SENSORS, 2020, 20 (23) : 1 - 17