CNN-based DGA Detection with High Coverage

被引:0
作者
Zhou, Shaofang [1 ]
Lin, Lanfen [1 ]
Yuan, Junkun [2 ]
Wang, Feng [3 ]
Ling, Zhaoting [1 ]
Cui, Jia [4 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
[2] Zhejiang Lab, Res Ctr Network Big Data Hlth Care, Hangzhou, Peoples R China
[3] Zhejiang Police Coll, Dept Comp & Informat Technol, Hangzhou, Peoples R China
[4] China Informat Technol Secur Evaluat Ctr, Beijing, Peoples R China
来源
2019 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI) | 2019年
关键词
domain generation algorithm; malicious domain names; deep learning; convolutional neural network;
D O I
10.1109/isi.2019.8823200
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Attackers often use domain generation algorithms (DGAs) to create various kinds of pseudorandom domains dynamically and select a part of them to connect with command and control servers, therefore it is important to automatically detect the algorithmically generated domains (AGDs). AGDs can be broken down into two categories: character-based domains and wordlist-based domains. Recently, methods based on machine learning and deep learning have been widely explored. However, much of the previous work perform well in detecting one kind of DGA families but poorly in classifying another kind. A general detection system which is applicable to both kinds of domains still remains a challenge. To address this problem, we propose a novel real-time detection method with high accuracy as well as high coverage. We first convey a domain name into a sequence of word-level or character-level components, then design a deep neural network based on temporal convolutional network to extract the implicit pattern and classify the domain into two or more categories. Our experimental results demonstrate that our model outperforms state-of-the-art approaches in both binary classification and multi-class classification, and shows a good performance in detecting different kinds of DGAs. Besides, the high training efficiency of our model makes it adjust to new malicious domains quickly.
引用
收藏
页码:62 / 67
页数:6
相关论文
共 23 条
[1]  
Antonakakis M., 2012, 21 USENIX SEC S, V12, P491
[2]  
Bai Shaojie, 2018, Universal language model fine-tuning for text classification
[3]   Post-natal induction of PGC-1α protects against severe muscle dystrophy independently of utrophin [J].
Chan, Mun Chun ;
Rowe, Glenn C. ;
Raghuram, Srilatha ;
Patten, Ian S. ;
Farrell, Caitlin ;
Arany, Zolt .
SKELETAL MUSCLE, 2014, 4
[4]  
Choudhary C., 2019, COMPUT COMMUN
[5]  
Curtin R. R., 2018, ARXIV181002023CSCR
[6]  
Koh J. J., 2018, 2018 IEEE INT C BIG
[7]  
Lison P., 2017, ARXIV170907102CSCR
[8]  
Mac H., 2017, P 8 INT S INF COMM T
[9]  
Mowbray M., 2014, 2014 IEEE INT S SOFT
[10]  
Pereira M., 2018, LNCS, V11050