Multi-attributed heterogeneous graph convolutional network for bot detection

被引:65
作者
Zhao, Jun [1 ,2 ]
Liu, Xudong [1 ,2 ]
Yan, Qiben [3 ]
Li, Bo [1 ,2 ]
Shao, Minglai [1 ,2 ]
Peng, Hao [1 ,2 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
[2] Beihang Univ, Beijing Adv Innovat Ctr Big Data & Brain Comp, Beijing, Peoples R China
[3] Michigan State Univ, Comp Sci & Engn, E Lansing, MI 48824 USA
基金
美国国家科学基金会; 国家重点研发计划;
关键词
Botnet detection; Bot behavioral model; Multi-attributed graph; GCN;
D O I
10.1016/j.ins.2020.03.113
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bot detection is a fundamental and crucial task for tracing and mitigating cyber threats in the Internet. This paper aims to address two major limitations of current bot detection systems. First, existing flow-based bot detection approaches ignore structural information of botnets, which lead to false detection. Second, they cannot identify the interactive behavioral patterns among heterogeneous botnet objects. In this paper, we propose a novel bot detection framework, namely Bot-AHGCN, which models fine-grained network flow objects (e.g., IP, response) as a multi-attributed heterogeneous graph and transforms bot detection problem into a semi-supervised node classification task on the graph. Particularly, we first build a multi-attributed heterogeneous information network (AHIN) to model the interdependent relationships among botnet objects. Second, we present a weight-learning based node embedding method, which learns the interactive behavioral patterns among hots and integrates them into weighted similarity graphs. Finally, we perform graph convolution on the learned similarity graphs to characterize more comprehensive and discriminative features of hots, and feed them into a forward neural network to identify hots. The overall experimental results on two real-world datasets confirm that Bot-AHGCN outperforms the existing state-of-the-art approaches, and presents better interpretability by introducing meaningful meta-paths and meta-graphs. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:380 / 393
页数:14
相关论文
共 45 条
[1]  
Abou Daya A, 2019, 2019 IFIP/IEEE SYMPOSIUM ON INTEGRATED NETWORK AND SERVICE MANAGEMENT (IM), P144
[2]  
Aggarwal C., 2012, ACM SIGKDD EXPLORATI, V14, P49, DOI [DOI 10.1145/2481244.2481252, 10.1145/2481244.2481252]
[3]  
Al-Duwairi Basheer, 2010, Proceedings of the Fifth International Conference on Internet Monitoring and Protection (ICIMP 2010), P16, DOI 10.1109/ICIMP.2010.11
[4]  
Albanese M., 2019, Adaptive Cyber Defenses for Botnet Detection and Mitigation, P156
[5]   A survey of botnet detection based on DNS [J].
Alieyan, Kamal ;
ALmomani, Ammar ;
Manasrah, Ahmad ;
Kadhum, Mohammed M. .
NEURAL COMPUTING & APPLICATIONS, 2017, 28 (07) :1541-1558
[6]  
Amini Pedram, 2014, Adv. Comput. Sci.: Int. J., V3, P139
[7]  
Antonakakis M, 2017, PROCEEDINGS OF THE 26TH USENIX SECURITY SYMPOSIUM (USENIX SECURITY '17), P1093
[8]  
Arshad S., 2011, 2011 IEEE International Conference on Computer Applications and Industrial Electronics (ICCAIE 2011), P564, DOI 10.1109/ICCAIE.2011.6162198
[9]   Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks [J].
Chiang, Wei-Lin ;
Liu, Xuanqing ;
Si, Si ;
Li, Yang ;
Bengio, Samy ;
Hsieh, Cho-Jui .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :257-266
[10]   Botnet detection using graph-based feature clustering [J].
Chowdhury S. ;
Khanzadeh M. ;
Akula R. ;
Zhang F. ;
Zhang S. ;
Medal H. ;
Marufuzzaman M. ;
Bian L. .
Journal of Big Data, 2017, 4 (01)