GAIL-PT: An intelligent penetration testing framework with generative adversarial imitation learning

被引:26
作者
Chen, Jinyin [1 ,2 ]
Hu, Shulong [1 ,2 ]
Zheng, Haibin [1 ,2 ]
Xing, Changyou [3 ]
Zhang, Guomin [3 ]
机构
[1] Zhejiang Univ Technol, Inst Cyberspace Secur, Hangzhou, Peoples R China
[2] Zhejiang Univ Technol, Coll Informat Engn, Hangzhou, Peoples R China
[3] Army Engn Univ PLA, Coll Command & Control Engn, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Penetration testing; Deep reinforcement learning; Generative adversarial imitation learning; Metasploitable2; DeepExploit; Q-learning;
D O I
10.1016/j.cose.2022.103055
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Penetration testing (PT) is an efficient tool for network testing and vulnerability mining by simulating the hackers' attacks to obtain valuable information applied in operating and database systems. Most of the traditional manual solutions are strongly relying on the domain knowledge of human experts with high penetration costs. Therefore, solutions based on the artificial intelligent algorithm such as reinforcement learning (RL) and deep reinforcement learning (DRL), with less time-consuming and lower labor costs, be-come a great solution to address the challenge. However, there are still a few challenges for RL/DRL-based PT in real penetration scenarios, such as the large dimension size of the agent's discrete action space usu-ally causing difficulties in convergence. To address the above issue, this paper proposes a novel frame-work named G enerative A dversarial I mitation L earning based intelligent P enetration T esting (GAIL-PT), which utilizes expert knowledge base and GAIL network to guide the policy generation of RL/DRL agents with lower costs. Specifically, we first construct the expert knowledge bases by collecting state-action pairs from the successful exploitations of pre-trained RL/DRL models. Secondly, we feed the expert knowledge bases generated by different RL/DRL models online into the discriminator of GAIL-PT to guide its training process. Besides, we integrate the losses of the generator and the discriminator in GAIL-PT to optimize the overall objective and use the discriminator's discounted rewards for policy generation. The exten-sive experiments conducted on the practical target hosts and simulated network scenarios demonstrate that GAIL-PT achieves outstanding performance, and outperforms the state-of-art method DeepExploit in exploiting Metasploitable2 and Q-learning in different scale networks. It also verified that GAIL-PT is a general leading framework suitable for RL/DRL-based methods. The code of GAIL-PT is open-sourced at https://github.com/Shulong98/GAIL-PT// .(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:20
相关论文
共 58 条
[1]   Why attacking systems is a good idea [J].
Arce, I ;
McGraw, G .
IEEE SECURITY & PRIVACY, 2004, 2 (04) :17-19
[2]   Software penetration testing [J].
Arkin, B ;
Stender, S ;
McGraw, G .
IEEE SECURITY & PRIVACY, 2005, 3 (01) :84-87
[3]   A survey of inverse reinforcement learning: Challenges, methods and progress [J].
Arora, Saurabh ;
Doshi, Prashant .
ARTIFICIAL INTELLIGENCE, 2021, 297 (297)
[4]  
Berner C., 2019, arXiv
[5]   Machine Learning Cyberattack and Defense Strategies [J].
Bland, John A. ;
Petty, Mikel D. ;
Whitaker, Tymaine S. ;
Maxwell, Katia P. ;
Cantrell, Walter Alan .
COMPUTERS & SECURITY, 2020, 92
[6]  
Brockman G, 2016, Arxiv, DOI arXiv:1606.01540
[7]   Multi-Agent Reinforcement Learning: A Review of Challenges and Applications [J].
Canese, Lorenzo ;
Cardarilli, Gian Carlo ;
Di Nunzio, Luca ;
Fazzolari, Rocco ;
Giardino, Daniele ;
Re, Marco ;
Spano, Sergio .
APPLIED SCIENCES-BASEL, 2021, 11 (11)
[8]  
Dulac-Arnold G, 2016, Arxiv, DOI arXiv:1512.07679
[9]   A Smart Agent Design for Cyber Security Based on Honeypot and Machine Learning [J].
El Kamel, Nadiya ;
Eddabbah, Mohamed ;
Lmoumen, Youssef ;
Touahni, Raja .
SECURITY AND COMMUNICATION NETWORKS, 2020, 2020
[10]   Adversarial Reinforcement Learning in a Cyber Security Simulation [J].
Elderman, Richard ;
Pater, Leon J. J. ;
Thie, Albert S. ;
Drugan, Madalina M. ;
Wiering, Marco A. .
ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2017, :559-566