Network Defense Strategy Selection with Reinforcement Learning and Pareto Optimization

被引:7
作者
Sun, Yang [1 ]
Xiong, Wei [1 ]
Yao, Zhonghua [1 ]
Moniz, Krishna [2 ]
Zahir, Ahmed [2 ]
机构
[1] Equipment Acad, Sci & Technol Complex Elect Syst Simulat Lab, Beijing 101416, Peoples R China
[2] Beijing Univ Posts & Telecommun, Key Lab Trustworthy Distributed Comp & Serv, Beijing 100876, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2017年 / 7卷 / 11期
关键词
Pareto front; Q-learning; multi-objective optimization; network security; FLOW-CONTROL; MODEL;
D O I
10.3390/app7111138
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Improving network security is a difficult problem that requires balancing several goals, such as defense cost and need for network efficiency, in order to achieve proper results. In this paper, we devise method of modeling network attack in a zero-sum multi-objective game and attempt to find the best defense against such an attack. We combined Pareto optimization and Q-learning methods to determine the most harmful attacks and consequently to find the best defense against those attacks. The results should help network administrators in search of a hands-on method of improving network security.
引用
收藏
页数:24
相关论文
共 56 条
[1]   Wireless mesh networks: a survey [J].
Akyildiz, IF ;
Wang, XD ;
Wang, WL .
COMPUTER NETWORKS, 2005, 47 (04) :445-487
[2]  
Alese B., 2014, P WORLD C ENG 2014 L
[3]  
Alpcan T., 2010, Network Security: A Decision and Game-Theoretic Approach
[4]   FLOW-CONTROL USING THE THEORY OF ZERO-SUM MARKOV GAMES [J].
ALTMAN, E .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1994, 39 (04) :814-818
[5]  
[Anonymous], P 2006 WORKSH GAM TH
[6]  
[Anonymous], P 2013 IEEE S AD DYN
[7]  
[Anonymous], 2012, MULTIPLE OBJECTIVE D
[8]   The Internet of Things: A survey [J].
Atzori, Luigi ;
Iera, Antonio ;
Morabito, Giacomo .
COMPUTER NETWORKS, 2010, 54 (15) :2787-2805
[9]  
Barto AG, 2003, DISCRETE EVENT DYN S, V13, P343
[10]  
Blum A., 2006, P 25 ANN ACM S PRINC