Query-based Targeted Action-Space Adversarial Policies on Deep Reinforcement Learning Agents

被引:10
|
作者
Lee, Xian Yeow [1 ]
Esfandiari, Yasaman [1 ]
Tan, Kai Liang [1 ]
Sarkar, Soumik [1 ]
机构
[1] Iowa State Univ, Dept Mech Engn, Ames, IA 50011 USA
来源
ICCPS'21: PROCEEDINGS OF THE 2021 ACM/IEEE 12TH INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS (WITH CPS-IOT WEEK 2021) | 2021年
关键词
Deep Reinforcement Learning; Adversarial Attacks; Black-box Attacks; Adversarial Policies; Adversarial Training;
D O I
10.1145/3450267.3450537
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Advances in computing resources have resulted in the increasing complexity of cyber-physical systems (CPS). As the complexity of CPS evolved, the focus has shifted to deep reinforcement learning-based (DRL) methods for control of these systems. This is in part due to: 1) difficulty of obtaining accurate models of complex CPS for traditional control 2) DRL algorithms' capability of learning control policies from data which can be adapted and scaled to real, complex CPS. To securely deploy DRL in production, it is essential to examine the weaknesses of DRL-based controllers (policies) towards malicious attacks from all angles. This work investigates targeted attacks in the action-space domain (actuation attacks), which perturbs the outputs of a controller. We show that a black-box attack model that generates perturbations with respect to an adversarial goal can be formulated as another reinforcement learning problem. Thus, an adversarial policy can be trained using conventional DRL methods. Experimental results showed that adversarial policies which only observe the nominal policy's output generate stronger attacks than adversarial policies that observe the nominal policy's input and output. Further analysis revealed that nominal policies whose outputs are frequently at the boundaries of the action space are naturally more robust towards adversarial policies. Lastly, we propose the use of adversarial training with transfer learning to induce robust behaviors into the nominal policy, which decreases the rate of successful targeted attacks by approximately 50%.
引用
收藏
页码:87 / 97
页数:11
相关论文
共 50 条
  • [1] Adversarial Attacks on Multiagent Deep Reinforcement Learning Models in Continuous Action Space
    Zhou, Ziyuan
    Liu, Guanjun
    Guo, Weiran
    Zhou, MengChu
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (12): : 7633 - 7646
  • [2] A Generalized Deep Reinforcement Learning Model for Distribution Network Reconfiguration with Power Flow-Based Action-Space Sampling
    Gholizadeh, Nastaran
    Musilek, Petr
    ENERGIES, 2024, 17 (20)
  • [3] RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies
    Behzadan, Vahid
    Hsu, William
    COMPUTER SAFETY, RELIABILITY, AND SECURITY, SAFECOMP 2019, 2019, 11699 : 314 - 325
  • [4] FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks
    Tekgul, Buse G. A.
    Asokan, N.
    39TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, ACSAC 2023, 2023, : 492 - 505
  • [5] Adversarial Jamming Attacks on Deep Reinforcement Learning Based Dynamic Multichannel Access
    Zhong, Chen
    Wang, Feng
    Gursoy, M. Cenk
    Velipasalar, Senem
    2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
  • [6] Deep Reinforcement Learning with Parameterized Action Space for Object Detection
    Wu, Zheng
    Khan, Naimul Mefraz
    Gao, Lei
    Guan, Ling
    2018 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2018), 2018, : 101 - 104
  • [7] Adversarial Attacks in a Deep Reinforcement Learning based Cluster Scheduler
    Zhang, Shaojun
    Wang, Chen
    Zomaya, Albert Y.
    2020 IEEE 28TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2020), 2020, : 1 - 8
  • [8] Adversarial Attack for Deep Reinforcement Learning Based Demand Response
    Wan, Zhiqiang
    Li, Hepeng
    Shuai, Hang
    Sun, Yan
    He, Haibo
    2021 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2021,
  • [9] Proxemics-based deep reinforcement learning for robot navigation in continuous action space
    Cimurs R.
    Suh I.-H.
    Journal of Institute of Control, Robotics and Systems, 2020, 26 (03) : 168 - 176
  • [10] Deep Reinforcement Learning-Based Adversarial Attack and Defense in Industrial Control Systems
    Kim, Mun-Suk
    MATHEMATICS, 2024, 12 (24)