Learning key steps to attack deep reinforcement learning agents

被引：7

作者：

Yu, Chien-Min ^{[1
]}

Chen, Ming-Hsin ^{[1
]}

Lin, Hsuan-Tien ^{[1
]}

机构：

[1] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan

来源：

MACHINE LEARNING | 2023年 / 112卷 / 05期

关键词：

Deep learning; Reinforcement learning; Adversarial attacks; Robustness; ENVIRONMENT; GO;

D O I：

10.1007/s10994-023-06318-9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep reinforcement learning agents are vulnerable to adversarial attacks. In particular, recent studies have shown that attacking a few key steps can effectively decrease the agent's cumulative reward. However, all existing attacking methods define those key steps with human-designed heuristics, and it is not clear how more effective key steps can be identified. This paper introduces a novel reinforcement learning framework that learns key steps through interacting with the agent. The proposed framework does not require any human heuristics nor knowledge, and can be flexibly coupled with any white-box or black-box adversarial attack scenarios. Experiments on benchmark Atari games across different scenarios demonstrate that the proposed framework is superior to existing methods for identifying effective key steps. The results highlight the weakness of RL agents even under budgeted attacks.

引用

页码：1499 / 1522

页数：24

共 43 条

[1]

Behzadan Vahid, 2017, Machine Learning and Data Mining in Pattern Recognition. 13th International Conference, MLDM 2017. Proceedings: LNAI 10358, P262, DOI 10.1007/978-3-319-62416-7_19

[2]

Behzadan V., 2018, PREPRINT

[3]

Behzadan V., 2017, PREPRINT

[4] The Arcade Learning Environment: An Evaluation Platform for General Agents [J].

Bellemare, Marc G. ;

Naddaf, Yavar ;

Veness, Joel ;

Bowling, Michael .

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2013, 47 :253-279

[5]

Biggio Battista, 2013, ECML, P387, DOI [10.1007/978-3-642-40994-3_25, DOI 10.1007/978-3-642-40994-3_25]

[6]

Boyd SP., 2004, Convex optimization, DOI 10.1017/CBO9780511804441

[7] Towards Evaluating the Robustness of Neural Networks [J].

Carlini, Nicholas ;

Wagner, David .

2017 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP), 2017, :39-57

[8]

Castro P.S., 2018, PREPRINT

[9]

Chen PY, 2017, PROCEEDINGS OF THE 10TH ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, AISEC 2017, P15, DOI 10.1145/3128572.3140448

[10] Optimization of anemia treatment in hemodialysis patients via reinforcement learning [J].

Escandell-Montero, Pablo ;

Chermisi, Milena ;

Martinez-Martinez, Jose M. ;

Gomez-Sanchis, Juan ;

Barbieri, Carlo ;

Soria-Olivas, Emilio ;

Mari, Flavio ;

Vila-Frances, Joan ;

Stopper, Andrea ;

Gatti, Emanuele ;

Martin-Guerrero, Jose D. .

ARTIFICIAL INTELLIGENCE IN MEDICINE, 2014, 62 (01) :47-60

← 1 2 3 4 5 →