Reinforcement Learning Interpretation Methods: A Survey

被引：46

作者：

Alharin, Alnour ^{[1
]}

Doan, Thanh-Nam ^{[1
]}

Sartipi, Mina ^{[1
]}

机构：

[1] Univ Tennessee Chattanooga, Dept Comp Sci & Engn, Chattanooga, TN 37403 USA

来源：

IEEE ACCESS | 2020年 / 8卷

基金：

美国国家科学基金会;

关键词：

Mathematical model; Measurement; Learning (artificial intelligence); Machine learning; Markov processes; Medical services; Law; Reinforcement learning; machine learning; interpretability; interpretation; survey;

D O I：

10.1109/ACCESS.2020.3023394

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reinforcement Learning (RL) systems achieved outstanding performance in different domains such as Atari games, finance, healthcare, and self-driving cars. However, their black-box nature complicates their use, especially in critical applications such as healthcare. To solve this problem, researchers have proposed different approaches to interpret RL models. Some of these methods were adopted from machine learning, while others were designed specifically for RL. The main objective of this paper is to show and explain RL interpretation methods, the metrics used to classify them, and how these metrics were applied to understand the internal details of RL models. We reviewed papers that propose new RL interpretation methods, improve the old ones, or discuss the pros and cons of the existing methods.

引用

页码：171058 / 171077

页数：20

共 103 条

[31]

Dodson T, 2011, LECT NOTES ARTIF INT, V6992, P42, DOI 10.1007/978-3-642-24873-3_4

[32] Interpretable Explanations of Black Boxes by Meaningful Perturbation [J].

Fong, Ruth C. ;

Vedaldi, Andrea .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :3449-3457

[33]

Franceschi L., 2017, ACM Trans. Intell. Syst. Technol., V3, P1903, DOI DOI 10.1145/3298981

[34] Explaining Explanations: An Overview of Interpretability of Machine Learning [J].

Gilpin, Leilani H. ;

Bau, David ;

Yuan, Ben Z. ;

Bajwa, Ayesha ;

Specter, Michael ;

Kagal, Lalana .

2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, :80-89

[35]

Greydanus S., 2017, ARXIV PREPRINT ARXIV

[36] A Survey of Methods for Explaining Black Box Models [J].

Guidotti, Riccardo ;

Monreale, Anna ;

Ruggieri, Salvatore ;

Turin, Franco ;

Giannotti, Fosca ;

Pedreschi, Dino .

ACM COMPUTING SURVEYS, 2019, 51 (05)

[37]

Gupta P., 2019, ARXIV191212191

[38] Better - but is it good enough? On the need to consider both eco-efficiency and eco-effectiveness to gauge industrial sustainability [J].

Hauschild, Michael Z. .

22ND CIRP CONFERENCE ON LIFE CYCLE ENGINEERING, 2015, 29 :1-7

[39] Improving Robot Controller Transparency Through Autonomous Policy Explanation [J].

Hayes, Bradley ;

Shah, Julie A. .

PROCEEDINGS OF THE 2017 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION (HRI'17), 2017, :303-312

[40]

Hein D., 2017, ARXIV171204170

← 1 2 3 4 5 6 7 8 9 10 →