Reinforcement Learning Interpretation Methods: A Survey

被引:46
作者
Alharin, Alnour [1 ]
Doan, Thanh-Nam [1 ]
Sartipi, Mina [1 ]
机构
[1] Univ Tennessee Chattanooga, Dept Comp Sci & Engn, Chattanooga, TN 37403 USA
基金
美国国家科学基金会;
关键词
Mathematical model; Measurement; Learning (artificial intelligence); Machine learning; Markov processes; Medical services; Law; Reinforcement learning; machine learning; interpretability; interpretation; survey;
D O I
10.1109/ACCESS.2020.3023394
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement Learning (RL) systems achieved outstanding performance in different domains such as Atari games, finance, healthcare, and self-driving cars. However, their black-box nature complicates their use, especially in critical applications such as healthcare. To solve this problem, researchers have proposed different approaches to interpret RL models. Some of these methods were adopted from machine learning, while others were designed specifically for RL. The main objective of this paper is to show and explain RL interpretation methods, the metrics used to classify them, and how these metrics were applied to understand the internal details of RL models. We reviewed papers that propose new RL interpretation methods, improve the old ones, or discuss the pros and cons of the existing methods.
引用
收藏
页码:171058 / 171077
页数:20
相关论文
共 103 条
[1]   Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI) [J].
Adadi, Amina ;
Berrada, Mohammed .
IEEE ACCESS, 2018, 6 :52138-52160
[2]  
Adebayo J, 2018, ADV NEUR IN, V31
[3]  
Ahmad MA, 2018, IEEE INT CONF HEALT, P447, DOI [10.1109/ICHI.2018.00095, 10.1145/3233547.3233667]
[4]  
Akhtar E., 2017, PRACTICAL REINFORCEM
[5]  
Akrour R., REINFORCEMENT LEARNI
[6]  
Amir D, 2018, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), P1168
[7]  
Amir O, 2018, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), P1203
[8]  
Annasamy RM, 2019, AAAI CONF ARTIF INTE, P4561
[9]  
[Anonymous], 2015, arXiv preprint arXiv:1512.01693
[10]  
[Anonymous], 2014, P INT C LEARN REPR I