Explainable reinforcement learning (XRL): a systematic literature review and taxonomy

被引:6
作者
Bekkemoen, Yanzhe [1 ]
机构
[1] Norwegian Univ Sci & Technol, Dept Comp Sci, NO-7491 Trondheim, Norway
关键词
Reinforcement learning; Explainable artificial intelligence; Interpretability; Explainability; Explanation; INTERPRETABLE POLICIES; DECISION-MAKING; VISUAL ANALYSIS; EXPLANATIONS; ATARI; GO; CLASSIFICATION; APPROXIMATION; FEATURES; MODELS;
D O I
10.1007/s10994-023-06479-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, reinforcement learning (RL) systems have shown impressive performance and remarkable achievements. Many achievements can be attributed to combining RL with deep learning. However, those systems lack explainability, which refers to our understanding of the system's decision-making process. In response to this challenge, the new explainable RL (XRL) field has emerged and grown rapidly to help us understand RL systems. This systematic literature review aims to give a unified view of the field by reviewing ten existing XRL literature reviews and 189 XRL studies from the past five years. Furthermore, we seek to organize these studies into a new taxonomy, discuss each area in detail, and draw connections between methods and stakeholder questions (e.g., "how can I get the agent to do _?"). Finally, we look at the research trends in XRL, recommend XRL methods, and present some exciting research directions for future research. We hope stakeholders, such as RL researchers and practitioners, will utilize this literature review as a comprehensive resource to overview existing state-of-the-art XRL methods. Additionally, we strive to help find research gaps and quickly identify methods that answer stakeholder questions.
引用
收藏
页码:355 / 441
页数:87
相关论文
共 345 条
[91]  
García J, 2015, J MACH LEARN RES, V16, P1437
[92]   Interpretable AI Agent Through Nonlinear Decision Trees for Lane Change Problem [J].
Ghosh, Abhiroop ;
Dhebar, Yashesh ;
Guha, Ritam ;
Deb, Kalyanmoy ;
Nageshrao, Subramanya ;
Zhu, Ling ;
Tseng, Eric ;
Filev, Dimitar .
2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
[93]  
Gilpin LH, 2022, Arxiv, DOI [arXiv:2207.00007, DOI 10.48550/ARXIV.2207.00007, 10.48550/arXiv.2207.00007]
[94]   Explaining Explanations: An Overview of Interpretability of Machine Learning [J].
Gilpin, Leilani H. ;
Bau, David ;
Yuan, Ben Z. ;
Bajwa, Ayesha ;
Specter, Michael ;
Kagal, Lalana .
2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, :80-89
[95]  
Gjærum VB, 2021, 2021 EUROPEAN CONTROL CONFERENCE (ECC), P1465
[96]   Explaining a Deep Reinforcement Learning Docking Agent Using Linear Model Trees with User Adapted Visualization [J].
Gjaerum, Vilde B. ;
Strumke, Inga ;
Alsos, Ole Andreas ;
Lekkas, Anastasios M. .
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2021, 9 (11)
[97]  
Glanois C, 2022, Arxiv, DOI arXiv:2112.13112
[98]  
Goel Vikash., 2018, Advances in Neural Information Processing Systems, P5683
[99]   Peeking Inside the Black Box: Visualizing Statistical Learning With Plots of Individual Conditional Expectation [J].
Goldstein, Alex ;
Kapelner, Adam ;
Bleich, Justin ;
Pitkin, Emil .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2015, 24 (01) :44-65
[100]   European Union Regulations on Algorithmic Decision Making and a "Right to Explanation" [J].
Goodman, Bryce ;
Flaxman, Seth .
AI MAGAZINE, 2017, 38 (03) :50-57