A Bayesian Network Approach to Explainable Reinforcement Learning with Distal Information

被引：0

作者：

Milani, Rudy ^{[1
]}

Moll, Maximilian ^{[1
]}

De Leone, Renato ^{[2
]}

Pickl, Stefan ^{[1
]}

机构：

[1] Univ Bundeswehr Muenchen, Fac Comp Sci, Werner Heisenberg Weg 39, D-85577 Neubiberg, Germany

[2] Univ Camerino, Sch Sci & Technol, Via Madonna Carceri 9, I-62032 Camerino, Italy

来源：

SENSORS | 2023年 / 23卷 / 04期

基金：

英国科研创新办公室;

关键词：

Explainable Reinforcement Learning; Bayesian Network; model-free methods; causal explanation; human study; MODEL;

D O I：

10.3390/s23042013

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Nowadays, Artificial Intelligence systems have expanded their competence field from research to industry and daily life, so understanding how they make decisions is becoming fundamental to reducing the lack of trust between users and machines and increasing the transparency of the model. This paper aims to automate the generation of explanations for model-free Reinforcement Learning algorithms by answering "why" and "why not" questions. To this end, we use Bayesian Networks in combination with the NOTEARS algorithm for automatic structure learning. This approach complements an existing framework very well and demonstrates thus a step towards generating explanations with as little user input as possible. This approach is computationally evaluated in three benchmarks using different Reinforcement Learning methods to highlight that it is independent of the type of model used and the explanations are then rated through a human study. The results obtained are compared to other baseline explanation models to underline the satisfying performance of the framework presented in terms of increasing the understanding, transparency and trust in the action chosen by the agent.

引用

页数：38

共 61 条

[1] Optuna: A Next-generation Hyperparameter Optimization Framework
Akiba, Takuya
Sano, Shotaro
Yanase, Toshihiko
Ohta, Takeru
Koyama, Masanori
[J]. KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 2623 - 2631
[2] [Anonymous], 1936, Teoria Statistica Delle Classi e Calcolo Delle Probabilita, DOI DOI 10.4135/9781412961288.N455
[3] Baader M, 2020, Arxiv, DOI arXiv:1909.13846
[4] Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
Barredo Arrieta, Alejandro
Diaz-Rodriguez, Natalia
Del Ser, Javier
Bennetot, Adrien
Tabik, Siham
Barbado, Alberto
Garcia, Salvador
Gil-Lopez, Sergio
Molina, Daniel
Benjamins, Richard
Chatila, Raja
Herrera, Francisco
[J]. INFORMATION FUSION, 2020, 58 : 82 - 115
[5] Bhatt U., 2020, ARXIV
[6] Brockman Greg, 2016, arXiv
[7] Byrne RMJ, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P6276
[8] Reinforcement Learning in Economics and Finance
Charpentier, Arthur
Elie, Romuald
Remlinger, Carl
[J]. COMPUTATIONAL ECONOMICS, 2023, 62 (01) : 425 - 462
[9] Chen J. Y., 2014, ARL-TR-6905
[10] Attention cutting and padding learning for fine-grained image recognition
Cheng, Zhuo
Li, Hongjian
Duan, Xiaolin
Zeng, Xiangyan
He, Mingxuan
Luo, Hao
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (21-23) : 32791 - 32805

← 1 2 3 4 5 6 7 →