A Bayesian Network Approach to Explainable Reinforcement Learning with Distal Information

被引:0
作者
Milani, Rudy [1 ]
Moll, Maximilian [1 ]
De Leone, Renato [2 ]
Pickl, Stefan [1 ]
机构
[1] Univ Bundeswehr Muenchen, Fac Comp Sci, Werner Heisenberg Weg 39, D-85577 Neubiberg, Germany
[2] Univ Camerino, Sch Sci & Technol, Via Madonna Carceri 9, I-62032 Camerino, Italy
基金
英国科研创新办公室;
关键词
Explainable Reinforcement Learning; Bayesian Network; model-free methods; causal explanation; human study; MODEL;
D O I
10.3390/s23042013
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Nowadays, Artificial Intelligence systems have expanded their competence field from research to industry and daily life, so understanding how they make decisions is becoming fundamental to reducing the lack of trust between users and machines and increasing the transparency of the model. This paper aims to automate the generation of explanations for model-free Reinforcement Learning algorithms by answering "why" and "why not" questions. To this end, we use Bayesian Networks in combination with the NOTEARS algorithm for automatic structure learning. This approach complements an existing framework very well and demonstrates thus a step towards generating explanations with as little user input as possible. This approach is computationally evaluated in three benchmarks using different Reinforcement Learning methods to highlight that it is independent of the type of model used and the explanations are then rated through a human study. The results obtained are compared to other baseline explanation models to underline the satisfying performance of the framework presented in terms of increasing the understanding, transparency and trust in the action chosen by the agent.
引用
收藏
页数:38
相关论文
共 61 条
  • [1] Optuna: A Next-generation Hyperparameter Optimization Framework
    Akiba, Takuya
    Sano, Shotaro
    Yanase, Toshihiko
    Ohta, Takeru
    Koyama, Masanori
    [J]. KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 2623 - 2631
  • [2] [Anonymous], 1936, Teoria Statistica Delle Classi e Calcolo Delle Probabilita, DOI DOI 10.4135/9781412961288.N455
  • [3] Baader M, 2020, Arxiv, DOI arXiv:1909.13846
  • [4] Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
    Barredo Arrieta, Alejandro
    Diaz-Rodriguez, Natalia
    Del Ser, Javier
    Bennetot, Adrien
    Tabik, Siham
    Barbado, Alberto
    Garcia, Salvador
    Gil-Lopez, Sergio
    Molina, Daniel
    Benjamins, Richard
    Chatila, Raja
    Herrera, Francisco
    [J]. INFORMATION FUSION, 2020, 58 : 82 - 115
  • [5] Bhatt U., 2020, ARXIV
  • [6] Brockman Greg, 2016, arXiv
  • [7] Byrne RMJ, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P6276
  • [8] Reinforcement Learning in Economics and Finance
    Charpentier, Arthur
    Elie, Romuald
    Remlinger, Carl
    [J]. COMPUTATIONAL ECONOMICS, 2023, 62 (01) : 425 - 462
  • [9] Chen J. Y., 2014, ARL-TR-6905
  • [10] Attention cutting and padding learning for fine-grained image recognition
    Cheng, Zhuo
    Li, Hongjian
    Duan, Xiaolin
    Zeng, Xiangyan
    He, Mingxuan
    Luo, Hao
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (21-23) : 32791 - 32805