A Closer Look at Reward Decomposition for High-level Robotic Explanations

被引：3

作者：

Lu, Wenhao ^{[1
]}

Zhao, Xufeng ^{[1
]}

Magg, Sven ^{[2
]}

Gromniak, Martin ^{[1
,3
]}

Li, Mengdi ^{[1
]}

Wermter, Stefan ^{[1
]}

机构：

[1] Univ Hamburg, Dept Informat, Knowledge Technol Grp, Hamburg, Germany

[2] Hamburger Informat Technol Ctr HITeC, Hamburg, Germany

[3] ZAL Ctr Appl Aeronaut Res, Hamburg, Germany

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, ICDL | 2023年

关键词：

D O I：

10.1109/ICDL55364.2023.10364407

中图分类号：

B84 [心理学]; C [社会科学总论]; Q98 [人类学];

学科分类号：

03 ; 0303 ; 030303 ; 04 ; 0402 ;

摘要：

Explaining the behaviour of intelligent agents learned by reinforcement learning (RL) to humans is challenging yet crucial due to their incomprehensible proprioceptive states, variational intermediate goals, and resultant unpredictability. Moreover, one-step explanations for RL agents can be ambiguous as they fail to account for the agent's future behaviour at each transition, adding to the complexity of explaining robot actions. By leveraging abstracted actions that map to task-specific primitives, we avoid explanations on the movement level. To further improve the transparency and explainability of robotic systems, we propose an explainable Q-Map learning framework that combines reward decomposition (RD) with abstracted action spaces, allowing for non-ambiguous and high-level explanations based on object properties in the task. We demonstrate the effectiveness of our framework through quantitative and qualitative analysis of two robotic scenarios, showcasing visual and textual explanations, from output artefacts of RD explanations, that are easy for humans to comprehend. Additionally, we demonstrate the versatility of integrating these artefacts with large language models (LLMs) for reasoning and interactive querying.

引用

页码：429 / 436

页数：8

共 42 条

[1] Ahn M, 2022, Arxiv, DOI arXiv:2204.01691
[2] Atrey A, 2019, ARXIV
[3] Bacon PL, 2016, Arxiv, DOI arXiv:1609.05140
[4] Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
Barredo Arrieta, Alejandro
Diaz-Rodriguez, Natalia
Del Ser, Javier
Bennetot, Adrien
Tabik, Siham
Barbado, Alberto
Garcia, Salvador
Gil-Lopez, Sergio
Molina, Daniel
Benjamins, Richard
Chatila, Raja
Herrera, Francisco
[J]. INFORMATION FUSION, 2020, 58 : 82 - 115
[5] Brown TB, 2020, ADV NEUR IN, V33
[6] Bubeck S., 2023, Sparks of artificial general intelligence: Early experiments with GPT-4 (Talk)
[7] Calli B, 2015, Arxiv, DOI arXiv:1502.03143
[8] Knowledge- and ambiguity-aware robot learning from corrective and evaluative feedback
Celemin, Carlos
Kober, Jens
[J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (23) : 16821 - 16839
[9] Recognizing object surface material from impact sounds for robot manipulation
Dimiccoli, Mariella
Patni, Shubhan
Hoffmann, Matej
Moreno-Noguer, Francesc
[J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 9280 - 9287
[10] Gaede C., 2022, KI-Kunstliche Intelligenz, P1

← 1 2 3 4 5 →