Learning Sparse Evidence-Driven Interpretation to Understand Deep Reinforcement Learning Agents

被引：2

作者：

Dao, Giang ^{[1
]}

Huff, Wesley Houston ^{[1
]}

Lee, Minwoo ^{[1
]}

机构：

[1] Univ North Carolina Charlotte, Dept Comp Sci, Charlotte, NC 28223 USA

来源：

2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021) | 2021年

关键词：

explanation; sparsity; evidence-driven interpretation; reinforcement learning;

D O I：

10.1109/SSCI50451.2021.9660192

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent advances in machine learning require interpretability and explainability for reliable and trustworthy systems. However, explanations of machine learning models are often hard to achieve given the large amount of information from the complex machine learning models. Evidence-driven reinforcement learning provides snapshot images to understand the learning experiences and the learned behaviors; however, it requires human labor to analyze a large number of retrieved snapshot images. Imposing sparsity of the evidence collection process for interpretation is, thus, significant to make human interpretation easy. In this paper, we proposed novel sparse evidence collection methods to discarding less important images for interpretation. We discuss the trade-offs between the sparsity and re-approximation accuracy and the quality of evidence in different Atari game environments.

引用

页数：7

共 28 条

[1] Reinforcement Learning Interpretation Methods: A Survey [J].

Alharin, Alnour ;

Doan, Thanh-Nam ;

Sartipi, Mina .

IEEE ACCESS, 2020, 8 :171058-171077

[2]

Nguyen A, 2015, PROC CVPR IEEE, P427, DOI 10.1109/CVPR.2015.7298640

[3]

Carlini N., 2017, P AISEC

[4]

Dao G, 2020, ARXIV201207119

[5] Deep Reinforcement Learning Monitor for Snapshot Recording [J].

Dao, Giang ;

Mishra, Indrajeet ;

Lee, Minwoo .

2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, :591-598

[6]

Frey D, 1978, PRINCIPAL COMPONENT

[7] RULE GENERATION FROM NEURAL NETWORKS [J].

FU, LM .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1994, 24 (08) :1114-1124

[8] NEURAL NETWORKS AND THE BIAS VARIANCE DILEMMA [J].

GEMAN, S ;

BIENENSTOCK, E ;

DOURSAT, R .

NEURAL COMPUTATION, 1992, 4 (01) :1-58

[9]

Jia Robin, 2017, P 2017 C EMP METH NA, P2021

[10]

Kingma D.P., 2013, INT C LEARN REPR

← 1 2 3 →