Learning Sparse Evidence-Driven Interpretation to Understand Deep Reinforcement Learning Agents

被引:2
作者
Dao, Giang [1 ]
Huff, Wesley Houston [1 ]
Lee, Minwoo [1 ]
机构
[1] Univ North Carolina Charlotte, Dept Comp Sci, Charlotte, NC 28223 USA
来源
2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021) | 2021年
关键词
explanation; sparsity; evidence-driven interpretation; reinforcement learning;
D O I
10.1109/SSCI50451.2021.9660192
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in machine learning require interpretability and explainability for reliable and trustworthy systems. However, explanations of machine learning models are often hard to achieve given the large amount of information from the complex machine learning models. Evidence-driven reinforcement learning provides snapshot images to understand the learning experiences and the learned behaviors; however, it requires human labor to analyze a large number of retrieved snapshot images. Imposing sparsity of the evidence collection process for interpretation is, thus, significant to make human interpretation easy. In this paper, we proposed novel sparse evidence collection methods to discarding less important images for interpretation. We discuss the trade-offs between the sparsity and re-approximation accuracy and the quality of evidence in different Atari game environments.
引用
收藏
页数:7
相关论文
共 28 条
[1]   Reinforcement Learning Interpretation Methods: A Survey [J].
Alharin, Alnour ;
Doan, Thanh-Nam ;
Sartipi, Mina .
IEEE ACCESS, 2020, 8 :171058-171077
[2]  
Nguyen A, 2015, PROC CVPR IEEE, P427, DOI 10.1109/CVPR.2015.7298640
[3]  
Carlini N., 2017, P AISEC
[4]  
Dao G, 2020, ARXIV201207119
[5]   Deep Reinforcement Learning Monitor for Snapshot Recording [J].
Dao, Giang ;
Mishra, Indrajeet ;
Lee, Minwoo .
2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, :591-598
[6]  
Frey D, 1978, PRINCIPAL COMPONENT
[7]   RULE GENERATION FROM NEURAL NETWORKS [J].
FU, LM .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1994, 24 (08) :1114-1124
[8]   NEURAL NETWORKS AND THE BIAS VARIANCE DILEMMA [J].
GEMAN, S ;
BIENENSTOCK, E ;
DOURSAT, R .
NEURAL COMPUTATION, 1992, 4 (01) :1-58
[9]  
Jia Robin, 2017, P 2017 C EMP METH NA, P2021
[10]  
Kingma D.P., 2013, INT C LEARN REPR