Visual Analytics for RNN-Based Deep Reinforcement Learning

被引:21
作者
Wang, Junpeng [1 ]
Zhang, Wei [1 ]
Yang, Hao [1 ]
Yeh, Chin-Chia Michael [1 ]
Wang, Liang [1 ]
机构
[1] Visa Res, Palo Alto, CA 94301 USA
关键词
Games; Visual analytics; Analytical models; Reinforcement learning; Recurrent neural networks; Perturbation methods; Data models; Deep reinforcement learning (DRL); recurrent neural network (RNN); model interpretation; visual analytics;
D O I
10.1109/TVCG.2021.3076749
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep reinforcement learning (DRL) targets to train an autonomous agent to interact with a pre-defined environment and strives to achieve specific goals through deep neural networks (DNN). Recurrent neural network (RNN) based DRL has demonstrated superior performance, as RNNs can effectively capture the temporal evolution of the environment and respond with proper agent actions. However, apart from the outstanding performance, little is known about how RNNs understand the environment internally and what has been memorized over time. Revealing these details is extremely important for deep learning experts to understand and improve DRLs, which in contrast, is also challenging due to the complicated data transformations inside these models. In this article, we propose Deep Reinforcement Learning Interactive Visual Explorer (DRLIVE), a visual analytics system to effectively explore, interpret, and diagnose RNN-based DRLs. Having focused on DRL agents trained for different Atari games, DRLIVE accomplishes three tasks: game episode exploration, RNN hidden/cell state examination, and interactive model perturbation. Using the system, one can flexibly explore a DRL agent through interactive visualizations, discover interpretable RNN cells by prioritizing RNN hidden/cell states with a set of metrics, and further diagnose the DRL model by interactively perturbing its inputs. Through concrete studies with multiple deep learning experts, we validated the efficacy of DRLIVE.
引用
收藏
页码:4141 / 4155
页数:15
相关论文
共 48 条
[1]   Do Convolutional Neural Networks Learn Class Hierarchy? [J].
Alsallakh, Bilal ;
Jourabloo, Amin ;
Ye, Mao ;
Liu, Xiaoming ;
Ren, Liu .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2018, 24 (01) :152-162
[2]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38
[3]   The Arcade Learning Environment: An Evaluation Platform for General Agents [J].
Bellemare, Marc G. ;
Naddaf, Yavar ;
Veness, Joel ;
Bowling, Michael .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2013, 47 :253-279
[4]   RNNbow: Visualizing Learning Via Backpropagation Gradients in RNNs [J].
Cashman, Dylan ;
Patterson, Genevieve ;
Mosca, Abigail ;
Watts, Nathan ;
Robinson, Shannon ;
Chang, Remco .
IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2018, 38 (06) :39-50
[5]   Visual Analytics for Explainable Deep Learning [J].
Choo, Jaegul ;
Liu, Shixia .
IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2018, 38 (04) :84-92
[6]   VATLD: A Visual Analytics System to Assess, Understand and Improve Traffic Light Detection [J].
Gou, Liang ;
Zou, Lincan ;
Li, Nanxiang ;
Hofmann, Michael ;
Shekar, Arvind Kumar ;
Wendt, Axel ;
Ren, Liu .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (02) :261-271
[7]  
Greydanus S, 2018, PR MACH LEARN RES, V80
[8]   A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients [J].
Grondman, Ivo ;
Busoniu, Lucian ;
Lopes, Gabriel A. D. ;
Babuska, Robert .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06) :1291-1307
[9]   DynamicsExplorer: Visual Analytics for Robot Control Tasks involving Dynamics and LSTM-based Control Policies [J].
He, Wenbin ;
Lee, Teng-Yok ;
van Baar, Jeroen ;
Wittenburg, Kent ;
Shen, Han-Wei .
2020 IEEE PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS), 2020, :36-45
[10]   Visual Analytics in Deep Learning: An Interrogative Survey for the Next Frontiers [J].
Hohman, Fred ;
Kahng, Minsuk ;
Pienta, Robert ;
Chau, Duen Horng .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (08) :2674-2693