Interpreting Recurrent Neural Networks Behaviour via Excitable Network Attractors

被引:31
作者
Ceni, Andrea [1 ]
Ashwin, Peter [2 ]
Livi, Lorenzo [1 ,3 ,4 ]
机构
[1] Univ Exeter, Dept Comp Sci, Exeter EX4 4QF, Devon, England
[2] Univ Exeter, Dept Math, Exeter EX4 4QF, Devon, England
[3] Univ Manitoba, Dept Comp Sci, Winnipeg, MB R3T 2N2, Canada
[4] Univ Manitoba, Dept Math, Winnipeg, MB R3T 2N2, Canada
基金
英国工程与自然科学研究理事会;
关键词
Recurrent neural networks; Dynamical systems; Network attractors; Bifurcations; ECHO STATE PROPERTY; APPROXIMATION; ITINERANCY; DYNAMICS; SYSTEMS;
D O I
10.1007/s12559-019-09634-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning provides fundamental tools both for scientific research and for the development of technologies with significant impact on society. It provides methods that facilitate the discovery of regularities in data and that give predictions without explicit knowledge of the rules governing a system. However, a price is paid for exploiting such flexibility: machine learning methods are typically black boxes where it is difficult to fully understand what the machine is doing or how it is operating. This poses constraints on the applicability and explainability of such methods. Our research aims to open the black box of recurrent neural networks, an important family of neural networks used for processing sequential data. We propose a novel methodology that provides a mechanistic interpretation of behaviour when solving a computational task. Our methodology uses mathematical constructs called excitable network attractors, which are invariant sets in phase space composed of stable attractors and excitable connections between them. As the behaviour of recurrent neural networks depends both on training and on inputs to the system, we introduce an algorithm to extract network attractors directly from the trajectory of a neural network while solving tasks. Simulations conducted on a controlled benchmark task confirm the relevance of these attractors for interpreting the behaviour of recurrent neural networks, at least for tasks that involve learning a finite number of stable states and transitions between them.
引用
收藏
页码:330 / 356
页数:27
相关论文
共 62 条
[1]   Low-dimensional dynamics of structured random networks [J].
Aljadeff, Johnatan ;
Renfrew, David ;
Vegue, Marina ;
Sharpee, Tatyana O. .
PHYSICAL REVIEW E, 2016, 93 (02)
[2]  
[Anonymous], 2014, Nonlinear Dynamics and Chaos: With Applications to Physics, Biology, Chemistry, and Engineering
[3]   An extensive comparative study of cluster validity indices [J].
Arbelaitz, Olatz ;
Gurrutxaga, Ibai ;
Muguerza, Javier ;
Perez, Jesus M. ;
Perona, Inigo .
PATTERN RECOGNITION, 2013, 46 (01) :243-256
[4]  
Arjovsky M, 2016, PR MACH LEARN RES, V48
[5]   Sensitive Finite-State Computations Using a Distributed Network With a Noisy Network Attractor [J].
Ashwin, Peter ;
Postlethwaite, Claire .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (12) :5847-5858
[6]   Designing Heteroclinic and Excitable Networks in Phase Space Using Two Populations of Coupled Cells [J].
Ashwin, Peter ;
Postlethwaite, Claire .
JOURNAL OF NONLINEAR SCIENCE, 2016, 26 (02) :345-364
[7]   On designing heteroclinic networks from graphs [J].
Ashwin, Peter ;
Postlethwaite, Claire .
PHYSICA D-NONLINEAR PHENOMENA, 2013, 265 :26-39
[8]   Parameter space structure of continuous-time recurrent neural networks [J].
Beer, Randall D. .
NEURAL COMPUTATION, 2006, 18 (12) :3009-3051
[9]   Investigating Echo-State Networks Dynamics by Means of Recurrence Analysis [J].
Bianchi, Filippo Maria ;
Livi, Lorenzo ;
Alippi, Cesare .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (02) :427-439
[10]   Prediction of telephone calls load using Echo State Network with exogenous variables [J].
Bianchi, Filippo Maria ;
Scardapane, Simone ;
Uncini, Aurelio ;
Rizzi, Antonello ;
Sadeghian, Alireza .
NEURAL NETWORKS, 2015, 71 :204-213