Learning Generative State Space Models for Active Inference

被引：29

作者：

Catal, Ozan ^{[1
]}

Wauthier, Samuel ^{[1
]}

De Boom, Cedric ^{[1
]}

Verbelen, Tim ^{[1
]}

Dhoedt, Bart ^{[1
]}

机构：

[1] Univ Ghent, Dept Informat Technol, IDLab, IMEC, Ghent, Belgium

来源：

FRONTIERS IN COMPUTATIONAL NEUROSCIENCE | 2020年 / 14卷

关键词：

active inference; free energy; deep learning; generative modeling; robotics; FREE-ENERGY PRINCIPLE;

D O I：

10.3389/fncom.2020.574372

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

In this paper we investigate the active inference framework as a means to enable autonomous behavior in artificial agents. Active inference is a theoretical framework underpinning the way organisms act and observe in the real world. In active inference, agents act in order to minimize their so called free energy, or prediction error. Besides being biologically plausible, active inference has been shown to solve hard exploration problems in various simulated environments. However, these simulations typically require handcrafting a generative model for the agent. Therefore we propose to use recent advances in deep artificial neural networks to learn generative state space models from scratch, using only observation-action sequences. This way we are able to scale active inference to new and challenging problem domains, whilst still building on the theoretical backing of the free energy principle. We validate our approach on the mountain car problem to illustrate that our learnt models can indeed trade-off instrumental value and ambiguity. Furthermore, we show that generative models can also be learnt using high-dimensional pixel observations, both in the OpenAI Gym car racing environment and a real-world robotic navigation task. Finally we show that active inference based policies are an order of magnitude more sample efficient than Deep Q Networks on RL tasks.

引用

页数：17

共 54 条

[1]

Abbeel P., 2005, P INT C MACH LEARN, DOI [10.1145/1102351.1102352, 10.1145/1102351, DOI 10.1145/1102351]

[2]

Angelucci A, 2002, J NEUROSCI, V22, P8633

[3] Canonical Microcircuits for Predictive Coding [J].

Bastos, Andre M. ;

Usrey, W. Martin ;

Adams, Rick A. ;

Mangun, George R. ;

Fries, Pascal ;

Friston, Karl J. .

NEURON, 2012, 76 (04) :695-711

[4]

Beal M. J., 2003, Algorithms in Bayesian Auction Games

[5]

Buesing L., 2018, ARXIV180203006

[6]

Cornell D., 2018, 35 INT C MACH LEARN, P1708

[7] THE HELMHOLTZ MACHINE [J].

DAYAN, P ;

HINTON, GE ;

NEAL, RM ;

ZEMEL, RS .

NEURAL COMPUTATION, 1995, 7 (05) :889-904

[8] Active inference and epistemic value [J].

Friston, Karl ;

Rigoli, Francesco ;

Ognibene, Dimitri ;

Mathys, Christoph ;

Fitzgerald, Thomas ;

Pezzulo, Giovanni .

COGNITIVE NEUROSCIENCE, 2015, 6 (04) :187-214

[9] Active inference and learning [J].

Friston, Karl ;

FitzGerald, Thomas ;

Rigoli, Francesco ;

Schwartenbeck, Philipp ;

O'Doherty, John ;

Pezzulo, Giovanni .

NEUROSCIENCE AND BIOBEHAVIORAL REVIEWS, 2016, 68 :862-879

[10] Life as we know it [J].

Friston, Karl .

JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2013, 10 (86)

← 1 2 3 4 5 6 →