Exploration, novelty, surprise, and free energy minimization

被引:136
作者
Schwartenbeck, Philipp [1 ]
FitzGerald, Thomas [1 ]
Dolan, Raymond J. [1 ]
Friston, Karl J. [1 ]
机构
[1] UCL, Inst Neurol, Wellcome Trust Ctr Neuroimaging, London WC1N 3BG, England
基金
英国惠康基金;
关键词
active inference; exploration; exploitation; novelty; reinforcement learning; free energy; ACTIVE INFERENCE; BRAIN;
D O I
10.3389/fpsyg.2013.00710
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
This paper reviews recent developments under the free energy principle that introduce a normative perspective on classical economic (utilitarian) decision-making based on (active) Bayesian inference. It has been suggested that the free energy principle precludes novelty and complexity, because it assumes that biological systems-like ourselves-try to minimize the long-term average of surprise to maintain their homeostasis. However, recent formulations show that minimizing surprise leads naturally to concepts such as exploration and novelty bonuses. In this approach, agents infer a policy that minimizes surprise by minimizing the difference (or relative entropy) between likely and desired outcomes, which involves both pursuing the goal-state that has the highest expected utility (often termed exploitation) and visiting a number of different goal-states (exploration). Crucially, the opportunity to visit new states increases the value of the current state. Casting decision-making problems within a variational framework, therefore, predicts that our behavior is governed by both the entropy and expected utility of future states. This dissolves any dialectic between minimizing surprise and exploration or novelty seeking.
引用
收藏
页数:5
相关论文
共 32 条
[1]  
Adams Rick A, 2013, Front Psychiatry, V4, P47, DOI 10.3389/fpsyt.2013.00047
[2]   Predictions not commands: active inference in the motor system [J].
Adams, Rick A. ;
Shipp, Stewart ;
Friston, Karl J. .
BRAIN STRUCTURE & FUNCTION, 2013, 218 (03) :611-643
[3]  
[Anonymous], 1993, P 6 ANN C COMPUTATIO, DOI DOI 10.1145/168304.168306
[4]   The free-energy self: A predictive coding account of self-recognition [J].
Apps, Matthew A. J. ;
Tsakiris, Manos .
NEUROSCIENCE AND BIOBEHAVIORAL REVIEWS, 2014, 41 :85-97
[5]   Canonical Microcircuits for Predictive Coding [J].
Bastos, Andre M. ;
Usrey, W. Martin ;
Adams, Rick A. ;
Mangun, George R. ;
Fries, Pascal ;
Friston, Karl J. .
NEURON, 2012, 76 (04) :695-711
[6]   Free-energy and illusions: the Cornsweet effect [J].
Brown, Harriet ;
Friston, Karl J. .
FRONTIERS IN PSYCHOLOGY, 2012, 3
[7]   Active inference, attention, and motor preparation [J].
Brown, Harriet ;
Friston, Karl J. ;
Bestmann, Sven .
FRONTIERS IN PSYCHOLOGY, 2011, 2
[8]   Whatever next? Predictive brains, situated agents, and the future of cognitive science [J].
Clark, Andy .
BEHAVIORAL AND BRAIN SCIENCES, 2013, 36 (03) :181-204
[9]   Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration [J].
Cohen, Jonathan D. ;
McClure, Samuel M. ;
Yu, Angela J. .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2007, 362 (1481) :933-942
[10]   A Bayesian account of 'hysteria' [J].
Edwards, Mark J. ;
Adams, Rick A. ;
Brown, Harriet ;
Parees, Isabel ;
Friston, Karl J. .
BRAIN, 2012, 135 :3495-3512