共 50 条
Active inference and learning
被引:348
|作者:
Friston, Karl
[1
]
FitzGerald, Thomas
[1
,2
]
Rigoli, Francesco
[1
]
Schwartenbeck, Philipp
[1
,2
,3
,4
]
O'Doherty, John
[5
]
Pezzulo, Giovanni
[6
]
机构:
[1] UCL, Wellcome Trust Ctr Neuroimaging, 12 Queen Sq, London, England
[2] Max Planck UCL Ctr Computat Psychiat & Ageing Res, London, England
[3] Salzburg Univ, Ctr Neurocognit Res, Salzburg, Austria
[4] Paracelsus Med Univ Salzburg, Inst Neurosci, Christian Doppler Klin, Salzburg, Austria
[5] CALTECH, Caltech Brain Imaging Ctr, Pasadena, CA 91125 USA
[6] CNR, Inst Cognit Sci & Technol, Rome, Italy
基金:
英国惠康基金;
关键词:
Active inference;
Habit learning;
Bayesian inference;
Goal-directed;
Free energy;
Information gain;
Bayesian surprise;
Epistemic value;
Exploration;
Exploitation;
DECISION-MAKING;
INFORMATION;
PREDICTION;
CHOICE;
MODELS;
EXPLORATION;
SURPRISE;
NOVELTY;
UNCERTAINTY;
COMPLEXITY;
D O I:
10.1016/j.neubiorev.2016.06.022
中图分类号:
B84 [心理学];
C [社会科学总论];
Q98 [人类学];
学科分类号:
03 ;
0303 ;
030303 ;
04 ;
0402 ;
摘要:
This paper offers an active inference account of choice behaviour and learning. It focuses on the distinction between goal-directed and habitual behaviour and how they contextualise each other. We show that habits emerge naturally (and autodidactically) from sequential policy optimisation when agents are equipped with state-action policies. In active inference, behaviour has explorative (epistemic) and exploitative (pragmatic) aspects that are sensitive to ambiguity and risk respectively, where epistemic (ambiguity-resolving) behaviour enables pragmatic (reward-seeking) behaviour and the subsequent emergence of habits. Although goal-directed and habitual policies are usually associated with model-based and model-free schemes, we find the more important distinction is between belief-free and belief-based schemes. The underlying (variational) belief updating provides a comprehensive (if metaphorical) process theory for several phenomena, including the transfer of dopamine responses, reversal learning, habit formation and devaluation. Finally, we show that active inference reduces to a classical (Bellman) scheme, in the absence of ambiguity. (C) 2016 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license
引用
收藏
页码:862 / 879
页数:18
相关论文