Beyond simple reinforcement learning: the computational neurobiology of reward-learning and valuation

被引:24
作者
O'Doherty, John P. [1 ]
机构
[1] CALTECH, Pasadena, CA 91125 USA
关键词
basal ganglia; computational neuroscience; conditioning; decision-making; prefrontal cortex; ORBITOFRONTAL CORTEX; PREDICTION ERROR; DECISION-MAKING; DOPAMINE; AMYGDALA; MODEL; REPRESENTATION; STRIATUM; SYSTEMS; ENCODE;
D O I
10.1111/j.1460-9568.2012.08074.x
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Neural computational accounts of reward-learning have been dominated by the hypothesis that dopamine neurons behave like a reward-prediction error and thus facilitate reinforcement learning in striatal target neurons. While this framework is consistent with a lot of behavioral and neural evidence, this theory fails to account for a number of behavioral and neurobiological observations. In this special issue of EJN we feature a combination of theoretical and experimental papers highlighting some of the explanatory challenges faced by simple reinforcement-learning models and describing some of the ways in which the framework is being extended in order to address these challenges.
引用
收藏
页码:987 / 990
页数:4
相关论文
共 43 条
[1]   Neural control of dopamine neurotransmission: implications for reinforcement learning [J].
Aggarwal, Mayank ;
Hyland, Brian I. ;
Wickens, Jeffery R. .
EUROPEAN JOURNAL OF NEUROSCIENCE, 2012, 35 (07) :1115-1123
[2]   Human and Rodent Homologies in Action Control: Corticostriatal Determinants of Goal-Directed and Habitual Action [J].
Balleine, Bernard W. ;
O'Doherty, John P. .
NEUROPSYCHOPHARMACOLOGY, 2010, 35 (01) :48-69
[3]   Goal-directed instrumental action: contingency and incentive learning and their cortical substrates [J].
Balleine, BW ;
Dickinson, A .
NEUROPHARMACOLOGY, 1998, 37 (4-5) :407-419
[4]   Midbrain dopamine neurons encode a quantitative reward prediction error signal [J].
Bayer, HM ;
Glimcher, PW .
NEURON, 2005, 47 (01) :129-141
[5]   Learning the value of information in an uncertain world [J].
Behrens, Timothy E. J. ;
Woolrich, Mark W. ;
Walton, Mark E. ;
Rushworth, Matthew F. S. .
NATURE NEUROSCIENCE, 2007, 10 (09) :1214-1221
[6]   What is the role of dopamine in reward: hedonic impact, reward learning, or incentive salience? [J].
Berridge, KC ;
Robinson, TE .
BRAIN RESEARCH REVIEWS, 1998, 28 (03) :309-369
[7]   The debate over dopamine's role in reward: the case for incentive salience [J].
Berridge, Kent C. .
PSYCHOPHARMACOLOGY, 2007, 191 (03) :391-431
[8]   From prediction error to incentive salience: mesolimbic computation of reward motivation [J].
Berridge, Kent C. .
EUROPEAN JOURNAL OF NEUROSCIENCE, 2012, 35 (07) :1124-1143
[9]   Dissociating hippocampal and striatal contributions to sequential prediction learning [J].
Bornstein, Aaron M. ;
Daw, Nathaniel D. .
EUROPEAN JOURNAL OF NEUROSCIENCE, 2012, 35 (07) :1011-1023
[10]   Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective [J].
Botvinick, Matthew M. ;
Niv, Yael ;
Barto, Andrew C. .
COGNITION, 2009, 113 (03) :262-280