Hierarchical Learning Induces Two Simultaneous, But Separable, Prediction Errors in Human Basal Ganglia

被引:63
作者
Diuk, Carlos [1 ,2 ]
Tsai, Karin [3 ]
Wallis, Jonathan [4 ,5 ]
Botvinick, Matthew [1 ,2 ]
Niv, Yael [1 ,2 ]
机构
[1] Princeton Univ, Dept Psychol, Princeton, NJ 08544 USA
[2] Princeton Univ, Princeton Neurosci Inst, Princeton, NJ 08544 USA
[3] Princeton Univ, Dept Comp Sci, Princeton, NJ 08540 USA
[4] Univ Calif Berkeley, Dept Psychol, Berkeley, CA 94720 USA
[5] Univ Calif Berkeley, Helen Wills Neurosci Inst, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
NEURAL SIGNATURE; HUMAN STRIATUM; SIGNALS; REWARD; DOPAMINE; PROBABILITY; FRAMEWORK;
D O I
10.1523/JNEUROSCI.5445-12.2013
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Studies suggest that dopaminergic neurons report a unitary, global reward prediction error signal. However, learning in complex real-life tasks, in particular tasks that show hierarchical structure, requires multiple prediction errors that may coincide in time. We used functional neuroimaging to measure prediction error signals in humans performing such a hierarchical task involving simultaneous, uncorrelated prediction errors. Analysis of signals in a priori anatomical regions of interest in the ventral striatum and the ventral tegmental area indeed evidenced two simultaneous, but separable, prediction error signals corresponding to the two levels of hierarchy in the task. This result suggests that suitably designed tasks may reveal a more intricate pattern of firing in dopaminergic neurons. Moreover, the need for downstream separation of these signals implies possible limitations on the number of different task levels that we can learn about simultaneously.
引用
收藏
页码:5797 / 5805
页数:9
相关论文
共 42 条
[1]   Prediction error as a linear function of reward probability is coded in human nucleus accumbens [J].
Abler, Birgit ;
Walter, Henrik ;
Erk, Susanne ;
Kammerer, Hannes ;
Spitzer, Manfred .
NEUROIMAGE, 2006, 31 (02) :790-795
[2]  
[Anonymous], 2009, DECISION MAKING AFFE
[3]  
Barto A. G., 1995, Models of Information Processing in the Basal Ganglia, P215
[4]  
Barto AG, 2003, DISCRETE EVENT DYN S, V13, P343
[5]   Associative learning of social value [J].
Behrens, Timothy E. J. ;
Hunt, Laurence T. ;
Woolrich, Mark W. ;
Rushworth, Matthew F. S. .
NATURE, 2008, 456 (7219) :245-U45
[6]   Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective [J].
Botvinick, Matthew M. ;
Niv, Yael ;
Barto, Andrew C. .
COGNITION, 2009, 113 (03) :262-280
[7]   Hierarchical reinforcement learning and decision making [J].
Botvinick, Matthew Michael .
CURRENT OPINION IN NEUROBIOLOGY, 2012, 22 (06) :956-962
[8]   The psychophysics toolbox [J].
Brainard, DH .
SPATIAL VISION, 1997, 10 (04) :433-436
[9]   Phasic excitation of dopamine neurons in ventral VTA by noxious stimuli [J].
Brischoux, Frederic ;
Chakraborty, Subhojit ;
Brierley, Daniel I. ;
Ungless, Mark A. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (12) :4894-4899
[10]   Neural mechanisms of observational learning [J].
Burke, Christopher J. ;
Tobler, Philippe N. ;
Baddeley, Michelle ;
Schultz, Wolfram .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (32) :14431-14436