Hierarchical Learning Induces Two Simultaneous, But Separable, Prediction Errors in Human Basal Ganglia

被引：63

作者：

Diuk, Carlos ^{[1
,2
]}

Tsai, Karin ^{[3
]}

Wallis, Jonathan ^{[4
,5
]}

Botvinick, Matthew ^{[1
,2
]}

Niv, Yael ^{[1
,2
]}

机构：

[1] Princeton Univ, Dept Psychol, Princeton, NJ 08544 USA

[2] Princeton Univ, Princeton Neurosci Inst, Princeton, NJ 08544 USA

[3] Princeton Univ, Dept Comp Sci, Princeton, NJ 08540 USA

[4] Univ Calif Berkeley, Dept Psychol, Berkeley, CA 94720 USA

[5] Univ Calif Berkeley, Helen Wills Neurosci Inst, Berkeley, CA 94720 USA

来源：

JOURNAL OF NEUROSCIENCE | 2013年 / 33卷 / 13期

基金：

美国国家科学基金会;

关键词：

NEURAL SIGNATURE; HUMAN STRIATUM; SIGNALS; REWARD; DOPAMINE; PROBABILITY; FRAMEWORK;

D O I：

10.1523/JNEUROSCI.5445-12.2013

中图分类号：

Q189 [神经科学];

学科分类号：

071006 ;

摘要：

Studies suggest that dopaminergic neurons report a unitary, global reward prediction error signal. However, learning in complex real-life tasks, in particular tasks that show hierarchical structure, requires multiple prediction errors that may coincide in time. We used functional neuroimaging to measure prediction error signals in humans performing such a hierarchical task involving simultaneous, uncorrelated prediction errors. Analysis of signals in a priori anatomical regions of interest in the ventral striatum and the ventral tegmental area indeed evidenced two simultaneous, but separable, prediction error signals corresponding to the two levels of hierarchy in the task. This result suggests that suitably designed tasks may reveal a more intricate pattern of firing in dopaminergic neurons. Moreover, the need for downstream separation of these signals implies possible limitations on the number of different task levels that we can learn about simultaneously.

引用

页码：5797 / 5805

页数：9

共 42 条

[21]

Glimcher PW, 2011, P NATL ACAD SCI US, P108

[22] Dissociating the role of the orbitofrontal cortex and the striatum in the computation of goal values and prediction errors [J].

Hare, Todd A. ;

O'Doherty, John ;

Camerer, Colin F. ;

Schultz, Wolfram ;

Rangel, Antonio .

JOURNAL OF NEUROSCIENCE, 2008, 28 (22) :5623-5630

[23] A novel method for analyzing sequential eye movements reveals strategic influence on Raven's Advanced Progressive Matrices [J].

Hayes, Taylor R. ;

Petrov, Alexander A. ;

Sederberg, Per B. .

JOURNAL OF VISION, 2011, 11 (10)

[24] Dissociable Reward and Timing Signals in Human Midbrain and Ventral Striatum [J].

Klein-Fluegge, Miriam C. ;

Hunt, Laurence T. ;

Bach, Dominik R. ;

Dolan, Raymond J. ;

Behrens, Timothy E. J. .

NEURON, 2011, 72 (04) :654-664

[25]

Koller D, 1999, IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, P1332

[26] Policy Adjustment in a Dynamic Economic Game [J].

Li, Jian ;

McClure, Samuel M. ;

King-Casas, Brooks ;

Montague, P. Read .

PLOS ONE, 2006, 1 (01)

[27] Signals in Human Striatum Are Appropriate for Policy Update Rather than Value Prediction [J].

Li, Jian ;

Daw, Nathaniel D. .

JOURNAL OF NEUROSCIENCE, 2011, 31 (14) :5504-5511

[28] Neural signature of fictive learning signals in a sequential investment task [J].

Lohrenz, Terry ;

McCabe, Kevin ;

Camerer, Colin F. ;

Montague, P. Read .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (22) :9493-9498

[29] Temporal prediction errors in a passive learning task activate human striatum [J].

McClure, SM ;

Berns, GS ;

Montague, PR .

NEURON, 2003, 38 (02) :339-346

[30] A framework for mesencephalic dopamine systems based on predictive Hebbian learning [J].

Montague, PR ;

Dayan, P ;

Sejnowski, TJ .

JOURNAL OF NEUROSCIENCE, 1996, 16 (05) :1936-1947

← 1 2 3 4 5 →