Striatal activations signal prediction errors on confidence in the absence of external feedback

被引:55
作者
Daniel, Reka [1 ]
Pollmann, Stefan [1 ]
机构
[1] Univ Magdeburg, Dept Expt Psychol, D-39016 Magdeburg, Germany
关键词
Reinforcement learning; Feedback; Observational learning; Striatum; Nucleus accumbens; COGNITIVE FEEDBACK; REWARD; DOPAMINE; INFORMATION; UNCERTAINTY; NUCLEUS; CORTEX; CATEGORIZATION; NEUROBIOLOGY; PROBABILITY;
D O I
10.1016/j.neuroimage.2011.11.058
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Research on the neural bases of learning has mainly focused on reinforcement learning where the central role of the dopaminergic system is well established. However, in everyday life many decisions are not followed by feedback, in which case humans have been shown to code the most probable outcome into memory. We used functional magnetic resonance imaging (fMRI) to examine the neural basis of internally generated signals on correctness and decision confidence in the complete absence of feedback in a categorization task. During test trials after observational training activation in dopaminergic target regions was modulated by the correctness of the answer similarly as during feedback-based training. Moreover, activation in the nucleus accumbens and putamen was correlated with the prediction error on confidence as estimated by a reinforcement learning model. In this model subjective confidence ratings acquired after each trial served as outcome measure. Activation in the striatum therefore follows a similar pattern in response to prediction errors on confidence as it does during reinforcement learning in response to reward prediction errors, but with respect to internally generated signals based on knowledge of the structure of the environment. Furthermore, ventral striatal activation decreased with stimulus novelty, which might support the allocation of attention to unfamiliar stimuli. These results provide a parsimonious account for the neural bases of learning, indicating overlapping neural substrates of reinforcement learning and learning when outcome information has to be internally constructed. (C) 2011 Elsevier Inc. All rights reserved.
引用
收藏
页码:3457 / 3467
页数:11
相关论文
共 86 条
[1]   Prediction error as a linear function of reward probability is coded in human nucleus accumbens [J].
Abler, Birgit ;
Walter, Henrik ;
Erk, Susanne ;
Kammerer, Hannes ;
Spitzer, Manfred .
NEUROIMAGE, 2006, 31 (02) :790-795
[2]   Beautiful faces have variable reward value: fMRI and behavioral evidence [J].
Aharon, I ;
Etcoff, N ;
Ariely, D ;
Chabris, CF ;
O'Connor, E ;
Breiter, HC .
NEURON, 2001, 32 (03) :537-551
[3]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[4]   Realism in confidence judgements of performance based on implicit learning [J].
Allwood, CM ;
Granhag, PA ;
Johansson, H .
EUROPEAN JOURNAL OF COGNITIVE PSYCHOLOGY, 2000, 12 (02) :165-188
[5]  
[Anonymous], LNCS LNAI
[6]   Human midbrain sensitivity to cognitive feedback and uncertainty during classification learning [J].
Aron, AR ;
Shohamy, D ;
Clark, J ;
Myers, C ;
Gluck, MA ;
Poldrack, RA .
JOURNAL OF NEUROPHYSIOLOGY, 2004, 92 (02) :1144-1152
[7]   Human category learning [J].
Ashby, EG ;
Maddox, WT .
ANNUAL REVIEW OF PSYCHOLOGY, 2005, 56 :149-178
[8]  
Ashby F.G., 1992, Multidimensional models of perception and cognition, P1
[9]   DECISION RULES IN THE PERCEPTION AND CATEGORIZATION OF MULTIDIMENSIONAL STIMULI [J].
ASHBY, FG ;
GOTT, RE .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 1988, 14 (01) :33-53
[10]   A neuropsychological theory of multiple systems in category learning [J].
Ashby, FG ;
Alfonso-Reese, LA ;
Turken, AU ;
Waldron, EM .
PSYCHOLOGICAL REVIEW, 1998, 105 (03) :442-481