Temporal difference models and reward-related learning in the human brain

Cited by: 1060
Authors
O'Doherty, JP
Dayan, P
Friston, KJ
Critchley, H
Dolan, RJ
Affiliations
[1] Inst Neurol, Wellcome Dept Imaging Neurosci, London WC1N 3BG, England
[2] UCL, Gatsby Computat Neurosci Unit, London WC1N 3BG, England
Funding
Wellcome Trust (UK)
DOI
10.1016/S0896-6273(03)00169-7
Chinese Library Classification (CLC) number
Q189 [Neuroscience]
Subject classification code
071006
Abstract
Temporal difference learning has been proposed as a model for Pavlovian conditioning, in which an animal learns to predict delivery of reward following presentation of a conditioned stimulus (CS). A key component of this model is a prediction error signal, which, before learning, responds at the time of presentation of reward but, after learning, shifts its response to the time of onset of the CS. In order to test for regions manifesting this signal profile, subjects were scanned using event-related fMRI while undergoing appetitive conditioning with a pleasant taste reward. Regression analyses revealed that responses in ventral striatum and orbitofrontal cortex were significantly correlated with this error signal, suggesting that, during appetitive conditioning, computations described by temporal difference learning are expressed in the human brain.
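The shift in the prediction error described in the abstract can be illustrated with a minimal temporal difference simulation. This is a sketch under stated assumptions, not the model fitted in the paper: the function name `simulate_td`, the trial layout (CS at step 5, reward at step 15), the learning rate, the discount factor, and the choice of one learned weight per time step since CS onset are all illustrative choices made here.

```python
import numpy as np


def simulate_td(n_trials=200, n_steps=20, t_cs=5, t_reward=15,
                alpha=0.2, gamma=1.0):
    """Return the TD prediction error for every trial and time step.

    Assumptions (illustrative, not taken from the paper): each time step
    after CS onset has its own weight, the value before CS onset is fixed
    at zero because the CS arrives unpredictably, and a reward of 1.0 is
    delivered at t_reward on every trial.
    """
    w = np.zeros(n_steps)            # w[i]: value i steps after CS onset
    reward = np.zeros(n_steps)
    reward[t_reward] = 1.0
    deltas = np.zeros((n_trials, n_steps))

    def value(t):
        # Predicted future reward at time t; zero before the CS appears.
        return w[t - t_cs] if t >= t_cs else 0.0

    for trial in range(n_trials):
        for t in range(1, n_steps):
            # TD error: reward now plus discounted new prediction,
            # minus the prediction held one step earlier.
            delta = reward[t] + gamma * value(t) - value(t - 1)
            deltas[trial, t] = delta
            # Only predictions made after CS onset are learnable.
            if t - 1 >= t_cs:
                w[t - 1 - t_cs] += alpha * delta
    return deltas


if __name__ == "__main__":
    d = simulate_td()
    print("first trial: error at CS =", d[0, 5], " at reward =", d[0, 15])
    print("last trial:  error at CS =", round(d[-1, 5], 3),
          " at reward =", round(d[-1, 15], 3))
```

On the first trial the error in this sketch appears only at the time of reward; after repeated pairings it appears at CS onset and is close to zero at reward delivery, which is the signal profile the regression analyses tested for.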
Pages: 329-337
Page count: 9
Related papers
35 in total
  • [1] NEURONAL-ACTIVITY IN MONKEY STRIATUM RELATED TO THE EXPECTATION OF PREDICTABLE ENVIRONMENTAL EVENTS
    APICELLA, P
    SCARNATI, E
    LJUNGBERG, T
    SCHULTZ, W
    [J]. JOURNAL OF NEUROPHYSIOLOGY, 1992, 68 (03) : 945 - 960
  • [2] Predictability modulates human brain response to reward
    Berns, GS
    McClure, SM
    Pagnoni, G
    Montague, PR
    [J]. JOURNAL OF NEUROSCIENCE, 2001, 21 (08) : 2793 - 2798
  • [3] Functional imaging of neural responses to expectancy and experience of monetary gains and losses
    Breiter, HC
    Aharon, I
    Kahneman, D
    Dale, A
    Shizgal, P
    [J]. NEURON, 2001, 30 (02) : 619 - 639
  • [4] Learning and selective attention
    Dayan, Peter
    Kakade, Sham
    Montague, P. Read
    [J]. NATURE NEUROSCIENCE, 2000, 3 (11) : 1218 - 1223
  • [5] Tracking the hemodynamic responses to reward and punishment in the striatum
    Delgado, MR
    Nystrom, LE
    Fissell, C
    Noll, DC
    Fiez, JA
    [J]. JOURNAL OF NEUROPHYSIOLOGY, 2000, 84 (06) : 3072 - 3077
  • [6] DUVERNOY HM, 1999, HUMAN BRAIN
  • [7] Duvernoy HM, 1995, HUMAN BRAIN STEM CER
  • [8] Dissociable neural responses in human reward systems
    Elliott, R
    Friston, KJ
    Dolan, RJ
    [J]. JOURNAL OF NEUROSCIENCE, 2000, 20 (16) : 6159 - 6165
  • [9] Responses of human frontal cortex to surprising events are predicted by formal associative learning theory
    Fletcher, PC
    Anderson, JM
    Shanks, DR
    Honey, R
    Carpenter, TA
    Donovan, T
    Papadakis, N
    Bullmore, ET
    [J]. NATURE NEUROSCIENCE, 2001, 4 (10) : 1043 - 1048
  • [10] The representation of pleasant touch in the brain and its relationship with taste and olfactory areas
    Francis, S
    Rolls, ET
    Bowtell, R
    McGlone, F
    O'Doherty, J
    Browning, A
    Clare, S
    Smith, E
    [J]. NEUROREPORT, 1999, 10 (03) : 453 - 459