Dopamine transients do not act as model-free prediction errors during associative learning

被引:39
|
作者
Sharpe, Melissa J. [1 ,2 ,3 ,4 ]
Batchelor, Hannah M. [1 ]
Mueller, Lauren E. [1 ]
Chang, Chun Yun [1 ]
Maes, Etienne J. P. [1 ]
Niv, Yael [2 ,5 ]
Schoenbaum, Geoffrey [1 ,6 ,7 ,8 ]
机构
[1] NIDA, Intramural Res Program, Baltimore, MD 21224 USA
[2] Princeton Univ, Princeton Neurosci Inst, Princeton, NJ 08544 USA
[3] UNSW, Sch Psychol, Sydney, NSW, Australia
[4] Univ Calif Los Angeles, Dept Psychol, Los Angeles, CA 90095 USA
[5] Princeton Univ, Psychol Dept, Princeton, NJ 08544 USA
[6] Univ Maryland, Sch Med, Dept Anat & Neurobiol, Baltimore, MD 21201 USA
[7] Univ Maryland, Sch Med, Dept Psychiat, Baltimore, MD 21201 USA
[8] Johns Hopkins Univ, Solomon H Snyder Dept Neurosci, Baltimore, MD 21287 USA
关键词
ORBITOFRONTAL CORTEX; REINFORCEMENT; ACQUISITION; BEHAVIOR; RELEASE; SUFFICIENT; PSYCHOSIS; NEURONS;
D O I
10.1038/s41467-019-13953-1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Dopamine neurons are proposed to signal the reward prediction error in model-free reinforcement learning algorithms. This term represents the unpredicted or 'excess' value of the rewarding event, value that is then added to the intrinsic value of any antecedent cues, contexts or events. To support this proposal, proponents cite evidence that artificially-induced dopamine transients cause lasting changes in behavior. Yet these studies do not generally assess learning under conditions where an endogenous prediction error would occur. Here, to address this, we conducted three experiments where we optogenetically activated dopamine neurons while rats were learning associative relationships, both with and without reward. In each experiment, the antecedent cues failed to acquire value and instead entered into associations with the later events, whether valueless cues or valued rewards. These results show that in learning situations appropriate for the appearance of a prediction error, dopamine transients support associative, rather than model-free, learning.
引用
收藏
页数:10
相关论文
共 39 条
  • [21] Dopamine precursor depletion improves punishment prediction during reversal learning in healthy females but not males
    Robinson, Oliver J.
    Standing, Holly R.
    DeVito, Elise E.
    Cools, Roshan
    Sahakian, Barbara J.
    PSYCHOPHARMACOLOGY, 2010, 211 (02) : 187 - 195
  • [22] The role of training variability for model-based and model-free learning of an arbitrary visuomotor mapping
    Velazquez-Vargas, Carlos A.
    Daw, Nathaniel D.
    Taylor, Jordan A.
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (09)
  • [23] Cholinergic and dopaminergic effects on prediction error and uncertainty responses during sensory associative learning
    Iglesias, Sandra
    Kasper, Lars
    Harrison, Samuel J.
    Manka, Robert
    Mathys, Christoph
    Stephan, Klaas E.
    NEUROIMAGE, 2021, 226
  • [24] Prediction error drives associative learning and conditioned behavior in a spiking model of Drosophila larva
    Juergensen, Anna-Maria
    Sakagiannis, Panagiotis
    Schleyer, Michael
    Gerber, Bertram
    Nawrot, Martin Paul
    ISCIENCE, 2024, 27 (01)
  • [25] Effects of serotonin and dopamine depletion on neural prediction computations during social learning
    Frey, Anna-Lena
    McCabe, Ciara
    NEUROPSYCHOPHARMACOLOGY, 2020, 45 (09) : 1431 - 1437
  • [26] The Rodent Lateral Orbitofrontal Cortex as an Arbitrator Selecting Between Model-Based and Model-Free Learning Systems
    Panayi, Marios C.
    Khamassi, Mehdi
    Killcross, Simon
    BEHAVIORAL NEUROSCIENCE, 2021, 135 (02) : 226 - 244
  • [27] Impairment of arbitration between model-based and model-free reinforcement learning in obsessive-compulsive disorder
    Ruan, Zhongqiang
    Seger, Carol A.
    Yang, Qiong
    Kim, Dongjae
    Lee, Sang Wan
    Chen, Qi
    Peng, Ziwen
    FRONTIERS IN PSYCHIATRY, 2023, 14
  • [28] Model-free learning on robot kinematic chains using a nested multi-agent topology
    Karigiannis, John N.
    Tzafestas, Costas S.
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2016, 28 (06) : 913 - 954
  • [29] Machine Learning in Model-free Mechanical Property Imaging: Novel Integration of Physics With the Constrained Optimization Process
    Hoerig, Cameron
    Ghaboussi, Jamshid
    Wang, Yiliang
    Insana, Michael F.
    FRONTIERS IN PHYSICS, 2021, 9
  • [30] Processing speed enhances model-based over model-free reinforcement learning in the presence of high working memory functioning
    Schad, Daniel J.
    Juenger, Elisabeth
    Sebold, Miriam
    Garbusow, Maria
    Bernhardt, Nadine
    Javadi, Amir-Homayoun
    Zimmermann, Ulrich S.
    Smolka, Michael N.
    Heinz, Andreas
    Rapp, Michael A.
    Huys, Quentin J. M.
    FRONTIERS IN PSYCHOLOGY, 2014, 5