Dopamine transients do not act as model-free prediction errors during associative learning

被引:39
|
作者
Sharpe, Melissa J. [1 ,2 ,3 ,4 ]
Batchelor, Hannah M. [1 ]
Mueller, Lauren E. [1 ]
Chang, Chun Yun [1 ]
Maes, Etienne J. P. [1 ]
Niv, Yael [2 ,5 ]
Schoenbaum, Geoffrey [1 ,6 ,7 ,8 ]
机构
[1] NIDA, Intramural Res Program, Baltimore, MD 21224 USA
[2] Princeton Univ, Princeton Neurosci Inst, Princeton, NJ 08544 USA
[3] UNSW, Sch Psychol, Sydney, NSW, Australia
[4] Univ Calif Los Angeles, Dept Psychol, Los Angeles, CA 90095 USA
[5] Princeton Univ, Psychol Dept, Princeton, NJ 08544 USA
[6] Univ Maryland, Sch Med, Dept Anat & Neurobiol, Baltimore, MD 21201 USA
[7] Univ Maryland, Sch Med, Dept Psychiat, Baltimore, MD 21201 USA
[8] Johns Hopkins Univ, Solomon H Snyder Dept Neurosci, Baltimore, MD 21287 USA
关键词
ORBITOFRONTAL CORTEX; REINFORCEMENT; ACQUISITION; BEHAVIOR; RELEASE; SUFFICIENT; PSYCHOSIS; NEURONS;
D O I
10.1038/s41467-019-13953-1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Dopamine neurons are proposed to signal the reward prediction error in model-free reinforcement learning algorithms. This term represents the unpredicted or 'excess' value of the rewarding event, value that is then added to the intrinsic value of any antecedent cues, contexts or events. To support this proposal, proponents cite evidence that artificially-induced dopamine transients cause lasting changes in behavior. Yet these studies do not generally assess learning under conditions where an endogenous prediction error would occur. Here, to address this, we conducted three experiments where we optogenetically activated dopamine neurons while rats were learning associative relationships, both with and without reward. In each experiment, the antecedent cues failed to acquire value and instead entered into associations with the later events, whether valueless cues or valued rewards. These results show that in learning situations appropriate for the appearance of a prediction error, dopamine transients support associative, rather than model-free, learning.
引用
收藏
页数:10
相关论文
共 39 条
  • [1] Dopamine transients encode reward prediction errors independent of learning rates
    Mah, Andrew
    Golden, Carla E. M.
    Constantinople, Christine M.
    CELL REPORTS, 2024, 43 (10):
  • [2] A causal link between prediction errors, dopamine neurons and learning
    Steinberg, Elizabeth E.
    Keiflin, Ronald
    Boivin, Josiah R.
    Witten, Ilana B.
    Deisseroth, Karl
    Janak, Patricia H.
    NATURE NEUROSCIENCE, 2013, 16 (07) : 966 - U248
  • [3] The curious case of dopaminergic prediction errors and learning associative information beyond value
    Kahnt, Thorsten
    Schoenbaum, Geoffrey
    NATURE REVIEWS NEUROSCIENCE, 2025, 26 (03) : 169 - 178
  • [4] Dopamine, prediction error and associative learning: A model-based account
    Smith, Andrew
    Li, Ming
    Becker, Sue
    Kapur, Shitij
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2006, 17 (01) : 61 - 84
  • [5] The "Proactive" Model of Learning: Integrative Framework for Model-Free and Model-Based Reinforcement Learning Utilizing the Associative Learning-Based Proactive Brain Concept
    Zsuga, Judit
    Biro, Klara
    Papp, Csaba
    Tajti, Gabor
    Gesztelyi, Rudolf
    BEHAVIORAL NEUROSCIENCE, 2016, 130 (01) : 6 - 18
  • [6] Dopamine enhances model-free credit assignment through boosting of retrospective model-based inference
    Deserno, Lorenz
    Moran, Rani
    Michely, Jochen
    Lee, Ying
    Dayan, Peter
    Dolan, Raymond J.
    ELIFE, 2021, 10
  • [7] The Role of Dopamine in Associative Learning in Drosophila: An Updated Unified Model
    Adel, Mohamed
    Griffith, Leslie C.
    NEUROSCIENCE BULLETIN, 2021, 37 (06) : 831 - 852
  • [8] Stress reduces both model-based and model-free neural computations during flexible learning
    Cremer, Anna
    Kalbe, Felix
    Glaescher, Jan
    Schwabe, Lars
    NEUROIMAGE, 2021, 229
  • [9] Prospective contingency explains behavior and dopamine signals during associative learning
    Qian, Lechen
    Burrell, Mark
    Hennig, Jay A.
    Matias, Sara
    Murthy, Venkatesh N.
    Gershman, Samuel J.
    Uchida, Naoshige
    NATURE NEUROSCIENCE, 2025,
  • [10] Value-Driven Adaptations of Mesolimbic Dopamine Release Are Governed by Both Model-Based and Model-Free Mechanisms
    Robke, Rhiannon
    Arbab, Tara
    Smith, Rachel
    Willuhn, Ingo
    ENEURO, 2024, 11 (07)