Signed and unsigned reward prediction errors dynamically enhance learning and memory

被引:54
作者
Rouhani, Nina [1 ]
Niv, Yael [2 ,3 ]
机构
[1] CALTECH, Chen Neurosci Inst, Pasadena, CA 91125 USA
[2] Princeton Univ, Dept Psychol, Princeton, NJ 08544 USA
[3] Princeton Univ, Princeton Neurosci Inst, Princeton, NJ 08544 USA
关键词
LOCUS-COERULEUS; HIPPOCAMPUS; MODEL; CONSOLIDATION; INFORMATION; ACTIVATION; MOTIVATION; ATTENTION; NETWORKS; SURPRISE;
D O I
10.7554/eLife.61077
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Memory helps guide behavior, but which experiences from the past are prioritized? Classic models of learning posit that events associated with unpredictable outcomes as well as, paradoxically, predictable outcomes, deploy more attention and learning for those events. Here, we test reinforcement learning and subsequent memory for those events, and treat signed and unsigned reward prediction errors (RPEs), experienced at the reward-predictive cue or reward outcome, as drivers of these two seemingly contradictory signals. By fitting reinforcement learning models to behavior, we find that both RPEs contribute to learning by modulating a dynamically changing learning rate. We further characterize the effects of these RPE signals on memory and show that both signed and unsigned RPEs enhance memory, in line with midbrain dopamine and locus-coeruleus modulation of hippocampal plasticity, thereby reconciling separate findings in the literature.
引用
收藏
页数:28
相关论文
共 66 条
[1]   Reward-motivated learning: Mesolimbic activation precedes memory formation [J].
Adcock, R. Alison ;
Thangavel, Arul ;
Whitfield-Gabrieli, Susan ;
Knutson, Brian ;
Gabrieli, John D. E. .
NEURON, 2006, 50 (03) :507-517
[2]   Behavioral, Physiological, and Neural Signatures of Surprise during Naturalistic Sports Viewing [J].
Antony, James W. ;
Hartshorne, Thomas H. ;
Pomeroy, Ken ;
Gureckis, Todd M. ;
Hasson, Uri ;
McDougle, Samuel D. ;
Norman, Kenneth A. .
NEURON, 2021, 109 (02) :377-+
[3]  
Barto AG, 1995, MODELS INFORM PROCES, P1
[4]   Fitting Linear Mixed-Effects Models Using lme4 [J].
Bates, Douglas ;
Maechler, Martin ;
Bolker, Benjamin M. ;
Walker, Steven C. .
JOURNAL OF STATISTICAL SOFTWARE, 2015, 67 (01) :1-48
[5]   Uncertainty and predictiveness determine attention to cues during human associative learning [J].
Beesley, Tom ;
Nguyen, Katherine P. ;
Pearson, Daniel ;
Le Pelley, Mike E. .
QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2015, 68 (11) :2175-2199
[6]   Mnemonic prediction errors bias hippocampal states [J].
Bein, Oded ;
Duncan, Katherine ;
Davachi, Lila .
NATURE COMMUNICATIONS, 2020, 11 (01)
[7]   Retroactive and graded prioritization of memory by reward [J].
Braun, Erin Kendall ;
Wimmer, G. Elliott ;
Shohamy, Daphna .
NATURE COMMUNICATIONS, 2018, 9
[8]   Stan: A Probabilistic Programming Language [J].
Carpenter, Bob ;
Gelman, Andrew ;
Hoffman, Matthew D. ;
Lee, Daniel ;
Goodrich, Ben ;
Betancourt, Michael ;
Brubaker, Marcus A. ;
Guo, Jiqiang ;
Li, Peter ;
Riddell, Allen .
JOURNAL OF STATISTICAL SOFTWARE, 2017, 76 (01) :1-29
[9]   Echoes of Emotions Past: How Neuromodulators Determine What We Recollect [J].
Clewett, David ;
Murty, Vishnu P. .
ENEURO, 2019, 6 (02)
[10]   An Upside to Reward Sensitivity: The Hippocampus Supports Enhanced Reinforcement Learning in Adolescence [J].
Davidow, Juliet Y. ;
Foerde, Karin ;
Galvan, Adriana ;
Shohamy, Daphna .
NEURON, 2016, 92 (01) :93-99