Striatal dopamine signals reflect perceived cue-action-outcome associations in mice

Cited by: 12
Authors
Bernklau, Tobias W. [1, 2]
Righetti, Beatrice [1]
Mehrke, Leonie S. [1]
Jacob, Simon N. [1]
Affiliations
[1] Tech Univ Munich, Dept Neurosurg, Klinikum Rechts Isar, Translat Neurotechnol Lab, Munich, Germany
[2] Ludwig Maximilians Univ Munchen, Grad Sch Syst Neurosci, Munich, Germany
Funding
European Research Council
Keywords
NEURONS ENCODE; REWARD; PREDICTION; HISTORY
DOI
10.1038/s41593-023-01567-2
Chinese Library Classification number
Q189 [Neuroscience]
Discipline classification code
071006
Abstract
Striatal dopamine drives associative learning by acting as a teaching signal. Much work has focused on simple learning paradigms, including Pavlovian and instrumental learning. However, higher cognition requires that animals generate internal concepts of their environment, where sensory stimuli, actions and outcomes become flexibly associated. Here, we performed fiber photometry dopamine measurements across the striatum of male mice as they learned cue-action-outcome associations based on implicit and changing task rules. Reinforcement learning models of the behavioral and dopamine data showed that rule changes lead to adjustments of learned cue-action-outcome associations. After rule changes, mice discarded learned associations and reset outcome expectations. Cue- and outcome-triggered dopamine signals became uncoupled and dependent on the adopted behavioral strategy. As mice learned the new association, coupling between cue- and outcome-triggered dopamine signals and task performance re-emerged. Our results suggest that dopaminergic reward prediction errors reflect an agent's perceived locus of control.
Pages: 747-757
Page count: 11