The timing of action determines reward prediction signals in identified midbrain dopamine neurons

被引:122
作者
Coddington, Luke T. [1 ]
Dudman, Joshua T. [1 ]
机构
[1] Howard Hughes Med Inst, Janelia Res Campus, Ashburn, VA 20147 USA
关键词
RESPONSES; INITIATION; ACQUISITION; PERFORMANCE; EXTINCTION; MECHANISM; CIRCUITRY;
D O I
10.1038/s41593-018-0245-7
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Animals adapt their behavior in response to informative sensory cues using multiple brain circuits. The activity of midbrain dopaminergic neurons is thought to convey a critical teaching signal: reward-prediction error. Although reward-prediction error signals are thought to be essential to learning, little is known about the dynamic changes in the activity of midbrain dopaminergic neurons as animals learn about novel sensory cues and appetitive rewards. Here we describe a large dataset of cell-attached recordings of identified dopaminergic neurons as naive mice learned a novel cue-reward association. During learning midbrain dopaminergic neuron activity results from the summation of sensory cue-related and movement initiation-related response components. These components are both a function of reward expectation yet they are dissociable. Learning produces an increasingly precise coordination of action initiation following sensory cues that results in apparent reward-prediction error correlates. Our data thus provide new insights into the circuit mechanisms that underlie a critical computation in a highly conserved learning circuit.
引用
收藏
页码:1563 / +
页数:15
相关论文
共 53 条
[1]  
[Anonymous], 2015, Reinforcement Learning: An Introduction
[2]  
[Anonymous], ELIFE
[3]   Beyond reward prediction errors: the role of dopamine in movement kinematics [J].
Barter, Joseph W. ;
Li, Suellen ;
Lu, Dongye ;
Bartholomew, Ryan A. ;
Rossi, Mark A. ;
Shoemaker, Charles T. ;
Salas-Meza, Daniel ;
Gaidis, Erin ;
Yin, Henry H. .
FRONTIERS IN INTEGRATIVE NEUROSCIENCE, 2015, 9
[4]   Neurons for hunger and thirst transmit a negative-valence teaching signal [J].
Betley, J. Nicholas ;
Xu, Shengjin ;
Cao, Zhen Fang Huang ;
Gong, Rong ;
Magnus, Christopher J. ;
Yu, Yang ;
Sternson, Scott M. .
NATURE, 2015, 521 (7551) :180-+
[5]   Multiplicity of control in the basal ganglia: computational roles of striatal subregions [J].
Bornstein, Aaron M. ;
Daw, Nathaniel D. .
CURRENT OPINION IN NEUROBIOLOGY, 2011, 21 (03) :374-380
[6]   Millisecond-timescale, genetically targeted optical control of neural activity [J].
Boyden, ES ;
Zhang, F ;
Bamberg, E ;
Nagel, G ;
Deisseroth, K .
NATURE NEUROSCIENCE, 2005, 8 (09) :1263-1268
[7]   Brief optogenetic inhibition of dopamine neurons mimics endogenous negative reward prediction errors [J].
Chang, Chun Yun ;
Esber, Guillem R. ;
Marrero-Garcia, Yasmin ;
Yau, Hau-Jie ;
Bonci, Antonello ;
Schoenbaum, Geoffrey .
NATURE NEUROSCIENCE, 2016, 19 (01) :111-+
[8]   Neuron-type-specific signals for reward and punishment in the ventral tegmental area [J].
Cohen, Jeremiah Y. ;
Haesler, Sebastian ;
Vong, Linh ;
Lowell, Bradford B. ;
Uchida, Naoshige .
NATURE, 2012, 482 (7383) :85-U109
[9]   Dynamic mesolimbic dopamine signaling during action sequence learning and expectation violation [J].
Collins, Anne L. ;
Greenfield, Venuz Y. ;
Bye, Jeffrey K. ;
Linker, Kay E. ;
Wang, Alice S. ;
Wassum, Kate M. .
SCIENTIFIC REPORTS, 2016, 6
[10]   Dopamine neuron activity before action initiation gates and invigorates future movements [J].
da Silva, Joaquim Alves ;
Tecuapetla, Fatuel ;
Paixao, Vitor ;
Costa, Rui M. .
NATURE, 2018, 554 (7691) :244-+