The timing of action determines reward prediction signals in identified midbrain dopamine neurons

被引:119
作者
Coddington, Luke T. [1 ]
Dudman, Joshua T. [1 ]
机构
[1] Howard Hughes Med Inst, Janelia Res Campus, Ashburn, VA 20147 USA
关键词
RESPONSES; INITIATION; ACQUISITION; PERFORMANCE; EXTINCTION; MECHANISM; CIRCUITRY;
D O I
10.1038/s41593-018-0245-7
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Animals adapt their behavior in response to informative sensory cues using multiple brain circuits. The activity of midbrain dopaminergic neurons is thought to convey a critical teaching signal: reward-prediction error. Although reward-prediction error signals are thought to be essential to learning, little is known about the dynamic changes in the activity of midbrain dopaminergic neurons as animals learn about novel sensory cues and appetitive rewards. Here we describe a large dataset of cell-attached recordings of identified dopaminergic neurons as naive mice learned a novel cue-reward association. During learning midbrain dopaminergic neuron activity results from the summation of sensory cue-related and movement initiation-related response components. These components are both a function of reward expectation yet they are dissociable. Learning produces an increasingly precise coordination of action initiation following sensory cues that results in apparent reward-prediction error correlates. Our data thus provide new insights into the circuit mechanisms that underlie a critical computation in a highly conserved learning circuit.
引用
收藏
页码:1563 / +
页数:15
相关论文
共 53 条
  • [1] [Anonymous], 2015, Reinforcement Learning: An Introduction
  • [2] [Anonymous], ELIFE
  • [3] Beyond reward prediction errors: the role of dopamine in movement kinematics
    Barter, Joseph W.
    Li, Suellen
    Lu, Dongye
    Bartholomew, Ryan A.
    Rossi, Mark A.
    Shoemaker, Charles T.
    Salas-Meza, Daniel
    Gaidis, Erin
    Yin, Henry H.
    [J]. FRONTIERS IN INTEGRATIVE NEUROSCIENCE, 2015, 9
  • [4] Neurons for hunger and thirst transmit a negative-valence teaching signal
    Betley, J. Nicholas
    Xu, Shengjin
    Cao, Zhen Fang Huang
    Gong, Rong
    Magnus, Christopher J.
    Yu, Yang
    Sternson, Scott M.
    [J]. NATURE, 2015, 521 (7551) : 180 - +
  • [5] Multiplicity of control in the basal ganglia: computational roles of striatal subregions
    Bornstein, Aaron M.
    Daw, Nathaniel D.
    [J]. CURRENT OPINION IN NEUROBIOLOGY, 2011, 21 (03) : 374 - 380
  • [6] Millisecond-timescale, genetically targeted optical control of neural activity
    Boyden, ES
    Zhang, F
    Bamberg, E
    Nagel, G
    Deisseroth, K
    [J]. NATURE NEUROSCIENCE, 2005, 8 (09) : 1263 - 1268
  • [7] Brief optogenetic inhibition of dopamine neurons mimics endogenous negative reward prediction errors
    Chang, Chun Yun
    Esber, Guillem R.
    Marrero-Garcia, Yasmin
    Yau, Hau-Jie
    Bonci, Antonello
    Schoenbaum, Geoffrey
    [J]. NATURE NEUROSCIENCE, 2016, 19 (01) : 111 - +
  • [8] Neuron-type-specific signals for reward and punishment in the ventral tegmental area
    Cohen, Jeremiah Y.
    Haesler, Sebastian
    Vong, Linh
    Lowell, Bradford B.
    Uchida, Naoshige
    [J]. NATURE, 2012, 482 (7383) : 85 - U109
  • [9] Dynamic mesolimbic dopamine signaling during action sequence learning and expectation violation
    Collins, Anne L.
    Greenfield, Venuz Y.
    Bye, Jeffrey K.
    Linker, Kay E.
    Wang, Alice S.
    Wassum, Kate M.
    [J]. SCIENTIFIC REPORTS, 2016, 6
  • [10] Dopamine neuron activity before action initiation gates and invigorates future movements
    da Silva, Joaquim Alves
    Tecuapetla, Fatuel
    Paixao, Vitor
    Costa, Rui M.
    [J]. NATURE, 2018, 554 (7691) : 244 - +