Optogenetic Mimicry of the Transient Activation of Dopamine Neurons by Natural Reward Is Sufficient for Operant Reinforcement

被引:79
作者
Kim, Kyung Man [1 ]
Baratta, Michael V. [2 ]
Yang, Aimei [2 ]
Lee, Doheon [1 ]
Boyden, Edward S. [2 ]
Fiorillo, Christopher D. [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Bio & Brain Engn, Taejon 305701, South Korea
[2] MIT, Media Lab, McGovern Inst, Cambridge, MA 02139 USA
来源
PLOS ONE | 2012年 / 7卷 / 04期
基金
美国国家科学基金会; 美国国家卫生研究院; 新加坡国家研究基金会;
关键词
INTRACRANIAL SELF-STIMULATION; PROGRESSIVE-RATIO SCHEDULE; NUCLEUS-ACCUMBENS; INCENTIVE SALIENCE; MIDBRAIN DOPAMINE; PREDICTION ERROR; D-AMPHETAMINE; BEHAVIOR; FOOD; GLUTAMATE;
D O I
10.1371/journal.pone.0033612
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Activation of dopamine receptors in forebrain regions, for minutes or longer, is known to be sufficient for positive reinforcement of stimuli and actions. However, the firing rate of dopamine neurons is increased for only about 200 milliseconds following natural reward events that are better than expected, a response which has been described as a "reward prediction error'' (RPE). Although RPE drives reinforcement learning (RL) in computational models, it has not been possible to directly test whether the transient dopamine signal actually drives RL. Here we have performed optical stimulation of genetically targeted ventral tegmental area (VTA) dopamine neurons expressing Channelrhodopsin-2 (ChR2) in mice. We mimicked the transient activation of dopamine neurons that occurs in response to natural reward by applying a light pulse of 200 ms in VTA. When a single light pulse followed each self-initiated nose poke, it was sufficient in itself to cause operant reinforcement. Furthermore, when optical stimulation was delivered in separate sessions according to a predetermined pattern, it increased locomotion and contralateral rotations, behaviors that are known to result from activation of dopamine neurons. All three of the optically induced operant and locomotor behaviors were tightly correlated with the number of VTA dopamine neurons that expressed ChR2, providing additional evidence that the behavioral responses were caused by activation of dopamine neurons. These results provide strong evidence that the transient activation of dopamine neurons provides a functional reward signal that drives learning, in support of RL theories of dopamine function.
引用
收藏
页数:8
相关论文
共 50 条
[21]   Two types of dopamine neuron distinctly convey positive and negative motivational signals [J].
Matsumoto, Masayuki ;
Hikosaka, Okihide .
NATURE, 2009, 459 (7248) :837-U4
[22]   Maternal separation alters ICSS responding in adult male and female rats, but morphine and naltrexone have little affect on that behavior [J].
Michaels, Clifford C. ;
Easterling, Keith W. ;
Holtzman, Stephen G. .
BRAIN RESEARCH BULLETIN, 2007, 73 (4-6) :310-318
[23]   A framework for mesencephalic dopamine systems based on predictive Hebbian learning [J].
Montague, PR ;
Dayan, P ;
Sejnowski, TJ .
JOURNAL OF NEUROSCIENCE, 1996, 16 (05) :1936-1947
[24]  
Morikawa H, 2003, J NEUROSCI, V23, P149
[25]   Coincident but distinct messages of midbrain dopamine and striatal tonically active neurons [J].
Morris, G ;
Arkadir, D ;
Nevet, A ;
Vaadia, E ;
Bergman, H .
NEURON, 2004, 43 (01) :133-143
[26]   DIFFERENTIAL EFFECT OF SELF-STIMULATION ON DOPAMINE RELEASE AND METABOLISM IN THE RAT MEDIAL FRONTAL-CORTEX, NUCLEUS-ACCUMBENS AND STRIATUM STUDIED BY INVIVO MICRODIALYSIS [J].
NAKAHARA, D ;
FUCHIKAMI, K ;
OZAKI, N ;
IWASAKI, T ;
NAGATSU, T .
BRAIN RESEARCH, 1992, 574 (1-2) :164-170
[27]   Dopamine neurons can represent context-dependent prediction error [J].
Nakahara, H ;
Itoh, H ;
Kawagoe, R ;
Takikawa, Y ;
Hikosaka, O .
NEURON, 2004, 41 (02) :269-280
[28]   Temporally Extended Dopamine Responses to Perceptually Demanding Reward-Predictive Stimuli [J].
Nomoto, Kensaku ;
Schultz, Wolfram ;
Watanabe, Takeo ;
Sakagami, Masamichi .
JOURNAL OF NEUROSCIENCE, 2010, 30 (32) :10692-10702
[29]   POSITIVE REINFORCEMENT PRODUCED BY ELECTRICAL STIMULATION OF SEPTAL AREA AND OTHER REGIONS OF RAT BRAIN [J].
OLDS, J ;
MILNER, P .
JOURNAL OF COMPARATIVE AND PHYSIOLOGICAL PSYCHOLOGY, 1954, 47 (06) :419-427
[30]   Dopamine cells respond to predicted events during classical conditioning: Evidence for eligibility traces in the reward-learning network [J].
Pan, WX ;
Schmidt, R ;
Wickens, JR ;
Hyland, BI .
JOURNAL OF NEUROSCIENCE, 2005, 25 (26) :6235-6242