Belief state representation in the dopamine system

被引:54
作者
Babayan, Benedicte M. [1 ,2 ]
Uchida, Naoshige [1 ]
Gershman, Samuel. J. [2 ]
机构
[1] Harvard Univ, Ctr Brain Sci, Dept Mol & Cellular Biol, 16 Divin Ave, Cambridge, MA 02138 USA
[2] Harvard Univ, Ctr Brain Sci, Dept Psychol, 52 Oxford St, Cambridge, MA 02138 USA
基金
美国国家卫生研究院;
关键词
REWARD PREDICTION ERROR; BAYESIAN MODEL SELECTION; CHOLINERGIC INTERNEURONS; ORBITOFRONTAL CORTEX; RHESUS-MONKEYS; NEURONS; HIPPOCAMPUS; SIGNALS; RELEASE; RAT;
D O I
10.1038/s41467-018-04397-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Learning to predict future outcomes is critical for driving appropriate behaviors. Reinforcement learning (RL) models have successfully accounted for such learning, relying on reward prediction errors (RPEs) signaled by midbrain dopamine neurons. It has been proposed that when sensory data provide only ambiguous information about which state an animal is in, it can predict reward based on a set of probabilities assigned to hypothetical states (called the belief state). Here we examine how dopamine RPEs and subsequent learning are regulated under state uncertainty. Mice are first trained in a task with two potential states defined by different reward amounts. During testing, intermediate-sized rewards are given in rare trials. Dopamine activity is a non-monotonic function of reward size, consistent with RL models operating on belief states. Furthermore, the magnitude of dopamine responses quantitatively predicts changes in behavior. These results establish the critical role of state inference in RL.
引用
收藏
页数:10
相关论文
共 54 条
[1]   Structural learning and the hippocampus [J].
Aggleton, John P. ;
Sanderson, David J. ;
Pearce, John M. .
HIPPOCAMPUS, 2007, 17 (09) :723-734
[2]   Optimization of a GCaMP Calcium Indicator for Neural Activity Imaging [J].
Akerboom, Jasper ;
Chen, Tsai-Wen ;
Wardill, Trevor J. ;
Tian, Lin ;
Marvin, Jonathan S. ;
Mutlu, Sevinc ;
Calderon, Nicole Carreras ;
Esposti, Federico ;
Borghuis, Bart G. ;
Sun, Xiaonan Richard ;
Gordus, Andrew ;
Orger, Michael B. ;
Portugues, Ruben ;
Engert, Florian ;
Macklin, John J. ;
Filosa, Alessandro ;
Aggarwal, Aman ;
Kerr, Rex A. ;
Takagi, Ryousuke ;
Kracun, Sebastian ;
Shigetomi, Eiji ;
Khakh, Baljit S. ;
Baier, Herwig ;
Lagnado, Leon ;
Wang, Samuel S. -H. ;
Bargmann, Cornelia I. ;
Kimmel, Bruce E. ;
Jayaraman, Vivek ;
Svoboda, Karel ;
Kim, Douglas S. ;
Schreiter, Eric R. ;
Looger, Loren L. .
JOURNAL OF NEUROSCIENCE, 2012, 32 (40) :13819-13840
[3]  
[Anonymous], 2015, Reinforcement Learning: An Introduction
[4]  
[Anonymous], 2020, Reinforcement Learning, An Introduction
[5]   Characterization of a mouse strain expressing Cre recombinase from the 3′ untranslated region of the dopamine transporter locus [J].
Baeckman, Cristina M. ;
Malik, Nasir ;
Zhang, YaJun ;
Shan, Lufei ;
Grinberg, Alex ;
Hoffer, Barry J. ;
Westphal, Heiner ;
Tomac, Andreas C. .
GENESIS, 2006, 44 (08) :383-390
[6]   Midbrain dopamine neurons encode a quantitative reward prediction error signal [J].
Bayer, HM ;
Glimcher, PW .
NEURON, 2005, 47 (01) :129-141
[7]   A Pallidus-Habenula-Dopamine Pathway Signals Inferred Stimulus Values [J].
Bromberg-Martin, Ethan S. ;
Matsumoto, Masayuki ;
Hong, Simon ;
Hikosaka, Okihide .
JOURNAL OF NEUROPHYSIOLOGY, 2010, 104 (02) :1068-1076
[8]   Selective Activation of Cholinergic Interneurons Enhances Accumbal Phasic Dopamine Release: Setting the Tone for Reward Processing [J].
Cachope, Roger ;
Mateo, Yolanda ;
Mathur, Brian N. ;
Irving, James ;
Wang, Hui-Ling ;
Morales, Marisela ;
Lovinger, David M. ;
Cheer, Joseph F. .
CELL REPORTS, 2012, 2 (01) :33-41
[9]   Ultrasensitive fluorescent proteins for imaging neuronal activity [J].
Chen, Tsai-Wen ;
Wardill, Trevor J. ;
Sun, Yi ;
Pulver, Stefan R. ;
Renninger, Sabine L. ;
Baohan, Amy ;
Schreiter, Eric R. ;
Kerr, Rex A. ;
Orger, Michael B. ;
Jayaraman, Vivek ;
Looger, Loren L. ;
Svoboda, Karel ;
Kim, Douglas S. .
NATURE, 2013, 499 (7458) :295-+
[10]   Neuron-type-specific signals for reward and punishment in the ventral tegmental area [J].
Cohen, Jeremiah Y. ;
Haesler, Sebastian ;
Vong, Linh ;
Lowell, Bradford B. ;
Uchida, Naoshige .
NATURE, 2012, 482 (7383) :85-U109