Pure correlates of exploration and exploitation in the human brain

被引:57
作者
Blanchard, Tommy C. [1 ,2 ]
Gershman, Samuel J. [1 ,2 ]
机构
[1] Harvard Univ, Dept Psychol, 52 Oxford St,Room 295-05, Cambridge, MA 02138 USA
[2] Harvard Univ, Ctr Brain Sci, 52 Oxford St,Room 295-05, Cambridge, MA 02138 USA
关键词
reinforcement learning; fMRI; decision making; ANTERIOR CINGULATE CORTEX; NEURAL MECHANISMS; INDIVIDUAL-DIFFERENCES; FRONTOPOLAR CORTEX; PREFRONTAL CORTEX; EXPECTED VALUE; UNCERTAINTY; INFORMATION; DECISIONS; REPRESENTATION;
D O I
10.3758/s13415-017-0556-2
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
Balancing exploration and exploitation is a fundamental problem in reinforcement learning. Previous neuroimaging studies of the exploration-exploitation dilemma could not completely disentangle these two processes, making it difficult to unambiguously identify their neural signatures. We overcome this problem using a task in which subjects can either observe (pure exploration) or bet (pure exploitation). Insula and dorsal anterior cingulate cortex showed significantly greater activity on observe trials compared to bet trials, suggesting that these regions play a role in driving exploration. A model-based analysis of task performance suggested that subjects chose to observe until a critical evidence threshold was reached. We observed a neural signature of this evidence accumulation process in the ventromedial prefrontal cortex. These findings support theories positing an important role for anterior cingulate cortex in exploration, while also providing a new perspective on the roles of insula and ventromedial prefrontal cortex.
引用
收藏
页码:117 / 126
页数:10
相关论文
共 46 条
[11]   Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration [J].
Cohen, Jonathan D. ;
McClure, Samuel M. ;
Yu, Angela J. .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2007, 362 (1481) :933-942
[12]   Activity in Inferior Parietal and Medial Prefrontal Cortex Signals the Accumulation of Evidence in a Probability Learning Task [J].
d'Acremont, Mathieu ;
Fornari, Eleonora ;
Bossaerts, Peter .
PLOS COMPUTATIONAL BIOLOGY, 2013, 9 (01)
[13]   Cortical substrates for exploratory decisions in humans [J].
Daw, Nathaniel D. ;
O'Doherty, John P. ;
Dayan, Peter ;
Seymour, Ben ;
Dolan, Raymond J. .
NATURE, 2006, 441 (7095) :876-879
[14]   Foundations of human reasoning in the prefrontal cortex [J].
Donoso, Mael ;
Collins, Anne G. E. ;
Koechlin, Etienne .
SCIENCE, 2014, 344 (6191) :1481-1486
[15]  
Erev I, 1998, AM ECON REV, V88, P848
[16]   Multiplexed Echo Planar Imaging for Sub-Second Whole Brain FMRI and Fast Diffusion Imaging [J].
Feinberg, David A. ;
Moeller, Steen ;
Smith, Stephen M. ;
Auerbach, Edward ;
Ramanna, Sudhir ;
Glasser, Matt F. ;
Miller, Karla L. ;
Ugurbil, Kamil ;
Yacoub, Essa .
PLOS ONE, 2010, 5 (12)
[17]   fMRI and EEG Predictors of Dynamic Decision Parameters during Human Reinforcement Learning [J].
Frank, Michael J. ;
Gagne, Chris ;
Nyhus, Erika ;
Masters, Sean ;
Wiecki, Thomas V. ;
Cavanagh, James F. ;
Badre, David .
JOURNAL OF NEUROSCIENCE, 2015, 35 (02) :485-494
[18]   Prefrontal and striatal dopaminergic genes predict individual differences in exploration and exploitation [J].
Frank, Michael J. ;
Doll, Bradley B. ;
Oas-Terpstra, Jen ;
Moreno, Francisco .
NATURE NEUROSCIENCE, 2009, 12 (08) :1062-U145
[19]   Novelty and Inductive Generalization in Human Reinforcement Learning [J].
Gershman, Samuel J. ;
Niv, Yael .
TOPICS IN COGNITIVE SCIENCE, 2015, 7 (03) :391-415
[20]   Neuronal basis of sequential foraging decisions in a patchy environment [J].
Hayden, Benjamin Y. ;
Pearson, John M. ;
Platt, Michael L. .
NATURE NEUROSCIENCE, 2011, 14 (07) :933-U165