Pure correlates of exploration and exploitation in the human brain

被引:57
作者
Blanchard, Tommy C. [1 ,2 ]
Gershman, Samuel J. [1 ,2 ]
机构
[1] Harvard Univ, Dept Psychol, 52 Oxford St,Room 295-05, Cambridge, MA 02138 USA
[2] Harvard Univ, Ctr Brain Sci, 52 Oxford St,Room 295-05, Cambridge, MA 02138 USA
关键词
reinforcement learning; fMRI; decision making; ANTERIOR CINGULATE CORTEX; NEURAL MECHANISMS; INDIVIDUAL-DIFFERENCES; FRONTOPOLAR CORTEX; PREFRONTAL CORTEX; EXPECTED VALUE; UNCERTAINTY; INFORMATION; DECISIONS; REPRESENTATION;
D O I
10.3758/s13415-017-0556-2
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
Balancing exploration and exploitation is a fundamental problem in reinforcement learning. Previous neuroimaging studies of the exploration-exploitation dilemma could not completely disentangle these two processes, making it difficult to unambiguously identify their neural signatures. We overcome this problem using a task in which subjects can either observe (pure exploration) or bet (pure exploitation). Insula and dorsal anterior cingulate cortex showed significantly greater activity on observe trials compared to bet trials, suggesting that these regions play a role in driving exploration. A model-based analysis of task performance suggested that subjects chose to observe until a critical evidence threshold was reached. We observed a neural signature of this evidence accumulation process in the ventromedial prefrontal cortex. These findings support theories positing an important role for anterior cingulate cortex in exploration, while also providing a new perspective on the roles of insula and ventromedial prefrontal cortex.
引用
收藏
页码:117 / 126
页数:10
相关论文
共 46 条
[1]   Modulation of feedback related activity in the rostral anterior cingulate cortex during trial and error exploration [J].
Amiez, Celine ;
Sallet, Jerome ;
Procyk, Emmanuel ;
Petrides, Michael .
NEUROIMAGE, 2012, 63 (03) :1078-1090
[2]  
[Anonymous], 2017, RSTAN R INTERFACE ST
[3]   Rostrolateral Prefrontal Cortex and Individual Differences in Uncertainty-Driven Exploration [J].
Badre, David ;
Doll, Bradley B. ;
Long, Nicole M. ;
Frank, Michael J. .
NEURON, 2012, 73 (03) :595-607
[4]   The valuation system: A coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value [J].
Bartra, Oscar ;
McGuire, Joseph T. ;
Kable, Joseph W. .
NEUROIMAGE, 2013, 76 (01) :412-427
[5]   Transcranial Stimulation over Frontopolar Cortex Elucidates the Choice Attributes and Neural Mechanisms Used to Resolve Exploration-Exploitation Trade-Offs [J].
Beharelle, Anjali Raja ;
Polania, Rafael ;
Hare, Todd A. ;
Ruff, Christian C. .
JOURNAL OF NEUROSCIENCE, 2015, 35 (43) :14544-14556
[6]   Neurons in Dorsal Anterior Cingulate Cortex Signal Postdecisional Variables in a Foraging Task [J].
Blanchard, Tommy C. ;
Hayden, Benjamin Y. .
JOURNAL OF NEUROSCIENCE, 2014, 34 (02) :646-655
[7]   Ventromedial Prefrontal and Anterior Cingulate Cortex Adopt Choice and Default Reference Frames during Sequential Multi-Alternative Choice [J].
Boorman, Erie D. ;
Rushworth, Matthew F. ;
Behrens, Tim E. .
JOURNAL OF NEUROSCIENCE, 2013, 33 (06) :2242-2253
[8]   How Green Is the Grass on the Other Side? Frontopolar Cortex and the Evidence in Favor of Alternative Courses of Action [J].
Boorman, Erie D. ;
Behrens, Timothy E. J. ;
Woolrich, Mark W. ;
Rushworth, Matthew F. S. .
NEURON, 2009, 62 (05) :733-743
[9]   A MATHEMATICAL MODEL FOR SIMPLE LEARNING [J].
BUSH, RR ;
MOSTELLER, F .
PSYCHOLOGICAL REVIEW, 1951, 58 (05) :313-323
[10]   A Probability Distribution over Latent Causes, in the Orbitofrontal Cortex [J].
Chan, Stephanie C. Y. ;
Niv, Yael ;
Norman, Kenneth A. .
JOURNAL OF NEUROSCIENCE, 2016, 36 (30) :7817-7828