Pure correlates of exploration and exploitation in the human brain

被引:56
作者
Blanchard, Tommy C. [1 ,2 ]
Gershman, Samuel J. [1 ,2 ]
机构
[1] Harvard Univ, Dept Psychol, 52 Oxford St,Room 295-05, Cambridge, MA 02138 USA
[2] Harvard Univ, Ctr Brain Sci, 52 Oxford St,Room 295-05, Cambridge, MA 02138 USA
关键词
reinforcement learning; fMRI; decision making; ANTERIOR CINGULATE CORTEX; NEURAL MECHANISMS; INDIVIDUAL-DIFFERENCES; FRONTOPOLAR CORTEX; PREFRONTAL CORTEX; EXPECTED VALUE; UNCERTAINTY; INFORMATION; DECISIONS; REPRESENTATION;
D O I
10.3758/s13415-017-0556-2
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
Balancing exploration and exploitation is a fundamental problem in reinforcement learning. Previous neuroimaging studies of the exploration-exploitation dilemma could not completely disentangle these two processes, making it difficult to unambiguously identify their neural signatures. We overcome this problem using a task in which subjects can either observe (pure exploration) or bet (pure exploitation). Insula and dorsal anterior cingulate cortex showed significantly greater activity on observe trials compared to bet trials, suggesting that these regions play a role in driving exploration. A model-based analysis of task performance suggested that subjects chose to observe until a critical evidence threshold was reached. We observed a neural signature of this evidence accumulation process in the ventromedial prefrontal cortex. These findings support theories positing an important role for anterior cingulate cortex in exploration, while also providing a new perspective on the roles of insula and ventromedial prefrontal cortex.
引用
收藏
页码:117 / 126
页数:10
相关论文
共 46 条
  • [1] Modulation of feedback related activity in the rostral anterior cingulate cortex during trial and error exploration
    Amiez, Celine
    Sallet, Jerome
    Procyk, Emmanuel
    Petrides, Michael
    [J]. NEUROIMAGE, 2012, 63 (03) : 1078 - 1090
  • [2] [Anonymous], 2017, RSTAN R INTERFACE ST
  • [3] Rostrolateral Prefrontal Cortex and Individual Differences in Uncertainty-Driven Exploration
    Badre, David
    Doll, Bradley B.
    Long, Nicole M.
    Frank, Michael J.
    [J]. NEURON, 2012, 73 (03) : 595 - 607
  • [4] The valuation system: A coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value
    Bartra, Oscar
    McGuire, Joseph T.
    Kable, Joseph W.
    [J]. NEUROIMAGE, 2013, 76 (01) : 412 - 427
  • [5] Transcranial Stimulation over Frontopolar Cortex Elucidates the Choice Attributes and Neural Mechanisms Used to Resolve Exploration-Exploitation Trade-Offs
    Beharelle, Anjali Raja
    Polania, Rafael
    Hare, Todd A.
    Ruff, Christian C.
    [J]. JOURNAL OF NEUROSCIENCE, 2015, 35 (43) : 14544 - 14556
  • [6] Neurons in Dorsal Anterior Cingulate Cortex Signal Postdecisional Variables in a Foraging Task
    Blanchard, Tommy C.
    Hayden, Benjamin Y.
    [J]. JOURNAL OF NEUROSCIENCE, 2014, 34 (02) : 646 - 655
  • [7] Ventromedial Prefrontal and Anterior Cingulate Cortex Adopt Choice and Default Reference Frames during Sequential Multi-Alternative Choice
    Boorman, Erie D.
    Rushworth, Matthew F.
    Behrens, Tim E.
    [J]. JOURNAL OF NEUROSCIENCE, 2013, 33 (06) : 2242 - 2253
  • [8] How Green Is the Grass on the Other Side? Frontopolar Cortex and the Evidence in Favor of Alternative Courses of Action
    Boorman, Erie D.
    Behrens, Timothy E. J.
    Woolrich, Mark W.
    Rushworth, Matthew F. S.
    [J]. NEURON, 2009, 62 (05) : 733 - 743
  • [9] A MATHEMATICAL MODEL FOR SIMPLE LEARNING
    BUSH, RR
    MOSTELLER, F
    [J]. PSYCHOLOGICAL REVIEW, 1951, 58 (05) : 313 - 323
  • [10] A Probability Distribution over Latent Causes, in the Orbitofrontal Cortex
    Chan, Stephanie C. Y.
    Niv, Yael
    Norman, Kenneth A.
    [J]. JOURNAL OF NEUROSCIENCE, 2016, 36 (30) : 7817 - 7828