Neural signatures of reinforcement learning correlate with strategy adoption during spatial navigation

被引:19
作者
Anggraini, Dian [1 ,4 ]
Glasauer, Stefan [2 ,3 ,4 ]
Wunderlich, Klaus [1 ,3 ,4 ]
机构
[1] Ludwig Maximilians Univ Munchen, Dept Psychol, D-80802 Munich, Germany
[2] Ludwig Maximilians Univ Munchen, Klinikum Grosshadern, Dept Neurol, Ctr Sensorimotor Res, D-81377 Munich, Germany
[3] Bernstein Ctr Computat Neurosci Munich, D-82152 Martinsried, Germany
[4] Ludwig Maximilians Univ Munchen, Grad Sch Syst Neurosci, D-82152 Martinsried, Germany
关键词
DECISION-MAKING; BASAL GANGLIA; HUMAN HIPPOCAMPUS; COGNITIVE MAPS; GRID CELLS; MEMORY; SYSTEMS; CORTEX; PLACE; REPRESENTATION;
D O I
10.1038/s41598-018-28241-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Human navigation is generally believed to rely on two types of strategy adoption, route-based and map-based strategies. Both types of navigation require making spatial decisions along the traversed way although formal computational and neural links between navigational strategies and mechanisms of value-based decision making have so far been underexplored in humans. Here we employed functional magnetic resonance imaging (fMRI) while subjects located different objects in a virtual environment. We then modelled their paths using reinforcement learning (RL) algorithms, which successfully explained decision behavior and its neural correlates. Our results show that subjects used a mixture of route and map-based navigation and their paths could be well explained by the model-free and model-based RL algorithms. Furthermore, the value signals of model-free choices during routebased navigation modulated the BOLD signals in the ventro-medial prefrontal cortex (vmPFC), whereas the BOLD signals in parahippocampal and hippocampal regions pertained to model-based value signals during map-based navigation. Our findings suggest that the brain might share computational mechanisms and neural substrates for navigation and value-based decisions such that model-free choice guides route-based navigation and model-based choice directs map-based navigation. These findings open new avenues for computational modelling of wayfinding by directing attention to value-based decision, differing from common direction and distances approaches.
引用
收藏
页数:14
相关论文
共 92 条
[1]   Topographical disorientation: a synthesis and taxonomy [J].
Aguirre, GK ;
D'Esposito, M .
BRAIN, 1999, 122 :1613-1628
[2]  
Aguirre GK, 1997, J NEUROSCI, V17, P2512
[3]  
[Anonymous], 2005, The cambridge handbook of visuospatial thinking, DOI [DOI 10.1017/CBO9780511610448.005, 10.1017/CBO9780511610448.005]
[4]  
[Anonymous], 1999, Wayfinding behavior: Cognitive mapping and other spatial processes
[5]   The Anterior Cingulate Gyrus Signals the Net Value of Others' Rewards [J].
Apps, Matthew A. J. ;
Ramnani, Narender .
JOURNAL OF NEUROSCIENCE, 2014, 34 (18) :6190-6200
[6]   Human and Rodent Homologies in Action Control: Corticostriatal Determinants of Goal-Directed and Habitual Action [J].
Balleine, Bernard W. ;
O'Doherty, John P. .
NEUROPSYCHOPHARMACOLOGY, 2010, 35 (01) :48-69
[7]   Scenes unseen: The parahippocampal cortex intrinsically subserves contextual associations, not scenes or places per se [J].
Bar, Moshe ;
Aminoff, Elissa ;
Schacter, Daniel L. .
JOURNAL OF NEUROSCIENCE, 2008, 28 (34) :8539-8544
[8]   How the brain integrates costs and benefits during decision making [J].
Basten, Ulrike ;
Biele, Guido ;
Heekeren, Hauke R. ;
Fiebach, Christian J. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (50) :21767-21772
[9]   Learning the value of information in an uncertain world [J].
Behrens, Timothy E. J. ;
Woolrich, Mark W. ;
Walton, Mark E. ;
Rushworth, Matthew F. S. .
NATURE NEUROSCIENCE, 2007, 10 (09) :1214-1221
[10]   Modeling effects of intrinsic and extrinsic rewards on the competition between striatal learning systems [J].
Boedecker, Joschka ;
Lampe, Thomas ;
Riedmiller, Martin .
FRONTIERS IN PSYCHOLOGY, 2013, 4