Credit assignment to state-independent task representations and its relationship with model-based decision making

Cited by: 34
Authors
Shahar, Nitzan [1, 2]
Moran, Rani [1, 2]
Hauser, Tobias U. [1, 2]
Kievit, Rogier A. [2, 3]
McNamee, Daniel [1, 2]
Moutoussis, Michael [1, 2]
Dolan, Raymond J. [1, 2]
Affiliations
[1] UCL, Wellcome Ctr Human Neuroimaging, London WC1N 3BG, England
[2] Max Planck Univ Coll London, Ctr Computat Psychiat & Res, Dept Imaging Neurosci, London WC1B 5EH, England
[3] Univ Cambridge, MRC, Cognit & Brain Sci Unit, Cambridge CB2 7EF, England
Funding
Wellcome Trust (UK);
Keywords
reinforcement learning; decision making; motor learning; FRONTAL-CORTEX; STIMULUS-VALUE; CHOICES; HUMANS;
DOI
10.1073/pnas.1821647116
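Chinese Library Classification number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code
07; 0710; 09;
Abstract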
Model-free learning enables an agent to make better decisions based on prior experience while representing only minimal knowledge about an environment's structure. It is generally assumed that model-free state representations are based on outcome-relevant features of the environment. Here, we challenge this assumption by providing evidence that a putative model-free system assigns credit to task representations that are irrelevant to an outcome. We examined data from 769 individuals performing a well-described 2-step reward decision task where stimulus identity but not spatial-motor aspects of the task predicted reward. We show that participants assigned value to spatial-motor representations despite these being outcome irrelevant. Strikingly, spatial-motor value associations affected behavior across all outcome-relevant features and stages of the task, consistent with credit assignment to low-level state-independent task representations. Individual difference analyses suggested that the impact of spatial-motor value formation was attenuated for individuals who showed greater deployment of goal-directed (model-based) strategies. Our findings highlight a need for a reconsideration of how model-free representations are formed and regulated according to the structure of the environment.
Pages: 15871-15876
Number of pages: 6