Habits Without Values

被引:134
作者
Miller, Kevin J. [1 ,4 ,5 ]
Shenhav, Amitai [2 ]
Ludvig, Elliot A. [3 ]
机构
[1] Princeton Univ, Princeton Neurosci Inst, Princeton, NJ 08544 USA
[2] Brown Univ, Dept Cognit Linguist & Psychol Sci, Brown Inst Brain Sci, 190 Thayer St,Box 1821, Providence, RI 02912 USA
[3] Univ Warwick, Dept Psychol, Coventry, W Midlands, England
[4] UCL, London, England
[5] DeepMind, London, England
关键词
habits; decision making; reinforcement learning; model-based; model-free; DIRECTED DECISION-MAKING; PREFRONTAL CORTEX; ORBITOFRONTAL CORTEX; MODEL-FREE; DORSOLATERAL STRIATUM; BASAL GANGLIA; LEARNING-SYSTEMS; VENTRAL STRIATUM; REACTION-TIME; CHOICE;
D O I
10.1037/rev0000120
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Habits form a crucial component of behavior. In recent years, key computational models have conceptualized habits as arising from model-free reinforcement learning mechanisms, which typically select between available actions based on the future value expected to result from each. Traditionally, however, habits have been understood as behaviors that can be triggered directly by a stimulus, without requiring the animal to evaluate expected outcomes. Here, we develop a computational model instantiating this traditional view, in which habits develop through the direct strengthening of recently taken actions rather than through the encoding of outcomes. We demonstrate that this model accounts for key behavioral manifestations of habits, including insensitivity to outcome devaluation and contingency degradation, as well as the effects of reinforcement schedule on the rate of habit formation. The model also explains the prevalent observation of perseveration in repeated-choice tasks as an additional behavioral manifestation of the habit system. We suggest that mapping habitual behaviors onto value-free mechanisms provides a parsimonious account of existing behavioral and neural data. This mapping may provide a new foundation for building robust and comprehensive models of the interaction of habits with other, more goal-directed types of behaviors and help to better guide research into the neural mechanisms underlying control of instrumental behavior more generally.
引用
收藏
页码:292 / 311
页数:20
相关论文
共 125 条
  • [31] Model-Based Influences on Humans' Choices and Striatal Prediction Errors
    Daw, Nathaniel D.
    Gershman, Samuel J.
    Seymour, Ben
    Dayan, Peter
    Dolan, Raymond J.
    [J]. NEURON, 2011, 69 (06) : 1204 - 1215
  • [32] Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    Daw, ND
    Niv, Y
    Dayan, P
    [J]. NATURE NEUROSCIENCE, 2005, 8 (12) : 1704 - 1711
  • [33] Learning and selective attention
    Dayan, Peter
    Kakade, Sham
    Montague, P. Read
    [J]. NATURE NEUROSCIENCE, 2000, 3 (11) : 1218 - 1223
  • [34] Instrumental uncertainty as a determinant of behavior under interval schedules of reinforcement
    DeRusso, Alicia L.
    Fan, David
    Gupta, Jay
    Shelest, Oksana
    Costa, Rui M.
    Yin, Henry H.
    [J]. FRONTIERS IN INTEGRATIVE NEUROSCIENCE, 2010, 4
  • [35] Habits, action sequences and reinforcement learning
    Dezfouli, Amir
    Balleine, Bernard W.
    [J]. EUROPEAN JOURNAL OF NEUROSCIENCE, 2012, 35 (07) : 1036 - 1051
  • [36] THE EFFECT OF THE INSTRUMENTAL TRAINING CONTINGENCY ON SUSCEPTIBILITY TO REINFORCER DEVALUATION
    DICKINSON, A
    NICHOLAS, DJ
    ADAMS, CD
    [J]. QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY SECTION B-COMPARATIVE AND PHYSIOLOGICAL PSYCHOLOGY, 1983, 35 (FEB): : 35 - 51
  • [38] Dickinson A, 1998, Q J EXP PSYCHOL-B, V51, P271
  • [39] Goals and Habits in the Brain
    Dolan, Ray J.
    Dayan, Peter
    [J]. NEURON, 2013, 80 (02) : 312 - 325
  • [40] The ubiquity of model-based reinforcement learning
    Doll, Bradley B.
    Simon, Dylan A.
    Daw, Nathaniel D.
    [J]. CURRENT OPINION IN NEUROBIOLOGY, 2012, 22 (06) : 1075 - 1081