Habits Without Values

Cited by: 134
Authors
Miller, Kevin J. [1, 4, 5]
Shenhav, Amitai [2]
Ludvig, Elliot A. [3]
Affiliations
[1] Princeton Univ, Princeton Neurosci Inst, Princeton, NJ 08544 USA
[2] Brown Univ, Dept Cognit Linguist & Psychol Sci, Brown Inst Brain Sci, 190 Thayer St,Box 1821, Providence, RI 02912 USA
[3] Univ Warwick, Dept Psychol, Coventry, W Midlands, England
[4] UCL, London, England
[5] DeepMind, London, England
Keywords
habits; decision making; reinforcement learning; model-based; model-free; DIRECTED DECISION-MAKING; PREFRONTAL CORTEX; ORBITOFRONTAL CORTEX; MODEL-FREE; DORSOLATERAL STRIATUM; BASAL GANGLIA; LEARNING-SYSTEMS; VENTRAL STRIATUM; REACTION-TIME; CHOICE
DOI
10.1037/rev0000120
Chinese Library Classification (CLC) number
B84 [Psychology]
Discipline classification code
04; 0402
Abstract
Habits form a crucial component of behavior. In recent years, key computational models have conceptualized habits as arising from model-free reinforcement learning mechanisms, which typically select between available actions based on the future value expected to result from each. Traditionally, however, habits have been understood as behaviors that can be triggered directly by a stimulus, without requiring the animal to evaluate expected outcomes. Here, we develop a computational model instantiating this traditional view, in which habits develop through the direct strengthening of recently taken actions rather than through the encoding of outcomes. We demonstrate that this model accounts for key behavioral manifestations of habits, including insensitivity to outcome devaluation and contingency degradation, as well as the effects of reinforcement schedule on the rate of habit formation. The model also explains the prevalent observation of perseveration in repeated-choice tasks as an additional behavioral manifestation of the habit system. We suggest that mapping habitual behaviors onto value-free mechanisms provides a parsimonious account of existing behavioral and neural data. This mapping may provide a new foundation for building robust and comprehensive models of the interaction of habits with other, more goal-directed types of behaviors and help to better guide research into the neural mechanisms underlying control of instrumental behavior more generally.
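The value-free mechanism described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the exponential-moving-average update, the softmax choice rule, the function names `update_habit` and `choice_probs`, and the parameter values `alpha` and `beta` are all assumptions chosen for illustration. The point it shows is that habit strength grows for whichever action was just taken and that no outcome or reward enters the update.

```python
import numpy as np

def update_habit(habit, action, alpha=0.1):
    """Move habit strengths toward the action just taken.

    No reward or outcome appears here: the update depends only on
    which action was performed (assumed moving-average form).
    """
    target = np.zeros_like(habit)
    target[action] = 1.0
    return habit + alpha * (target - habit)

def choice_probs(habit, beta=5.0):
    """Softmax over habit strengths (assumed choice rule)."""
    logits = beta * habit
    logits = logits - logits.max()  # subtract max for numerical stability
    p = np.exp(logits)
    return p / p.sum()

# Repeating action 0 strengthens it regardless of any outcome.
habit = np.zeros(2)
for _ in range(50):
    habit = update_habit(habit, action=0)
probs = choice_probs(habit)
```

Because the update never consults an outcome, devaluing the reward afterwards would leave `habit` unchanged in this sketch, which is the kind of insensitivity to outcome devaluation the abstract refers to.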
Pages: 292-311
Page count: 20