Habits Without Values

被引:134
作者
Miller, Kevin J. [1 ,4 ,5 ]
Shenhav, Amitai [2 ]
Ludvig, Elliot A. [3 ]
机构
[1] Princeton Univ, Princeton Neurosci Inst, Princeton, NJ 08544 USA
[2] Brown Univ, Dept Cognit Linguist & Psychol Sci, Brown Inst Brain Sci, 190 Thayer St,Box 1821, Providence, RI 02912 USA
[3] Univ Warwick, Dept Psychol, Coventry, W Midlands, England
[4] UCL, London, England
[5] DeepMind, London, England
关键词
habits; decision making; reinforcement learning; model-based; model-free; DIRECTED DECISION-MAKING; PREFRONTAL CORTEX; ORBITOFRONTAL CORTEX; MODEL-FREE; DORSOLATERAL STRIATUM; BASAL GANGLIA; LEARNING-SYSTEMS; VENTRAL STRIATUM; REACTION-TIME; CHOICE;
D O I
10.1037/rev0000120
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Habits form a crucial component of behavior. In recent years, key computational models have conceptualized habits as arising from model-free reinforcement learning mechanisms, which typically select between available actions based on the future value expected to result from each. Traditionally, however, habits have been understood as behaviors that can be triggered directly by a stimulus, without requiring the animal to evaluate expected outcomes. Here, we develop a computational model instantiating this traditional view, in which habits develop through the direct strengthening of recently taken actions rather than through the encoding of outcomes. We demonstrate that this model accounts for key behavioral manifestations of habits, including insensitivity to outcome devaluation and contingency degradation, as well as the effects of reinforcement schedule on the rate of habit formation. The model also explains the prevalent observation of perseveration in repeated-choice tasks as an additional behavioral manifestation of the habit system. We suggest that mapping habitual behaviors onto value-free mechanisms provides a parsimonious account of existing behavioral and neural data. This mapping may provide a new foundation for building robust and comprehensive models of the interaction of habits with other, more goal-directed types of behaviors and help to better guide research into the neural mechanisms underlying control of instrumental behavior more generally.
引用
收藏
页码:292 / 311
页数:20
相关论文
共 125 条
  • [1] Predicting behavior from actions in the past: Repeated decision making or a matter of habit?
    Aarts, H
    Verplanken, B
    van Knippenberg, A
    [J]. JOURNAL OF APPLIED SOCIAL PSYCHOLOGY, 1998, 28 (15) : 1355 - 1374
  • [2] The prefrontal cortex and hybrid learning during iterative competitive games
    Abe, Hiroshi
    Seo, Hyojung
    Lee, Daeyeol
    [J]. CRITICAL CONTRIBUTIONS OF THE ORBITOFRONTAL CORTEX TO BEHAVIOR, 2011, 1239 : 100 - 108
  • [3] INSTRUMENTAL RESPONDING FOLLOWING REINFORCER DEVALUATION
    ADAMS, CD
    DICKINSON, A
    [J]. QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY SECTION B-COMPARATIVE AND PHYSIOLOGICAL PSYCHOLOGY, 1981, 33 (MAY): : 109 - 121
  • [4] VARIATIONS IN THE SENSITIVITY OF INSTRUMENTAL RESPONDING TO REINFORCER DEVALUATION
    ADAMS, CD
    [J]. QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY SECTION B-COMPARATIVE AND PHYSIOLOGICAL PSYCHOLOGY, 1982, 34 (MAY): : 77 - 98
  • [5] Autonomous Mechanism of Internal Choice Estimate Underlies Decision Inertia
    Akaishi, Rei
    Umeda, Kazumasa
    Nagase, Asako
    Sakai, Katsuyuki
    [J]. NEURON, 2014, 81 (01) : 195 - 206
  • [6] Akam T., 2017, BioRxiv, P1, DOI [10.1101/126292, DOI 10.1101/126292]
  • [7] [Anonymous], 2008, INT C MACHINE LEARNI
  • [8] [Anonymous], 2015, INTRO MODEL BASED CO, DOI DOI 10.1007/978-1-4939-2236-9_8
  • [9] [Anonymous], 1998, REINFORCEMENT LEARNI
  • [10] A neurobiological theory of automaticity in perceptual categorization
    Ashby, F. Gregory
    Ennis, John M.
    Spiering, Brian J.
    [J]. PSYCHOLOGICAL REVIEW, 2007, 114 (03) : 632 - 656