HMM for discovering decision-making dynamics using reinforcement learning experiments

Cited by: 0
Authors
Guo, Xingche [1]
Zeng, Donglin [2]
Wang, Yuanjia [1, 3]
Affiliations
[1] Columbia Univ, Dept Biostat, 722 West 168th St, New York, NY 10032 USA
[2] Univ Michigan, Dept Biostat, 1415 Washington Hts, Ann Arbor, MI 48109 USA
[3] Columbia Univ, Dept Psychiat, 1051 Riverside Dr, New York, NY 10032 USA
Funding
National Institutes of Health (USA);
Keywords
behavioral phenotyping; brain-behavior association; mental health; reinforcement learning; reward tasks; state-switching; PSYCHIATRY; TASK;
DOI
10.1093/biostatistics/kxae033
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Subject classification codes
07; 0710; 09;
Abstract
Major depressive disorder (MDD), a leading cause of years of life lived with disability, presents challenges in diagnosis and treatment due to its complex and heterogeneous nature. Emerging evidence indicates that reward processing abnormalities may serve as a behavioral marker for MDD. To measure reward processing, patients perform computer-based behavioral tasks in the laboratory that involve making choices or responding to stimuli associated with different outcomes, such as gains or losses. Reinforcement learning (RL) models are fitted to extract parameters that measure various aspects of reward processing (e.g. reward sensitivity) and so characterize how patients make decisions in behavioral tasks. Recent findings suggest that a single RL model is inadequate for characterizing reward learning; instead, decision-making may switch between multiple strategies. An important scientific question is how the dynamics of strategies in decision-making affect the reward learning ability of individuals with MDD. Motivated by the probabilistic reward task within the Establishing Moderators and Biosignatures of Antidepressant Response in Clinical Care (EMBARC) study, we propose a novel RL-HMM (hidden Markov model) framework for analyzing reward-based decision-making. Our model accommodates decision-making strategy switching between two distinct approaches under an HMM: subjects making decisions based on the RL model or opting for random choices. We account for continuous RL state space and allow time-varying transition probabilities in the HMM. We introduce a computationally efficient expectation-maximization (EM) algorithm for parameter estimation and use a nonparametric bootstrap for inference. Extensive simulation studies validate the finite-sample performance of our method. We apply our approach to the EMBARC study to show that MDD patients are less engaged in RL than healthy controls, and that engagement is associated with brain activity in the negative affect circuitry during an emotional conflict task.
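To make the two-strategy idea in the abstract concrete, the following is a minimal sketch of a forward-algorithm log-likelihood for a two-state HMM in which one hidden state follows an RL choice rule and the other chooses at random. It is not the authors' implementation. Everything specific here is an assumption made for illustration: the two-action task, the delta-rule update with learning rate `alpha`, the softmax choice rule with sensitivity `beta`, the logistic-in-time form of the "stay" probabilities (`stay_engaged`, `stay_random`), and the simplification that Q-values are updated on every trial regardless of the hidden state. The EM estimation and bootstrap inference mentioned in the abstract are not shown.

```python
import math


def logistic(x):
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def rl_hmm_loglik(actions, rewards, alpha, beta,
                  stay_engaged, stay_random, init_engaged=0.5):
    """Log-likelihood of a choice sequence under a two-state RL-HMM sketch.

    Hidden state 0 ("engaged"): softmax choice over Q-values.
    Hidden state 1 ("random"): each of the two actions has probability 0.5.

    actions: list of 0/1 choices; rewards: list of numeric outcomes.
    stay_engaged, stay_random: hypothetical (intercept, slope) pairs giving
    logit P(stay in state) as a linear function of scaled trial time.
    """
    n = len(actions)
    q = [0.0, 0.0]                        # Q-values for the two actions
    prior = [init_engaged, 1.0 - init_engaged]
    loglik = 0.0
    for t, (a, r) in enumerate(zip(actions, rewards)):
        # Emission probabilities of the observed action in each state.
        p_action0 = logistic(beta * (q[0] - q[1]))
        emit = [p_action0 if a == 0 else 1.0 - p_action0, 0.5]
        # Forward step: combine predicted state probabilities with emissions.
        joint = [prior[s] * emit[s] for s in range(2)]
        norm = joint[0] + joint[1]
        loglik += math.log(norm)
        post = [j / norm for j in joint]
        # Assumed simplification: delta-rule update on every trial.
        q[a] += alpha * (r - q[a])
        # Time-varying transition probabilities (assumed logistic form).
        u = (t + 1) / n
        p_ee = logistic(stay_engaged[0] + stay_engaged[1] * u)
        p_rr = logistic(stay_random[0] + stay_random[1] * u)
        prior = [post[0] * p_ee + post[1] * (1.0 - p_rr),
                 post[0] * (1.0 - p_ee) + post[1] * p_rr]
    return loglik


# Example call on a short, made-up sequence of choices and rewards.
example_actions = [0, 0, 1, 0, 1, 1, 0, 0]
example_rewards = [1, 0, 0, 1, 1, 0, 1, 1]
example_ll = rl_hmm_loglik(example_actions, example_rewards,
                           alpha=0.3, beta=2.0,
                           stay_engaged=(2.0, -1.0), stay_random=(1.0, 0.5))
```

Under these assumptions the emission probabilities depend only on the observed history, so the forward recursion is exact. A model in which learning pauses during the random state would need a different treatment of the Q-values.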
Pages: 16