A smoothed Q-learning algorithm for estimating optimal dynamic treatment regimes

Cited by: 2
Authors
Fan, Yanqin [1]
He, Ming [2]
Su, Liangjun [3]
Zhou, Xiao-Hua [4, 5]
Affiliations
[1] Univ Washington, Dept Econ, Seattle, WA 98195 USA
[2] Univ Technol Sydney, Econ Discipline Grp, Ultimo, Australia
[3] Singapore Management Univ, Sch Econ, Singapore, Singapore
[4] Peking Univ, Beijing Int Ctr Math Res, Beijing 100871, Peoples R China
[5] Peking Univ, Sch Publ Hlth, Beijing 100191, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
asymptotic normality; exceptional law; optimal smoothing parameter; sequential randomization; Wald-type inference; TECHNICAL CHALLENGES; INFERENCE;
DOI
10.1111/sjos.12359
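Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification code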
020208 ; 070103 ; 0714 ;
Abstract
In this paper, we propose a smoothed Q-learning algorithm for estimating optimal dynamic treatment regimes. In contrast to the Q-learning algorithm in which nonregular inference is involved, we show that, under assumptions adopted in this paper, the proposed smoothed Q-learning estimator is asymptotically normally distributed even when the Q-learning estimator is not and its asymptotic variance can be consistently estimated. As a result, inference based on the smoothed Q-learning estimator is standard. We derive the optimal smoothing parameter and propose a data-driven method for estimating it. The finite sample properties of the smoothed Q-learning estimator are studied and compared with several existing estimators including the Q-learning estimator via an extensive simulation study. We illustrate the new method by analyzing data from the Clinical Antipsychotic Trials of Intervention Effectiveness-Alzheimer's Disease (CATIE-AD) study.
Pages: 446-469
Page count: 24