Step-adjusted tree-based reinforcement learning for evaluating nested dynamic treatment regimes using test-and-treat observational data

Cited by: 2
Authors
Tang, Ming [1]
Wang, Lu [1]
Gorin, Michael A. [2, 3]
Taylor, Jeremy M. G. [1]
Affiliations
[1] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[2] Johns Hopkins Univ, Sch Med, James Buchanan Brady Urol Inst, Baltimore, MD USA
[3] Johns Hopkins Univ, Sch Med, Dept Urol, Baltimore, MD 21205 USA
Keywords
dynamic treatment regimes; multistage decision-making; observational data; personalized health care; test-and-treat strategy; tree-based reinforcement learning; PROSTATE-CANCER; OPTIMIZATION; INFORMATION; INFERENCE; MODELS
DOI
10.1002/sim.9177
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
Dynamic treatment regimes (DTRs) include a sequence of treatment decision rules, in which treatment is adapted over time in response to the changes in an individual's disease progression and health care history. In medical practice, nested test-and-treat strategies are common to improve cost-effectiveness. For example, for patients at risk of prostate cancer, only patients who have high prostate-specific antigen (PSA) need a biopsy, which is costly and invasive, to confirm the diagnosis and help determine the treatment if needed. A decision about treatment happens after the biopsy, and is thus nested within the decision of whether to do the test. However, existing statistical methods are not able to accommodate such a naturally embedded property of the treatment decision within the test decision. Therefore, we developed a new statistical learning method, step-adjusted tree-based reinforcement learning, to evaluate DTRs within such a nested multistage dynamic decision framework using observational data. At each step within each stage, we combined the robust semiparametric estimation via augmented inverse probability weighting with a tree-based reinforcement learning method to deal with the counterfactual optimization. The simulation studies demonstrated robust performance of the proposed methods under different scenarios. We further applied our method to evaluate the necessity of prostate biopsy and identify the optimal test-and-treat regimes for prostate cancer patients using data from the Johns Hopkins University prostate cancer active surveillance dataset.
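The abstract names augmented inverse probability weighting (AIPW) but does not give the estimator's form. As a minimal illustration only, the sketch below shows the generic single-stage AIPW estimate of the mean outcome under one fixed treatment option. It is not the paper's step-adjusted, multistage procedure, and the function name `aipw_value` and all argument names are hypothetical. The propensity scores and outcome-model predictions are assumed to have been fitted elsewhere.

```python
import numpy as np


def aipw_value(y, a, propensity, mu_hat, treatment):
    """Generic AIPW estimate of the mean outcome if everyone received `treatment`.

    y          : observed outcomes, shape (n,)
    a          : observed treatment labels, shape (n,)
    propensity : estimated P(A = treatment | X) per subject, shape (n,)
    mu_hat     : estimated E[Y | A = treatment, X] per subject, shape (n,)
    (All names are illustrative assumptions, not taken from the paper.)
    """
    y = np.asarray(y, dtype=float)
    a = np.asarray(a)
    propensity = np.asarray(propensity, dtype=float)
    mu_hat = np.asarray(mu_hat, dtype=float)

    # Indicator of subjects who actually received the treatment of interest.
    received = (a == treatment).astype(float)

    # Outcome-model prediction plus an inverse-probability-weighted
    # residual correction for subjects who received the treatment.
    pseudo_outcome = mu_hat + received / propensity * (y - mu_hat)
    return float(pseudo_outcome.mean())
```

In this generic form the estimate remains consistent if either the propensity model or the outcome model is correctly specified, which is the usual sense in which AIPW is called robust. A policy search would compare such estimates across candidate treatment options.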
Pages: 6164-6177
Page count: 14