Q- and A-Learning Methods for Estimating Optimal Dynamic Treatment Regimes

被引:144
作者
Schulte, Phillip J. [1 ]
Tsiatis, Anastasios A. [2 ]
Laber, Eric B. [2 ]
Davidian, Marie [2 ]
机构
[1] Duke Clin Res Inst, Durham, NC 27701 USA
[2] N Carolina State Univ, Dept Stat, Raleigh, NC 27695 USA
关键词
Advantage learning; bias-variance trade-off; model misspecification; personalized medicine; potential outcomes; sequential decision-making; ADAPTIVE TREATMENT STRATEGIES; NESTED MEAN MODELS; CLINICAL-TRIALS; DESIGN; INFERENCE; RANDOMIZATION; DEPRESSION; REGRESSION; DECISIONS; SUBJECT;
D O I
10.1214/13-STS450
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In clinical practice, physicians make a series of treatment decisions over the course of a patient's disease based on his/her baseline and evolving characteristics. A dynamic treatment regime is a set of sequential decision rules that operationalizes this process. Each rule corresponds to a decision point and dictates the next treatment action based on the accrued information. Using existing data, a key goal is estimating the optimal regime, that, if followed by the patient population, would yield the most favorable outcome on average. Q- and A-learning are two main approaches for this purpose. We provide a detailed account of these methods, study their performance, and illustrate them using data from a depression study.
引用
收藏
页码:640 / 661
页数:22
相关论文
共 44 条
[1]   Structural Nested Mean Models for Assessing Time-Varying Effect Moderation [J].
Almirall, Daniel ;
Ten Have, Thomas ;
Murphy, Susan A. .
BIOMETRICS, 2010, 66 (01) :131-139
[2]  
[Anonymous], 2013, SINGLE WORLD INTERVE
[3]  
[Anonymous], 2004, ANN ARBOR
[4]  
[Anonymous], 1989, THESIS CAMBRIDGE U
[5]  
BATHER J., 2000, DECISION THEORY INTR
[6]  
Chakraborty B., 2012, ESTIMATING OPT UNPUB
[7]   Inference for non-regular parameters in optimal dynamic treatment regimes [J].
Chakraborty, Bibhas ;
Murphy, Susan ;
Strecher, Victor .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2010, 19 (03) :317-343
[8]  
Craven MW, 1996, ADV NEUR IN, V8, P24
[9]   Regret-Regression for Optimal Dynamic Treatment Regimes [J].
Henderson, Robin ;
Ansell, Phil ;
Alshibani, Deyadeen .
BIOMETRICS, 2010, 66 (04) :1192-1201
[10]  
Laber E. B., 2010, ARXIV10065831V1