Robust estimation of optimal dynamic treatment regimes for sequential treatment decisions

被引:123
作者
Zhang, Baqun [1 ]
Tsiatis, Anastasios A. [2 ]
Laber, Eric B. [2 ]
Davidian, Marie [2 ]
机构
[1] Renmin Univ China, Sch Stat, Beijing 100872, Peoples R China
[2] N Carolina State Univ, Dept Stat, Raleigh, NC 27695 USA
基金
美国国家卫生研究院;
关键词
A-learning; Double robustness; Outcome regression; Propensity score; Q-learning; REGRESSION; INFERENCE; DESIGN; MODELS;
D O I
10.1093/biomet/ast014
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A dynamic treatment regime is a list of sequential decision rules for assigning treatment based on a patient's history. Q- and A-learning are two main approaches for estimating the optimal regime, i.e., that yielding the most beneficial outcome in the patient population, using data from a clinical trial or observational study. Q-learning requires postulated regression models for the outcome, while A-learning involves models for that part of the outcome regression representing treatment contrasts and for treatment assignment. We propose an alternative to Q- and A-learning that maximizes a doubly robust augmented inverse probability weighted estimator for population mean outcome over a restricted class of regimes. Simulations demonstrate the method's performance and robustness to model misspecification, which is a key concern.
引用
收藏
页码:681 / 694
页数:14
相关论文
共 22 条
[1]   Structural Nested Mean Models for Assessing Time-Varying Effect Moderation [J].
Almirall, Daniel ;
Ten Have, Thomas ;
Murphy, Susan A. .
BIOMETRICS, 2010, 66 (01) :131-139
[2]  
Bather J., 2000, GENETIC ALGORITHMS S
[3]   Inference for non-regular parameters in optimal dynamic treatment regimes [J].
Chakraborty, Bibhas ;
Murphy, Susan ;
Strecher, Victor .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2010, 19 (03) :317-343
[4]  
Goldberg DE, 1989, READING
[5]   Regret-Regression for Optimal Dynamic Treatment Regimes [J].
Henderson, Robin ;
Ansell, Phil ;
Alshibani, Deyadeen .
BIOMETRICS, 2010, 66 (04) :1192-1201
[6]  
Mebane WR, 2011, J STAT SOFTW, V42, P1
[7]   Demystifying optimal dynamic treatment regimes [J].
Moodie, Erica E. M. ;
Richardson, Thomas S. ;
Stephens, David A. .
BIOMETRICS, 2007, 63 (02) :447-455
[8]   An experimental design for the development of adaptive treatment strategies [J].
Murphy, SA .
STATISTICS IN MEDICINE, 2005, 24 (10) :1455-1481
[9]   Optimal dynamic treatment regimes [J].
Murphy, SA .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2003, 65 :331-355
[10]   Methodological challenges in constructing effective treatment sequences for chronic psychiatric disorders [J].
Murphy, Susan A. ;
Oslin, David W. ;
Rush, A. John ;
Zhu, Ji .
NEUROPSYCHOPHARMACOLOGY, 2007, 32 (02) :257-262