Q-learning residual analysis: application to the effectiveness of sequences of antipsychotic medications for patients with schizophrenia

被引:15
作者
Ertefaie, Ashkan [1 ,2 ]
Shortreed, Susan [3 ]
Chakraborty, Bibhas [4 ,5 ]
机构
[1] Univ Penn, Dept Stat, Philadelphia, PA 19104 USA
[2] Univ Penn, Ctr Pharmacoepidemiol Res & Training, Philadelphia, PA 19104 USA
[3] GroupHlth Res Inst, Biostat Unit, Seattle, WA USA
[4] Duke NUS Grad Med Sch, Ctr Quantitat Med, Singapore, Singapore
[5] Columbia Univ, Dept Biostat, New York, NY USA
基金
美国国家科学基金会;
关键词
dynamic treatment regimes; Q-learning; Residual analysis; SMART designs; DESIGN; STRATEGIES; REGRESSION; INFERENCE; TRIALS; SCALE;
D O I
10.1002/sim.6859
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Q-learning is a regression-based approach that uses longitudinal data to construct dynamic treatment regimes, which are sequences of decision rules that use patient information to inform future treatment decisions. An optimal dynamic treatment regime is composed of a sequence of decision rules that indicate how to optimally individualize treatment using the patients' baseline and time-varying characteristics to optimize the final outcome. Constructing optimal dynamic regimes using Q-learning depends heavily on the assumption that regression models at each decision point are correctly specified; yet model checking in the context of Q-learning has been largely overlooked in the current literature. In this article, we show that residual plots obtained from standard Q-learning models may fail to adequately check the quality of the model fit. We present a modified Q-learning procedure that accommodates residual analyses using standard tools. We present simulation studies showing the advantage of the proposed modification over standard Q-learning. We illustrate this new Q-learning approach using data collected from a sequential multiple assignment randomized trial of patients with schizophrenia. Copyright (c) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:2221 / 2234
页数:14
相关论文
共 39 条
[1]  
[Anonymous], 1990, Applied Linear Statistical Models: Regression, Analysis of Variance, and Experimental Designs
[2]  
[Anonymous], 1989, THESIS CAMBRIDGE U
[3]  
CARROLL RJ, 1988, TECHNOMETRICS, V30, P30, DOI 10.2307/1270315
[4]  
Chakraborty B., 2013, STAT METHODS DYNAMIC, V76
[5]   Inference for Optimal Dynamic Treatment Regimes Using an Adaptive m-Out-of-n Bootstrap Scheme [J].
Chakraborty, Bibhas ;
Laber, Eric B. ;
Zhao, Yingqi .
BIOMETRICS, 2013, 69 (03) :714-723
[6]   Inference for non-regular parameters in optimal dynamic treatment regimes [J].
Chakraborty, Bibhas ;
Murphy, Susan ;
Strecher, Victor .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2010, 19 (03) :317-343
[7]   A conceptual framework for adaptive preventive interventions [J].
Collins, LM ;
Murphy, SA ;
Bierman, KL .
PREVENTION SCIENCE, 2004, 5 (03) :185-196
[8]   Q-LEARNING WITH CENSORED DATA [J].
Goldberg, Yair ;
Kosorok, Michael R. .
ANNALS OF STATISTICS, 2012, 40 (01) :529-560
[9]  
Hastie T., 1986, STAT SCI, V1, P297
[10]   THE QUALITY OF LIFE SCALE - AN INSTRUMENT FOR RATING THE SCHIZOPHRENIC DEFICIT SYNDROME [J].
HEINRICHS, DW ;
HANLON, TE ;
CARPENTER, WT .
SCHIZOPHRENIA BULLETIN, 1984, 10 (03) :388-398