Tutorial in biostatistics: data-driven subgroup identification and analysis in clinical trials

被引:213
作者
Lipkovich, Ilya [1 ]
Dmitrienko, Alex [2 ]
D'Agostino, Ralph B., Sr. [3 ]
机构
[1] Quintiles Inc, 4820 Emperor Blvd, Durham, NC 27703 USA
[2] Mediana Inc, Overland Pk, KS USA
[3] Boston Univ, Boston, MA 02215 USA
关键词
clinical trials; exploratory subgroup analysis; biomarker analysis; data mining; multiplicity control; OPTIMAL TREATMENT REGIMES; PERSONALIZED MEDICINE; VARIABLE SELECTION; POTENTIAL OUTCOMES; REGRESSION; SIGNATURE; LASSO; TREES; REGULARIZATION; HETEROGENEITY;
D O I
10.1002/sim.7064
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
It is well known that both the direction and magnitude of the treatment effect in clinical trials are often affected by baseline patient characteristics (generally referred to as biomarkers). Characterization of treatment effect heterogeneity plays a central role in the field of personalized medicine and facilitates the development of tailored therapies. This tutorial focuses on a general class of problems arising in data-driven subgroup analysis, namely, identification of biomarkers with strong predictive properties and patient subgroups with desirable characteristics such as improved benefit and/or safety. Limitations of ad-hoc approaches to biomarker exploration and subgroup identification in clinical trials are discussed, and the ad-hoc approaches are contrasted with principled approaches to exploratory subgroup analysis based on recent advances in machine learning and data mining. A general framework for evaluating predictive biomarkers and identification of associated subgroups is introduced. The tutorial provides a review of a broad class of statistical methods used in subgroup discovery, including global outcome modeling methods, global treatment effect modeling methods, optimal treatment regimes, and local modeling methods. Commonly used subgroup identification methods are illustrated using two case studies based on clinical trials with binary and survival endpoints. Copyright (C) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:136 / 196
页数:61
相关论文
共 100 条
[1]  
[Anonymous], 2014, GUID INV SUBGR CONF
[2]  
[Anonymous], 2006, Journal of the Royal Statistical Society, Series B
[3]   A BAYESIAN APPROACH TO SUBGROUP IDENTIFICATION [J].
Berger, James O. ;
Wang, Xiaojing ;
Shen, Lei .
JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2014, 24 (01) :110-129
[4]  
BERRY DA, 1990, BIOMETRICS, V46, P1227
[5]   A LASSO FOR HIERARCHICAL INTERACTIONS [J].
Bien, Jacob ;
Taylor, Jonathan ;
Tibshirani, Robert .
ANNALS OF STATISTICS, 2013, 41 (03) :1111-1141
[6]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]   Statistical modeling: The two cultures [J].
Breiman, L .
STATISTICAL SCIENCE, 2001, 16 (03) :199-215
[9]  
Brookes S T, 2001, Health Technol Assess, V5, P1
[10]   Analysis of randomized comparative clinical trial data for personalized treatment selections [J].
Cai, Tianxi ;
Tian, Lu ;
Wong, Peggy H. ;
Wei, L. J. .
BIOSTATISTICS, 2011, 12 (02) :270-282