SUBGROUP-EFFECTS MODELS FOR THE ANALYSIS OF PERSONAL TREATMENT EFFECTS

被引:4
作者
Zhou, Ling [1 ]
Sun, Shiquan [2 ]
Fu, Haoda [3 ]
Song, Peter X-K [4 ]
机构
[1] Southwestern Univ Finance & Econ, Ctr Stat Res, Chengdu, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Publ Hlth, Xian, Peoples R China
[3] Eli Lilly & Co, Indianapolis, IN 46285 USA
[4] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
基金
中国国家自然科学基金; 美国国家卫生研究院; 美国国家科学基金会;
关键词
ADMM algorithm; EM algorithm; maximum likelihood; precision medicine; supervised clustering; BLOOD LEAD LEVELS; VARIABLE SELECTION; CALCIUM SUPPLEMENTATION; MAXIMUM-LIKELIHOOD; MIXED MODELS; MIXTURE; PREGNANCY; ALGORITHM; COMPONENTS; EXPOSURE;
D O I
10.1214/21-AOAS1503
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The emerging field of precision medicine is transforming statistical analysis from the classical paradigm of population-average treatment effects into that of personal treatment effects. This new scientific mission has called for adequate statistical methods to assess heterogeneous covariate effects in regression analysis. This paper focuses on a subgroup analysis that consists of two primary analytic tasks: identification of treatment effect subgroups and individual group memberships, and statistical inference on treatment effects by subgroup. We propose an approach to synergizing supervised clustering analysis via alternating direction method of multipliers (ADMM) algorithm and statistical inference on subgroup effects via expectation-maximization (EM) algorithm. Our proposed procedure, termed as hybrid operation for subgroup analysis (HOSA), enjoys computational speed and numerical stability with interpretability and reproducibility. We establish key theoretical properties for both proposed clustering and inference procedures. Numerical illustration includes extensive simulation studies and analyses of motivating data from two randomized clinical trials to learn subgroup treatment effects.
引用
收藏
页码:80 / 103
页数:24
相关论文
共 61 条
[11]   Splitting Methods for Convex Clustering [J].
Chi, Eric C. ;
Lange, Kenneth .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2015, 24 (04) :994-1013
[13]   Detecting features in spatial point processes with clutter via model-based clustering [J].
Dasgupta, A ;
Raftery, AE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1998, 93 (441) :294-302
[14]  
Deb P, 2000, HEALTH ECON, V9, P475, DOI 10.1002/1099-1050(200009)9:6<475::AID-HEC544>3.0.CO
[15]  
2-H
[16]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[17]   Sample size planning for developing classifiers using high-dimensional DNA microarray data [J].
Dobbin, Kevin K. ;
Simon, Richard M. .
BIOSTATISTICS, 2007, 8 (01) :101-117
[18]   Sparse Subspace Clustering: Algorithm, Theory, and Applications [J].
Elhamifar, Ehsan ;
Vidal, Rene .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (11) :2765-2781
[19]   Dietary calcium supplementation to lower blood lead levels in pregnancy and lactation [J].
Ettinger, Adrienne S. ;
Hu, Howard ;
Hernandez-Avila, Mauricio .
JOURNAL OF NUTRITIONAL BIOCHEMISTRY, 2007, 18 (03) :172-178
[20]  
Ettinger AS, 2009, ENVIRON HEALTH PERSP, V117, P26, DOI [10.1289/ehp.11868, 10.1289/ehp.1171c26]