Preventing false discovery of heterogeneous treatment effect subgroups in randomized trials

被引:16
作者
Rigdon, Joseph [1 ]
Baiocchi, Michael [2 ]
Basu, Sanjay [3 ,4 ,5 ]
机构
[1] Stanford Univ, Sch Med, Quantitat Sci Unit, 1070 Arastradero Rd,3C3104, Palo Alto, CA 94304 USA
[2] Stanford Univ, Sch Med, Stanford Prevent Res Ctr, Med Sch Off Bldg,Room 318,1265 Welch Rd,MC 5411, Stanford, CA 94305 USA
[3] Stanford Univ, Sch Med, Ctr Primary Care & Outcomes Res, Dept Med, 1070 Arastradero Rd,Off 282,MC 5560, Palo Alto, CA 94304 USA
[4] Stanford Univ, Sch Med, Ctr Primary Care & Outcomes Res, Dept Hlth Res & Policy, 1070 Arastradero Rd,Off 282,MC 5560, Palo Alto, CA 94304 USA
[5] Stanford Univ, Sch Med, Ctr Populat Hlth Sci, 1070 Arastradero Rd,Off 282,MC 5560, Palo Alto, CA 94304 USA
基金
美国国家卫生研究院;
关键词
Classification and regression trees; Decision support tool; Heterogeneous treatment effects; Matching; INFERENCE;
D O I
10.1186/s13063-018-2774-5
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Background: Heterogeneous treatment effects (HTEs), or systematic differences in treatment effectiveness among participants with different observable features, may be important when applying trial results to clinical practice. Current methods suffer from a potential for false detection of HTEs due to imbalances in covariates between candidate subgroups. Methods: We introduce a new method, matching plus classification and regression trees (mCART), that yields balance in covariates in identified HTE subgroups. We compared mCART to a classical method (logistic regression [LR] with backwards covariate selection using the Akaike information criterion) and two machine-learning approaches increasingly applied to HTE detection (random forest [RF] and gradient RF) in simulations with a binary outcome with known HTE subgroups. We considered an N = 200 phase II oncology trial where there were either no HTEs (1A) or two HTE subgroups (1B) and an N = 6000 phase III cardiovascular disease trial where there were either no HTEs (2A) or four HTE subgroups (2B). Additionally, we considered an N = 6000 phase III cardiovascular disease trial where there was no average treatment effect but there were four HTE subgroups (2C). Results: In simulations 1A and 2A (no HTEs), mCART did not identify any HTE subgroups, whereas LR found 2 and 448, RF 5 and 2, and gradient RF 5 and 24, respectively (all false positives). In simulation 1B, mCART failed to identify the two true HTE subgroups whereas LR found 4, RF 6, and gradient RF 10 (half or more of which were false positives). In simulations 2B and 2C, mCART captured the four true HTE subgroups, whereas the other methods found only false positives. All HTE subgroups identified by mCART had acceptable treated vs. control covariate balance with absolute standardized differences less than 0.2, whereas the absolute standardized differences for the other methods typically exceeded 0.2. The imbalance in covariates in identified subgroups for LR, RF, and gradient RF indicates the false HTE detection may have been due to confounding. Conclusions: Covariate imbalances may be producing false positives in subgroup analyses. mCART could be a useful tool to help prevent the false discovery of HTE subgroups in secondary analyses of randomized trial data.
引用
收藏
页数:15
相关论文
共 24 条
  • [21] A Randomized Trial of Intensive versus Standard Blood-Pressure Control
    Wright, Jackson T., Jr.
    Williamson, Jeff D.
    Whelton, Paul K.
    Snyder, Joni K.
    Sink, Kaycee M.
    Rocco, Michael V.
    Reboussin, David M.
    Rahman, Mahboob
    Oparil, Suzanne
    Lewis, Cora E.
    Kimmel, Paul L.
    Johnson, Karen C.
    Goff, David C., Jr.
    Fine, Lawrence J.
    Cutler, Jeffrey A.
    Cushman, William C.
    Cheung, Alfred K.
    Ambrosius, Walter T.
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2015, 373 (22) : 2103 - 2116
  • [22] A Simple Method for Estimating Interactions Between a Treatment and a Large Number of Covariates
    Tian, Lu
    Alizadeh, Ash A.
    Gentles, Andrew J.
    Tibshirani, Robert
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (508) : 1517 - 1532
  • [23] von Elm E, 2007, PLOS MED, V4, P1623, DOI [10.1371/journal.pmed.0040296, 10.1371/journal.pmed.0040297, 10.1016/S0140-6736(07)61602-X]
  • [24] Wager S, 2017, J AM STAT ASSOC, DOI [10.1080/01621459.2017.1319839?journalCode=uasa20, DOI 10.1080/01621459.2017.1319839?JOURNALCODE=UASA20]