Preventing false discovery of heterogeneous treatment effect subgroups in randomized trials

被引:16
作者
Rigdon, Joseph [1 ]
Baiocchi, Michael [2 ]
Basu, Sanjay [3 ,4 ,5 ]
机构
[1] Stanford Univ, Sch Med, Quantitat Sci Unit, 1070 Arastradero Rd,3C3104, Palo Alto, CA 94304 USA
[2] Stanford Univ, Sch Med, Stanford Prevent Res Ctr, Med Sch Off Bldg,Room 318,1265 Welch Rd,MC 5411, Stanford, CA 94305 USA
[3] Stanford Univ, Sch Med, Ctr Primary Care & Outcomes Res, Dept Med, 1070 Arastradero Rd,Off 282,MC 5560, Palo Alto, CA 94304 USA
[4] Stanford Univ, Sch Med, Ctr Primary Care & Outcomes Res, Dept Hlth Res & Policy, 1070 Arastradero Rd,Off 282,MC 5560, Palo Alto, CA 94304 USA
[5] Stanford Univ, Sch Med, Ctr Populat Hlth Sci, 1070 Arastradero Rd,Off 282,MC 5560, Palo Alto, CA 94304 USA
基金
美国国家卫生研究院;
关键词
Classification and regression trees; Decision support tool; Heterogeneous treatment effects; Matching; INFERENCE;
D O I
10.1186/s13063-018-2774-5
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Background: Heterogeneous treatment effects (HTEs), or systematic differences in treatment effectiveness among participants with different observable features, may be important when applying trial results to clinical practice. Current methods suffer from a potential for false detection of HTEs due to imbalances in covariates between candidate subgroups. Methods: We introduce a new method, matching plus classification and regression trees (mCART), that yields balance in covariates in identified HTE subgroups. We compared mCART to a classical method (logistic regression [LR] with backwards covariate selection using the Akaike information criterion) and two machine-learning approaches increasingly applied to HTE detection (random forest [RF] and gradient RF) in simulations with a binary outcome with known HTE subgroups. We considered an N = 200 phase II oncology trial where there were either no HTEs (1A) or two HTE subgroups (1B) and an N = 6000 phase III cardiovascular disease trial where there were either no HTEs (2A) or four HTE subgroups (2B). Additionally, we considered an N = 6000 phase III cardiovascular disease trial where there was no average treatment effect but there were four HTE subgroups (2C). Results: In simulations 1A and 2A (no HTEs), mCART did not identify any HTE subgroups, whereas LR found 2 and 448, RF 5 and 2, and gradient RF 5 and 24, respectively (all false positives). In simulation 1B, mCART failed to identify the two true HTE subgroups whereas LR found 4, RF 6, and gradient RF 10 (half or more of which were false positives). In simulations 2B and 2C, mCART captured the four true HTE subgroups, whereas the other methods found only false positives. All HTE subgroups identified by mCART had acceptable treated vs. control covariate balance with absolute standardized differences less than 0.2, whereas the absolute standardized differences for the other methods typically exceeded 0.2. The imbalance in covariates in identified subgroups for LR, RF, and gradient RF indicates the false HTE detection may have been due to confounding. Conclusions: Covariate imbalances may be producing false positives in subgroup analyses. mCART could be a useful tool to help prevent the false discovery of HTE subgroups in secondary analyses of randomized trial data.
引用
收藏
页数:15
相关论文
共 24 条
  • [1] The Precision Medicine Initiative A New National Effort
    Ashley, Euan A.
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2015, 313 (21): : 2119 - 2120
  • [2] Recursive partitioning for heterogeneous causal effects
    Athey, Susan
    Imbens, Guido
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (27) : 7353 - 7360
  • [3] Benefit and harm of intensive blood pressure treatment: Derivation and validation of risk models using data from the SPRINT and ACCORD trials
    Basu, Sanjay
    Sussman, Jeremy B.
    Rigdon, Joseph
    Steimle, Lauren
    Denton, Brian T.
    Hayward, Rodney A.
    [J]. PLOS MEDICINE, 2017, 14 (10)
  • [4] Targeting weight loss interventions to reduce cardiovascular complications of type 2 diabetes: a machine learning-based post-hoc analysis of heterogeneous treatment effects in the Look AHEAD trial
    Baum, Aaron
    Scarpa, Joseph
    Bruzelius, Emilie
    Tamler, Ronald
    Basu, Sanjay
    Faghmous, James
    [J]. LANCET DIABETES & ENDOCRINOLOGY, 2017, 5 (10) : 808 - 815
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] Using Internally Developed Risk Models to Assess Heterogeneity in Treatment Effects in Clinical Trials
    Burke, James F.
    Hayward, Rodney A.
    Nelson, Jason P.
    Kent, David M.
    [J]. CIRCULATION-CARDIOVASCULAR QUALITY AND OUTCOMES, 2014, 7 (01): : 163 - 169
  • [7] Cohen J., 1988, STAT POWER ANAL BEHA, DOI [10.4324/9780203771587, DOI 10.4324/9780203771587]
  • [8] Estimating treatment effects for individual patients based on the results of randomised clinical trials
    Dorresteijn, Johannes A. N.
    Visseren, Frank L. J.
    Ridker, Paul M.
    Wassink, Annemarie M. J.
    Paynter, Nina P.
    Steyerberg, Ewout W.
    van der Graaf, Yolanda
    Cook, Nancy R.
    [J]. BMJ-BRITISH MEDICAL JOURNAL, 2011, 343
  • [9] The cyclin-dependent kinase 4/6 inhibitor palbociclib in combination with letrozole versus letrozole alone as first-line treatment of oestrogen receptor-positive, HER2-negative, advanced breast cancer (PALOMA-1/TRIO-18): a randomised phase 2 study
    Finn, Richard S.
    Crown, John P.
    Lang, Istvan
    Boer, Katalin
    Bondarenko, Igor M.
    Kulyk, Sergey O.
    Ettl, Johannes
    Patel, Ravindranath
    Pinter, Tamas
    Schmidt, Marcus
    Shparyk, Yaroslav
    Thummala, Anu R.
    Voytko, Nataliya L.
    Fowst, Camilla
    Huang, Xin
    Kim, Sindy T.
    Randolph, Sophia
    Slamon, Dennis J.
    [J]. LANCET ONCOLOGY, 2015, 16 (01) : 25 - 35
  • [10] Optimal full matching and related designs via network flows
    Hansen, Ben B.
    Klopfer, Stephanie Olsen
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2006, 15 (03) : 609 - 627