Testing for baseline differences in randomized controlled trials: an unhealthy research behavior that is hard to eradicate

被引:247
作者
de Boer, Michiel R. [1 ,2 ]
Waterlander, Wilma E. [3 ]
Kuijper, Lothar D. J. [1 ,2 ]
Steenhuis, Ingrid H. M. [1 ,2 ]
Twisk, Jos W. R. [4 ]
机构
[1] Vrije Univ Amsterdam, Fac Earth & Life Sci, Dept Hlth Sci, NL-1081 HV Amsterdam, Netherlands
[2] Vrije Univ Amsterdam, Fac Earth & Life Sci, EMGO Inst Hlth & Care Res, NL-1081 HV Amsterdam, Netherlands
[3] Univ Auckland, Natl Inst Hlth Innovat, Sch Populat Hlth, Auckland 1142, New Zealand
[4] Vrije Univ Amsterdam, Med Ctr, Dept Epidemiol & Biostat, NL-1081 HV Amsterdam, Netherlands
关键词
Epidemiology; Methods; Reporting; Nutrition behavior research; Statistical analyses; SUBGROUP ANALYSIS; DISCOUNT;
D O I
10.1186/s12966-015-0162-z
中图分类号
R15 [营养卫生、食品卫生]; TS201 [基础科学];
学科分类号
100403 ;
摘要
Background: According to the CONSORT statement, significance testing of baseline differences in randomized controlled trials should not be performed. In fact, this practice has been discouraged by numerous authors throughout the last forty years. During that time span, reporting of baseline differences has substantially decreased in the leading general medical journals. Our own experience in the field of nutrition behavior research however, is that co-authors, reviewers and even editors are still very persistent in their demand for these tests. The aim of this paper is therefore to negate this demand by providing clear evidence as to why testing for baseline differences between intervention groups statistically is superfluous and why such results should not be published. Discussion: Testing for baseline differences is often propagated because of the belief that it shows whether randomization was successful and it identifies real or important differences between treatment arms that should be accounted for in the statistical analyses. Especially the latter argument is flawed, because it ignores the fact that the prognostic strength of a variable is also important when the interest is in adjustment for confounding. In addition, including prognostic variables as covariates can increase the precision of the effect estimate. This means that choosing covariates based on significance tests for baseline differences might lead to omissions of important covariates and, less importantly, to inclusion of irrelevant covariates in the analysis. We used data from four supermarket trials on the effects of pricing strategies on fruit and vegetables purchases, to show that results from fully adjusted analyses sometimes do appreciably differ from results from analyses adjusted for significant baseline differences only. We propose to adjust for known or anticipated important prognostic variables. These could or should be pre-specified in trial protocols. Subsequently, authors should report results from the fully adjusted as well as crude analyses, especially for dichotomous and time to event data. Summary: Based on our arguments, which were illustrated by our findings, we propose that journals in and outside the field of nutrition behavior actively adopt the CONSORT 2010 statement on this topic by not publishing significance tests for baseline differences anymore.
引用
收藏
页数:8
相关论文
共 18 条
[1]  
ALTMAN DG, 1985, J ROY STAT SOC D-STA, V34, P125
[2]   RANDOMIZATION AND BASE-LINE COMPARISONS IN CLINICAL-TRIALS [J].
ALTMAN, DG ;
DORE, CJ .
LANCET, 1990, 335 (8682) :149-153
[3]  
[Anonymous], STAT METHOD IN PRESS
[4]   Subgroup analysis and other (mis)uses of baseline data in clinical trials [J].
Assmann, SF ;
Pocock, SJ ;
Enos, LE ;
Kasten, LE .
LANCET, 2000, 355 (9209) :1064-1069
[5]   A substantial and confusing variation exists in handling of baseline covariates in randomized controlled trials: a review of trials published in leading medical journals [J].
Austin, Peter C. ;
Manca, Andrea ;
Zwarenstein, Merrick ;
Juurlink, David N. ;
Stanbrook, Matthew B. .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2010, 63 (02) :142-153
[6]   Should we adjust for covariates in nonlinear regression analyses of randomized trials? [J].
Hauck, WW ;
Anderson, S ;
Marcus, SM .
CONTROLLED CLINICAL TRIALS, 1998, 19 (03) :249-256
[7]   DESIGNS FOR EXPERIMENTS - PARALLEL COMPARISONS OF TREATMENT [J].
LAVORI, PW ;
LOUIS, TA ;
BAILAR, JC ;
POLANSKY, M .
NEW ENGLAND JOURNAL OF MEDICINE, 1983, 309 (21) :1291-1299
[8]  
Moher D, 2010, J Clin Epidemiol, V2010, P1
[9]   Subgroup analysis, covariate adjustment and baseline comparisons in clinical trial reporting: current practice and problems [J].
Pocock, SJ ;
Assmann, SE ;
Enos, LE ;
Kasten, LE .
STATISTICS IN MEDICINE, 2002, 21 (19) :2917-2930
[10]  
ROTHMAN KJ, 1977, CANCER, V39, P1771, DOI 10.1002/1097-0142(197704)39:4+<1771::AID-CNCR2820390803>3.0.CO