The statistical interpretation of pilot trials: should significance thresholds be reconsidered?

被引:289
作者
Lee, Ellen C. [1 ]
Whitehead, Amy L. [1 ]
Jacques, Richard M. [1 ]
Julious, Steven A. [1 ]
机构
[1] Univ Sheffield, Sch Hlth & Related Res ScHARR, Med Stat Grp, Sheffield S1 4DA, S Yorkshire, England
关键词
Pilot trial; Power; Type I error; Confidence interval; Significance; Bayesian methods; SURROGATE END-POINTS; RANDOMIZED CONTROLLED-TRIAL; CLINICAL-TRIALS; SAMPLE-SIZES; HEALTH; DESIGN;
D O I
10.1186/1471-2288-14-41
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: In an evaluation of a new health technology, a pilot trial may be undertaken prior to a trial that makes a definitive assessment of benefit. The objective of pilot studies is to provide sufficient evidence that a larger definitive trial can be undertaken and, at times, to provide a preliminary assessment of benefit. Methods: We describe significance thresholds, confidence intervals and surrogate markers in the context of pilot studies and how Bayesian methods can be used in pilot trials. We use a worked example to illustrate the issues raised. Results: We show how significance levels other than the traditional 5% should be considered to provide preliminary evidence for efficacy and how estimation and confidence intervals should be the focus to provide an estimated range of possible treatment effects. We also illustrate how Bayesian methods could also assist in the early assessment of a health technology. Conclusions: We recommend that in pilot trials the focus should be on descriptive statistics and estimation, using confidence intervals, rather than formal hypothesis testing and that confidence intervals other than 95% confidence intervals, such as 85% or 75%, be used for the estimation. The confidence interval should then be interpreted with regards to the minimum clinically important difference. We also recommend that Bayesian methods be used to assist in the interpretation of pilot trials. Surrogate endpoints can also be used in pilot trials but they must reliably predict the overall effect on the clinical outcome.
引用
收藏
页数:8
相关论文
共 29 条
[1]   What is a pilot or feasibility study? A review of current practice and editorial policy [J].
Arain, Mubashir ;
Campbell, Michael J. ;
Cooper, Cindy L. ;
Lancaster, Gillian A. .
BMC MEDICAL RESEARCH METHODOLOGY, 2010, 10
[2]   Biomarkers and surrogate endpoints: Preferred definitions and conceptual framework [J].
Atkinson, AJ ;
Colburn, WA ;
DeGruttola, VG ;
DeMets, DL ;
Downing, GJ ;
Hoth, DF ;
Oates, JA ;
Peck, CC ;
Schooley, RT ;
Spilker, BA ;
Woodcock, J ;
Zeger, SL .
CLINICAL PHARMACOLOGY & THERAPEUTICS, 2001, 69 (03) :89-95
[3]  
Carpenter JamesR., 2013, MULTIPLE IMPUTATION
[4]   Sample size and the probability of a successful trial [J].
Chuang-Stein, Christy .
PHARMACEUTICAL STATISTICS, 2006, 5 (04) :305-309
[5]   A Quantitative Approach for Making Go/No-Go Decisions in Drug Development [J].
Chuang-Stein, Christy ;
Kirby, Simon ;
French, Jonathan ;
Kowalski, Ken ;
Marshall, Scott ;
Smith, Mike K. ;
Bycott, Paul ;
Beltangady, Mohan .
DRUG INFORMATION JOURNAL, 2011, 45 (02) :187-202
[6]  
Collins K, 2007, BRI J COMMUN HLTH NU, V2, P156
[7]   Considerations in the evaluation of surrogate endpoints in clinical trials: Summary of a National Institutes of Health Workshop [J].
De Gruttola, VG ;
Clax, P ;
DeMets, DL ;
Downing, GJ ;
Ellenberg, SS ;
Friedman, L ;
Gail, MH ;
Prentice, R ;
Wittes, J ;
Zeger, SL .
CONTROLLED CLINICAL TRIALS, 2001, 22 (05) :485-502
[8]   Surrogate end points in clinical trials: Are we being misled? [J].
Fleming, TR ;
DeMets, DL .
ANNALS OF INTERNAL MEDICINE, 1996, 125 (07) :605-613
[9]  
International Conference on Harmonisation, 1998, ICH E9 STAT PRINC CL
[10]   Moving statistics beyond the individual clinical trial: applying decision science to optimize a clinical development plan [J].
Julious, SA ;
Swank, DJ .
PHARMACEUTICAL STATISTICS, 2005, 4 (01) :37-46