Strategies in adjusting for multiple comparisons: A primer for pediatric surgeons

被引:24
作者
Staffa, Steven J. [1 ]
Zurakowski, David [1 ]
机构
[1] Harvard Med Sch, Boston Childrens Hosp, Dept Surg, Boston, MA 02115 USA
关键词
Multiple comparisons; Type I error; Multiplicity; P value; Study design; Bonferroni;
D O I
10.1016/j.jpedsurg.2020.01.003
中图分类号
R72 [儿科学];
学科分类号
100202 ;
摘要
Background/Purpose: In pediatric surgery research, the issue of multiple comparisons commonly arises when there are multiple patient or experimental groups being compared two at a time on an outcome of interest. Performing multiple statistical comparisons increases the likelihood of finding a false positive result when there truly are no statistically significant group differences (falsely rejecting the null hypothesis when it is true). In order to control for the risk of false positive results, there are several statistical approaches that surgeons should consider in collaboration with a biostatistician when performing a study that is prone to the issue of false discovery related to multiple comparisons. It is becoming increasingly more common for high impact journals to require authors to carefully consider multiplicity in their studies. Therefore, the objective of this primer is to provide surgeons with a useful guide and recommendations on how to go about taking multiple comparisons into account to keep false positive results at an acceptable level. Methods: We provide background on the issue of multiple comparisons and risk of type I error and guidance on statistical approaches (i.e. multiple comparisons procedures) that can be implemented to control the type I false positive error rate based on the statistical analysis plan. These include, but are not limited to, the Bonferroni correction, the False Discovery Rate (FDR) approach, Tukey's procedure, Scheffers procedure, Holm's procedure, and Dunnett's procedure. Results: We present the results of the various approaches following one-way analysis of the variance (ANOVA) using a hypothetical surgical research example of the comparison between three experimental groups of rats on skin defect coverage for experimental spina bifida: the TRASCET group, sham control, and saline control. The ultimate decision in accounting for multiple comparisons is situation-dependent and surgeons should work with their statistical colleagues to ensure the best approach for controlling the type I error rate and interpreting the evidence when making multiple inferences and comparisons. Conclusions: The risk of rejecting the null hypothesis increases when multiple hypotheses arc tested using the same data. Surgeons should be aware of the available approaches and considerations to take into account multiplicity in the statistical plan or protocol of their clinical and basic science research studies. This strategy will improve their study design and ensure the most appropriate analysis of their data. Not adjusting for multiple comparisons can lead to misleading presentation of evidence to the surgical research community because of exaggerating treatment differences or effects. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:1699 / 1705
页数:7
相关论文
共 50 条
  • [31] SISVAR: A GUIDE FOR ITS BOOTSTRAP PROCEDURES IN MULTIPLE COMPARISONS
    Ferreira, Daniel Furtado
    CIENCIA E AGROTECNOLOGIA, 2014, 38 (02): : 109 - 112
  • [32] Advantages of the Closed Testing Method in Multiple Comparisons Procedures
    Giancristofaro, Rosa Arboretti
    Bolzan, Mario
    Bonnini, Stefano
    Corain, Livio
    Solmi, Francesca
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2012, 41 (06) : 746 - 763
  • [33] ON MAKING MULTIPLE COMPARISONS IN CLINICAL AND EXPERIMENTAL PHARMACOLOGY AND PHYSIOLOGY
    LUDBROOK, J
    CLINICAL AND EXPERIMENTAL PHARMACOLOGY AND PHYSIOLOGY, 1991, 18 (06): : 379 - 392
  • [34] Estimating the Proportion of True Null Hypotheses for Multiple Comparisons
    Jiang, Hongmei
    Doerge, R. W.
    CANCER INFORMATICS, 2008, 6 : 25 - 32
  • [35] Stepwise multiple tests for successive comparisons of treatment effects
    Liu, W
    Somerville, PN
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 46 (01) : 189 - 199
  • [36] POWER OF PAIRWISE MULTIPLE COMPARISONS IN THE UNEQUAL VARIANCE CASE
    HSIUNG, TH
    OLEJNIK, S
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 1994, 23 (03) : 691 - 710
  • [37] Multiple Comparisons Controlling Expected Number of False Discoveries
    Meng, Xianhua
    Wang, Jinglong
    Wu, Xianyi
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2014, 43 (13) : 2830 - 2843
  • [38] Exact simultaneous confidence intervals for multiple comparisons with the mean
    Soong, WC
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2001, 37 (01) : 33 - 47
  • [39] Multiple Comparisons for a Psychophysical Test in Bootstrap Logistic Regression
    Mita, Norihiro
    Sasaki, Hiroshi
    Kani, Kazutaka
    Tabuchi, Akio
    Hara, Heihachiro
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2014, 8 (03) : 339 - 355
  • [40] Kruskal-Wallis, multiple comparisons and Efron dice
    Brown, BM
    Hettmansperger, TP
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2002, 44 (04) : 427 - 438