Strategies in adjusting for multiple comparisons: A primer for pediatric surgeons

被引：24

作者：

Staffa, Steven J. ^{[1
]}

Zurakowski, David ^{[1
]}

机构：

[1] Harvard Med Sch, Boston Childrens Hosp, Dept Surg, Boston, MA 02115 USA

来源：

JOURNAL OF PEDIATRIC SURGERY | 2020年 / 55卷 / 09期

关键词：

Multiple comparisons; Type I error; Multiplicity; P value; Study design; Bonferroni;

D O I：

10.1016/j.jpedsurg.2020.01.003

中图分类号：

R72 [儿科学];

学科分类号：

100202 ;

摘要：

Background/Purpose: In pediatric surgery research, the issue of multiple comparisons commonly arises when there are multiple patient or experimental groups being compared two at a time on an outcome of interest. Performing multiple statistical comparisons increases the likelihood of finding a false positive result when there truly are no statistically significant group differences (falsely rejecting the null hypothesis when it is true). In order to control for the risk of false positive results, there are several statistical approaches that surgeons should consider in collaboration with a biostatistician when performing a study that is prone to the issue of false discovery related to multiple comparisons. It is becoming increasingly more common for high impact journals to require authors to carefully consider multiplicity in their studies. Therefore, the objective of this primer is to provide surgeons with a useful guide and recommendations on how to go about taking multiple comparisons into account to keep false positive results at an acceptable level. Methods: We provide background on the issue of multiple comparisons and risk of type I error and guidance on statistical approaches (i.e. multiple comparisons procedures) that can be implemented to control the type I false positive error rate based on the statistical analysis plan. These include, but are not limited to, the Bonferroni correction, the False Discovery Rate (FDR) approach, Tukey's procedure, Scheffers procedure, Holm's procedure, and Dunnett's procedure. Results: We present the results of the various approaches following one-way analysis of the variance (ANOVA) using a hypothetical surgical research example of the comparison between three experimental groups of rats on skin defect coverage for experimental spina bifida: the TRASCET group, sham control, and saline control. The ultimate decision in accounting for multiple comparisons is situation-dependent and surgeons should work with their statistical colleagues to ensure the best approach for controlling the type I error rate and interpreting the evidence when making multiple inferences and comparisons. Conclusions: The risk of rejecting the null hypothesis increases when multiple hypotheses arc tested using the same data. Surgeons should be aware of the available approaches and considerations to take into account multiplicity in the statistical plan or protocol of their clinical and basic science research studies. This strategy will improve their study design and ensure the most appropriate analysis of their data. Not adjusting for multiple comparisons can lead to misleading presentation of evidence to the surgical research community because of exaggerating treatment differences or effects. (C) 2020 Elsevier Inc. All rights reserved.

引用

页码：1699 / 1705

页数：7

共 50 条

[31] SISVAR: A GUIDE FOR ITS BOOTSTRAP PROCEDURES IN MULTIPLE COMPARISONS
Ferreira, Daniel Furtado
CIENCIA E AGROTECNOLOGIA, 2014, 38 (02): : 109 - 112
[32] Advantages of the Closed Testing Method in Multiple Comparisons Procedures
Giancristofaro, Rosa Arboretti
Bolzan, Mario
Bonnini, Stefano
Corain, Livio
Solmi, Francesca
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2012, 41 (06) : 746 - 763
[33] ON MAKING MULTIPLE COMPARISONS IN CLINICAL AND EXPERIMENTAL PHARMACOLOGY AND PHYSIOLOGY
LUDBROOK, J
CLINICAL AND EXPERIMENTAL PHARMACOLOGY AND PHYSIOLOGY, 1991, 18 (06): : 379 - 392
[34] Estimating the Proportion of True Null Hypotheses for Multiple Comparisons
Jiang, Hongmei
Doerge, R. W.
CANCER INFORMATICS, 2008, 6 : 25 - 32
[35] Stepwise multiple tests for successive comparisons of treatment effects
Liu, W
Somerville, PN
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 46 (01) : 189 - 199
[36] POWER OF PAIRWISE MULTIPLE COMPARISONS IN THE UNEQUAL VARIANCE CASE
HSIUNG, TH
OLEJNIK, S
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 1994, 23 (03) : 691 - 710
[37] Multiple Comparisons Controlling Expected Number of False Discoveries
Meng, Xianhua
Wang, Jinglong
Wu, Xianyi
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2014, 43 (13) : 2830 - 2843
[38] Exact simultaneous confidence intervals for multiple comparisons with the mean
Soong, WC
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2001, 37 (01) : 33 - 47
[39] Multiple Comparisons for a Psychophysical Test in Bootstrap Logistic Regression
Mita, Norihiro
Sasaki, Hiroshi
Kani, Kazutaka
Tabuchi, Akio
Hara, Heihachiro
JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2014, 8 (03) : 339 - 355
[40] Kruskal-Wallis, multiple comparisons and Efron dice
Brown, BM
Hettmansperger, TP
AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2002, 44 (04) : 427 - 438

← 1 2 3 4 5 →