Strategies in adjusting for multiple comparisons: A primer for pediatric surgeons

被引:24
作者
Staffa, Steven J. [1 ]
Zurakowski, David [1 ]
机构
[1] Harvard Med Sch, Boston Childrens Hosp, Dept Surg, Boston, MA 02115 USA
关键词
Multiple comparisons; Type I error; Multiplicity; P value; Study design; Bonferroni;
D O I
10.1016/j.jpedsurg.2020.01.003
中图分类号
R72 [儿科学];
学科分类号
100202 ;
摘要
Background/Purpose: In pediatric surgery research, the issue of multiple comparisons commonly arises when there are multiple patient or experimental groups being compared two at a time on an outcome of interest. Performing multiple statistical comparisons increases the likelihood of finding a false positive result when there truly are no statistically significant group differences (falsely rejecting the null hypothesis when it is true). In order to control for the risk of false positive results, there are several statistical approaches that surgeons should consider in collaboration with a biostatistician when performing a study that is prone to the issue of false discovery related to multiple comparisons. It is becoming increasingly more common for high impact journals to require authors to carefully consider multiplicity in their studies. Therefore, the objective of this primer is to provide surgeons with a useful guide and recommendations on how to go about taking multiple comparisons into account to keep false positive results at an acceptable level. Methods: We provide background on the issue of multiple comparisons and risk of type I error and guidance on statistical approaches (i.e. multiple comparisons procedures) that can be implemented to control the type I false positive error rate based on the statistical analysis plan. These include, but are not limited to, the Bonferroni correction, the False Discovery Rate (FDR) approach, Tukey's procedure, Scheffers procedure, Holm's procedure, and Dunnett's procedure. Results: We present the results of the various approaches following one-way analysis of the variance (ANOVA) using a hypothetical surgical research example of the comparison between three experimental groups of rats on skin defect coverage for experimental spina bifida: the TRASCET group, sham control, and saline control. The ultimate decision in accounting for multiple comparisons is situation-dependent and surgeons should work with their statistical colleagues to ensure the best approach for controlling the type I error rate and interpreting the evidence when making multiple inferences and comparisons. Conclusions: The risk of rejecting the null hypothesis increases when multiple hypotheses arc tested using the same data. Surgeons should be aware of the available approaches and considerations to take into account multiplicity in the statistical plan or protocol of their clinical and basic science research studies. This strategy will improve their study design and ensure the most appropriate analysis of their data. Not adjusting for multiple comparisons can lead to misleading presentation of evidence to the surgical research community because of exaggerating treatment differences or effects. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:1699 / 1705
页数:7
相关论文
共 50 条
  • [1] Statistical power and sample size calculations: A primer for pediatric surgeons
    Staffa, Steven J.
    Zurakowski, David
    JOURNAL OF PEDIATRIC SURGERY, 2020, 55 (07) : 1173 - 1179
  • [2] LOPSIDED REASONING ON LOPSIDED TESTS AND MULTIPLE COMPARISONS
    Hurlbert, Stuart H.
    Lombardi, Celia M.
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2012, 54 (01) : 23 - 42
  • [3] Multiple testings: Multiple comparisons and multiple endpoints
    Chi, GYH
    DRUG INFORMATION JOURNAL, 1998, 32 : 1347S - 1362S
  • [4] Comparing multiple comparisons: practical guidance for choosing the best multiple comparisons test
    Midway, Stephen
    Robertson, Matthew
    Flinn, Shane
    Kaller, Michael
    PEERJ, 2020, 8
  • [5] Simple stepwise tests of hypotheses and multiple comparisons
    Seneta, E
    Chen, JT
    INTERNATIONAL STATISTICAL REVIEW, 2005, 73 (01) : 21 - 34
  • [6] Challenges in handling multiple comparisons in reproductive research
    Chae-Kim, Jennifer
    Flannagan, Kerry
    Hill, Kayla
    Hill, Micah J.
    FERTILITY AND STERILITY, 2024, 121 (06) : 909 - 913
  • [7] Nonparametric multiple comparisons
    Kimihiro Noguchi
    Riley S. Abel
    Fernando Marmolejo-Ramos
    Frank Konietschke
    Behavior Research Methods, 2020, 52 : 489 - 502
  • [8] Multiple comparisons of biodiversity
    Rogers, JA
    Hsu, JC
    BIOMETRICAL JOURNAL, 2001, 43 (05) : 617 - 625
  • [9] Nonparametric multiple comparisons
    Noguchi, Kimihiro
    Abel, Riley S.
    Marmolejo-Ramos, Fernando
    Konietschke, Frank
    BEHAVIOR RESEARCH METHODS, 2020, 52 (02) : 489 - 502
  • [10] Adjusting for multiple testing - when and how?
    Bender, R
    Lange, S
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2001, 54 (04) : 343 - 349