Pervasive errors in hypothesis testing: Toward better statistical practice in nursing research

被引:8
作者
Staggs, Vincent S. [1 ,2 ]
机构
[1] Childrens Mercy Kansas City, Biostat & Epidemiol Core, Hlth Serv & Outcomes Res, 2401 Gillham Rd, Kansas City, MO 64108 USA
[2] Univ Missouri, Sch Med, 2411 Holmes St, Kansas City, MO 64108 USA
关键词
Statistics; Statistical methods; Research methods; Nursing research; POWER;
D O I
10.1016/j.ijnurstu.2019.06.012
中图分类号
R47 [护理学];
学科分类号
1011 ;
摘要
Background: In recent years several authors have documented common problems in the use of statistics in nursing research, including failure to consider the effects of multiple testing, inattention to clinical significance, and under-reporting of effect sizes and confidence intervals. More subtle forms of multiple testing are not as widely recognized, and abuse of researcher degrees of freedom has received little attention in the nursing research literature. These and other unsound practices in applying and interpreting statistics are problematic in themselves, and they arguably reflect an insufficiently clear understanding of statistical inference as a method for dealing with randomness among many researchers. Objectives: The goal of this educational paper is to improve the understanding and practice of inferential statistics among nursing researchers. An accessible explanation of hypothesis testing is provided, including discussion of the crucial concept of repeated sampling. Several pervasive mistakes and misconceptions in statistical inference are examined in detail, including misinterpretation of "non-significant" p-values as evidence for the null hypothesis, failure to account for forms of multiple testing that arise in model selection, abuse of researcher degrees of freedom, and hypothesis testing for baseline differences between arms in randomized trials. Recommendations for better statistical practice are offered. Conclusion: For the foreseeable future classical methods of statistical inference based on the idea of repeated sampling will be the primary tools for quantifying randomness in nursing research. The hypothesis testing framework, despite its limitations, can be helpful in ruling out chance as an explanation for observed effects. Nursing researchers who use quantitative methods, as well as journal reviewers and editors, should understand this framework well. Those involved in educating nursing researchers and those who teach statistics would do well to ask what changes need to be made to raise the level of statistical practice in nursing research. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:87 / 93
页数:7
相关论文
共 24 条
[1]   Estimating the reproducibility of psychological science [J].
Aarts, Alexander A. ;
Anderson, Joanna E. ;
Anderson, Christopher J. ;
Attridge, Peter R. ;
Attwood, Angela ;
Axt, Jordan ;
Babel, Molly ;
Bahnik, Stepan ;
Baranski, Erica ;
Barnett-Cowan, Michael ;
Bartmess, Elizabeth ;
Beer, Jennifer ;
Bell, Raoul ;
Bentley, Heather ;
Beyan, Leah ;
Binion, Grace ;
Borsboom, Denny ;
Bosch, Annick ;
Bosco, Frank A. ;
Bowman, Sara D. ;
Brandt, Mark J. ;
Braswell, Erin ;
Brohmer, Hilmar ;
Brown, Benjamin T. ;
Brown, Kristina ;
Bruening, Jovita ;
Calhoun-Sauls, Ann ;
Callahan, Shannon P. ;
Chagnon, Elizabeth ;
Chandler, Jesse ;
Chartier, Christopher R. ;
Cheung, Felix ;
Christopherson, Cody D. ;
Cillessen, Linda ;
Clay, Russ ;
Cleary, Hayley ;
Cloud, Mark D. ;
Cohn, Michael ;
Cohoon, Johanna ;
Columbus, Simon ;
Cordes, Andreas ;
Costantini, Giulio ;
Alvarez, Leslie D. Cramblet ;
Cremata, Ed ;
Crusius, Jan ;
DeCoster, Jamie ;
DeGaetano, Michelle A. ;
Della Penna, Nicolas ;
den Bezemer, Bobby ;
Deserno, Marie K. .
SCIENCE, 2015, 349 (6251)
[2]   Retire statistical significance [J].
Amrhein, Valentin ;
Greenland, Sander ;
McShane, Blake .
NATURE, 2019, 567 (7748) :305-307
[3]  
[Anonymous], 2015, SIGNIFICANCE, DOI DOI 10.1111/J.1740-9713.2015.00828.X
[4]   A descriptive study of effect-size reporting in research reviews [J].
Floyd, Judith A. .
JOURNAL OF ADVANCED NURSING, 2017, 73 (06) :1467-1481
[5]   Power, effects, confidence, and significance: An investigation of statistical practices in nursing research [J].
Gaskin, Cadeyrn J. ;
Happell, Brenda .
INTERNATIONAL JOURNAL OF NURSING STUDIES, 2014, 51 (05) :795-806
[6]   Power of mental health nursing research: A statistical analysis of studies in the International Journal of Mental Health Nursing [J].
Gaskin, Cadeyrn J. ;
Happell, Brenda .
INTERNATIONAL JOURNAL OF MENTAL HEALTH NURSING, 2013, 22 (01) :69-75
[7]  
Harrell FE, 2015, SPRINGER SER STAT, DOI 10.1007/978-3-319-19425-7
[8]   Moving nursing beyond p < .05 [J].
Hayat, Matthew J. ;
Staggs, Vincent S. ;
Schwartz, Todd A. ;
Higgins, Melinda ;
Azuero, Andres ;
Budhathoki, Chakra ;
Chandrasekhar, Rameela ;
Cook, Paul ;
Cramer, Emily ;
Dietrich, Mary S. ;
Garnier-Villarreal, Mauricio ;
Hanlon, Alexandra ;
He, Jinghua ;
Hu, Jinxiang ;
Kim, MyoungJin ;
Mueller, Martina ;
Nolan, Joseph R. ;
Perkhounkova, Yelena ;
Rothers, Janet ;
Schluck, Glenna ;
Su, Xiaogang ;
Templin, Thomas N. ;
Weaver, Michael T. ;
Yang, Qing ;
Ye, Sangbeak .
INTERNATIONAL JOURNAL OF NURSING STUDIES, 2019, 95 :A1-A2
[9]   Statistical Challenges in Nursing Education and Research An Expert Panel Consensus [J].
Hayat, Matthew J. ;
Higgins, Melinda ;
Schwartz, Todd A. ;
Staggs, Vincent S. .
NURSE EDUCATOR, 2015, 40 (01) :21-25
[10]   MORE POWERFUL PROCEDURES FOR MULTIPLE SIGNIFICANCE TESTING [J].
HOCHBERG, Y ;
BENJAMINI, Y .
STATISTICS IN MEDICINE, 1990, 9 (07) :811-818