A logical analysis of null hypothesis significance testing using popular terminology

被引:1
作者
McNulty, Richard [1 ]
机构
[1] Blacktown Mt Druitt Hosp, Emergency Dept, Blacktown Rd, Sydney, NSW 2148, Australia
关键词
Logic; Null hypothesis significance test; Hypothesis testing; Statistical inference; Statistical significance; Type I error; Type II error; Power;
D O I
10.1186/s12874-022-01696-5
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Null Hypothesis Significance Testing (NHST) has been well criticised over the years yet remains a pillar of statistical inference. Although NHST is well described in terms of statistical models, most textbooks for non-statisticians present the null and alternative hypotheses (H-0 and H-A, respectively) in terms of differences between groups such as (mu(1) = mu(2)) and (mu(1) not equal mu(2) ) and H-A is often stated to be the research hypothesis. Here we use propositional calculus to analyse the internal logic of NHST when couched in this popular terminology. The testable H-0 is determined by analysing the scope and limits of the P-value and the test statistic's probability distribution curve. Results: We propose a minimum axiom set NHST in which it is taken as axiomatic that H o is rejected if P-value< a. Using the common scenario of the comparison of the means of two sample groups as an example, the testable H-0 is {(mu(1) = mu(2)) and [(<(x)over bar>(1) not equal (x)over bar>(2)) due to chance alone]}. The H-0 and H-A pair should be exhaustive to avoid false dichotomies. This entails that H-A is -{(mu(1) = mu(2)) and [((x) over bar (1) not equal (x) over bar (2) ) due to chance alone]}, rather than the research hypothesis (H-T). To see the relationship between H-A and H-T, H-A can be rewritten as the disjunction H-A: ({(mu(1), = mu(2)) boolean AND [((x) over bar (1) not equal (x)over bar>(2)) not due to chance alone]} boolean OR {(mu(1), not equal mu(2)) boolean AND [((x) over bar (1) not equal (x)over bar>(2)) not due to (mu(1) not equal mu(2)) alone]} boolean OR {(mu(1) not equal mu(2)) boolean AND [((x) over bar (1) not equal (x)over bar>(2)) due to (mu(1) not equal mu(2)) alone]}). This reveals that H-T (the last disjunct in bold) is just one possibility within H-A. It is only by adding premises to NHST that H-T or other conclusions can be reached. Conclusions: Using this popular terminology for NHST, analysis shows that the definitions of H-0 and H-A differ from those found in textbooks. In this framework, achieving a statistically significant result only justifies the broad conclusion that the results are not due to chance alone, not that the research hypothesis is true. More transparency is needed concerning the premises added to NHST to rig particular conclusions such as H-T. There are also ramifications for the interpretation of Type I and II errors, as well as power, which do not specifically refer to H-T as claimed by texts.
引用
收藏
页数:9
相关论文
共 47 条
[1]  
Altman DG., 1996, PRACTICAL STAT MED R, V7th
[2]  
[Anonymous], 2009, Medical Statistics at a Glance, The theory of linear regression and performing a linear regression analysis
[3]  
[Anonymous], 2009, STAT SQUARE ONE
[4]  
[Anonymous], 2015, INTRO MED STAT
[5]  
Argyrous G., 2000, Statistics for social and health research
[6]  
Campbell MJ., 2007, MED STAT TXB HLTH SC, P331
[7]  
Dahlberg L., 2010, PRACTICAL RES EVALUA
[8]  
Daly LE, 2000, Interpretation and Use of Medical Statistics
[9]  
Daniel W.W., 2009, BIOSTATISTICS FDN AN, V9th
[10]  
Everitt B., 2006, MED STAT A Z GUIDE C, V2nd ed.