Sensitivity and specificity of normality tests and consequences on reference interval accuracy at small sample size: a computer-simulation study

被引:89
作者
Le Boedec, Kevin [1 ]
机构
[1] Univ Illinois, Coll Vet Med, Urbana, IL 61801 USA
关键词
Gaussian distribution; nonparametric method; parametric method; Shapiro-Wilk test; REFERENCE VALUES;
D O I
10.1111/vcp.12390
中图分类号
S85 [动物医学(兽医学)];
学科分类号
0906 ;
摘要
BackgroundAccording to international guidelines, parametric methods must be chosen for RI construction when the sample size is small and the distribution is Gaussian. However, normality tests may not be accurate at small sample size. ObjectivesThe purpose of the study was to evaluate normality test performance to properly identify samples extracted from a Gaussian population at small sample sizes, and assess the consequences on RI accuracy of applying parametric methods to samples that falsely identified the parent population as Gaussian. MethodsSamples of n = 60 and n = 30 values were randomly selected 100 times from simulated Gaussian, lognormal, and asymmetric populations of 10,000 values. The sensitivity and specificity of 4 normality tests were compared. Reference intervals were calculated using 6 different statistical methods from samples that falsely identified the parent population as Gaussian, and their accuracy was compared. ResultsShapiro-Wilk and D'Agostino-Pearson tests were the best performing normality tests. However, their specificity was poor at sample size n = 30 (specificity for P < .05: .51 and .50, respectively). The best significance levels identified when n = 30 were 0.19 for Shapiro-Wilk test and 0.18 for D'Agostino-Pearson test. Using parametric methods on samples extracted from a lognormal population but falsely identified as Gaussian led to clinically relevant inaccuracies. ConclusionsAt small sample size, normality tests may lead to erroneous use of parametric methods to build RI. Using nonparametric methods (or alternatively Box-Cox transformation) on all samples regardless of their distribution or adjusting, the significance level of normality tests depending on sample size would limit the risk of constructing inaccurate RI.
引用
收藏
页码:648 / 656
页数:9
相关论文
共 20 条
[1]  
BOYD JC, 1982, CLIN CHEM, V28, P1735
[2]   HEALTH, NORMALITY, AND GHOST OF GAUSS [J].
ELVEBACK, LR ;
GUILLIER, CL ;
KEATING, FR .
JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1970, 211 (01) :69-&
[3]  
Farver TB, 2008, CLINICAL BIOCHEMISTRY OF DOMESTIC ANIMALS, 6TH EDITION, P1, DOI 10.1016/B978-0-12-370491-7.00001-5
[4]   ASVCP reference interval guidelines: determination of de novo reference intervals in veterinary species and other related topics [J].
Friedrichs, Kristen R. ;
Harr, Kendal E. ;
Freeman, Kathy P. ;
Szladovits, Balazs ;
Walton, Raquel M. ;
Barnhart, Kirstin F. ;
Blanco-Chavez, Julia .
VETERINARY CLINICAL PATHOLOGY, 2012, 41 (04) :441-453
[5]   Reference values: a review [J].
Geffre, Anne ;
Friedrichs, Kristen ;
Harr, Kendal ;
Concordet, Didier ;
Trumel, Catherine ;
Braun, Jean-Pierre .
VETERINARY CLINICAL PATHOLOGY, 2009, 38 (03) :288-298
[6]   Observed, unknown distributions of clinical chemical quantities should be considered to be log-normal: a proposal [J].
Haeckel, Rainer ;
Wosniok, Werner .
CLINICAL CHEMISTRY AND LABORATORY MEDICINE, 2010, 48 (10) :1393-1396
[7]  
Harris EK, 1995, STAT BASES REFERENCE, P386
[8]   Reference intervals: an update [J].
Horn, PS ;
Pesce, AJ .
CLINICA CHIMICA ACTA, 2003, 334 (1-2) :5-23
[9]  
Horn PS, 1998, CLIN CHEM, V44, P622
[10]  
Horowitz GL, 2010, DEFINING ESTABLISHIN