Half-Normal Plots and Overdispersed Models in R: The hnp Package

被引:268
作者
Moral, Rafael A. [1 ]
Hinde, John [2 ]
Demetrio, Clarice G. B. [1 ]
机构
[1] Univ Sao Paulo, ESALQ, Dept Ciencias Exatas, Av Padua Dias 11, Piracicaba, SP, Brazil
[2] NUI Galway, Sch Math Stat & Appl Math, Univ Rd, Galway, Ireland
基金
巴西圣保罗研究基金会;
关键词
goodness-of-fit; generalized linear models; mixed models; R; REGRESSION;
D O I
10.18637/jss.v081.i10
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Count and proportion data may present overdispersion, i.e., greater variability than expected by the Poisson and binomial models, respectively. Different extended generalized linear models that allow for overdispersion may be used to analyze this type of data, such as models that use a generalized variance function, random-effects models, zero-inflated models and compound distribution models. Assessing goodness-of-fit and verifying assumptions of these models is not an easy task and the use of half-normal plots with a simulated envelope is a possible solution for this problem. These plots are a useful indicator of goodness-of-fit that may be used with any generalized linear model and extensions. For GLIM users, functions that generated these plots were widely used, however, in the open-source software R, these functions were not yet available on the Comprehensive R Archive Network (CRAN). We describe a new package in R, hnp, that may be used to generate the half-normal plot with a simulated envelope for residuals from different types of models. The function hnp ( ) can be used together with a range of different model fitting packages in R that extend the basic generalized linear model fitting in glm ( ) and is written so that it is relatively easy to extend it to new model classes and different diagnostics. We illustrate its use on a range of examples, including continuous and discrete responses, and show how it can be used to inform model selection and diagnose overdispersion.
引用
收藏
页码:1 / 23
页数:23
相关论文
共 34 条
[1]  
[Anonymous], ENCY ENV
[2]  
[Anonymous], 2017, survival: Survival Analysis Routines for R. R package version 2.41.3
[3]  
Atkinson A.C., 1985, PLOTS TRANSFORMATION
[4]   Properties of sufficiency and statistical tests [J].
Bartlett, MS .
PROCEEDINGS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL AND PHYSICAL SCIENCES, 1937, 160 (A901) :0268-0282
[5]   Fitting Linear Mixed-Effects Models Using lme4 [J].
Bates, Douglas ;
Maechler, Martin ;
Bolker, Benjamin M. ;
Walker, Steven C. .
JOURNAL OF STATISTICAL SOFTWARE, 2015, 67 (01) :1-48
[6]  
Blom G., 1958, Statistical estimates and transformed betavariables
[7]  
Cribari-Neto F, 2010, J STAT SOFTW, V34, P1
[8]  
Demtrio C., 2014, Ecol Model Appl Entomol, DOI [DOI 10.1007/978-3-319-06877-09, 10.1007/978-3-319-06877-0_9, DOI 10.1007/978-3-319-06877-0_9]
[9]  
Demtrio CGB., 1997, GLM NEWSL, V27, P19
[10]  
Doran H, 2007, J STAT SOFTW, V20, P18, DOI DOI 10.18637/JSS.V020.I02