A data transformation to deal with constant under/over-dispersion in Poisson and binomial regression models

被引:2
作者
Vanegas, Luis Hernando [1 ]
Rondon, Luz Marina [1 ]
机构
[1] Univ Nacl Colombia, Dept Estadist, Bogota, Colombia
关键词
Count data; Poisson; proportions; binomial; under-dispersion; over-dispersion; data transformation;
D O I
10.1080/00949655.2020.1749276
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper proposes a data transformation to deal with the presence of constant under/over-dispersion relative to the Poisson or binomial assumptions. The proposed methodology is very simple as it does not require to replace the Poisson or binomial by more complex regression models based on more flexible distributions. The new approach consist of a transformation of the response variable, followed by the analysis of its relation with the covariates by applying to the transformed variable the usual Poisson or binomial regression models. The transformation depends on a tuning parameter, which can be easily chosen using a straightforward criterion. The efectiveness of the proposed approach is illustrated by simulation experiments and by analyzing six real data sets.
引用
收藏
页码:1811 / 1833
页数:23
相关论文
共 22 条
[1]  
Agresti A, 2015, Foundations of linear and generalized linear models
[2]  
[Anonymous], 2019, A Language and Environment for Statistical Computing
[3]  
[Anonymous], 2012, Overdispersion models in SAS
[4]  
Cameron AC, 2013, Regression analysis of count data
[5]  
Collet D., 2003, Modeling Binary Data, V2nd
[6]   Asymptotic inference under heteroskedasticity of unknown form [J].
Cribari-Neto, F .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 45 (02) :215-233
[7]  
Crowder M. J., 1978, Ap-plied statistics, P34, DOI [10.2307/2346223, DOI 10.2307/2346223]
[8]  
Dunn P.K., 1996, J Comput Graph Stat, V5, P236, DOI 10.2307/1390802
[9]   Modelling species abundance using the Poisson-Tweedie family [J].
El-Shaarawi, Abdel H. ;
Zhu, Rong ;
Joe, Harry .
ENVIRONMETRICS, 2011, 22 (02) :152-164
[10]  
Fahrmeir L., 2001, MULTIVARIATE STAT MO, Vsecond