Polya-Aeppli regression model for overdispersed count data

被引:4
|
作者
Borges, Patrick [1 ]
Godoi, Luciana G. [1 ]
机构
[1] Univ Fed Espirito Santo, Dept Estat, Ave Fernando Ferrari 514, BR-29075910 Vitoria, ES, Brazil
关键词
bootstrap; EM algorithm; Generalized linear models (GLM); overdispersion; zero-inflated models;
D O I
10.1177/1471082X18766797
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The log-linear Poisson model, characterized by linear variance function and a logarithmic relation between means and covariates, embedded in the exponential family regression framework provided by generalized linear models (GLM) is still the standard approach for analyzing count data responses with regression models. In practice, however, count data are often overdispersed and, thus, not conducive to Poisson regression. Therefore, the main goal of this article is to introduce a log-linear model based on the P null ' lya-Aeppli (PA) distribution, which is an extension of the Poisson distribution by including a dispersion parameter rho, to address the problem of overdispersion. Maximum likelihood (ML) estimation procedure is discussed as well as a test for determining the need for a PA regression over a standard Poisson regression. In addition, a simple EM-type algorithm for iteratively computing ML estimates is presented. In order to study departures from the error assumption as well as the presence of outliers, we perform residual analysis based on the standardized Pearson residuals. Furthermore, for different parameter settings and sample sizes, various simulations are performed. Finally, we also illustrated the new method on three real datasets, two of them are from biological researches and the other is from a violence study.
引用
收藏
页码:362 / 385
页数:24
相关论文
共 50 条
  • [21] A new regression model for overdispersed binomial data accounting for outliers and an excess of zeros
    Ascari, Roberto
    Migliorati, Sonia
    STATISTICS IN MEDICINE, 2021, 40 (17) : 3895 - 3914
  • [22] A generalized Waring regression model for count data
    Rodriguez-Avi, J.
    Conde-Sanchez, A.
    Saez-Castillo, A. J.
    Olmo-Jimenez, M. J.
    Martinez-Rodriguez, A. M.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (10) : 3717 - 3725
  • [23] A contaminated regression model for count health data
    Otto, Arnoldus F.
    Ferreira, Johannes T.
    Tomarchio, Salvatore Daniele
    Bekker, Andriette
    Punzo, Antonio
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2025, 34 (02) : 369 - 389
  • [24] Score Tests for Zero-Inflation in Overdispersed Count Data
    Yang, Zhao
    Hardin, James W.
    Addy, Cheryl L.
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2010, 39 (11) : 2008 - 2030
  • [25] Semiparametric models for multilevel overdispersed count data with extra zeros
    Mahmoodi, Marzieh
    Moghimbeigi, Abbas
    Mohammad, Kazem
    Faradmal, Javad
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2018, 27 (04) : 1187 - 1201
  • [26] COMPARING METHODS FOR ANALYZING OVERDISPERSED COUNT DATA IN AQUATIC TOXICOLOGY
    Noe, Douglas A.
    Bailer, A. John
    Noble, Robert B.
    ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY, 2010, 29 (01) : 212 - 219
  • [27] A non-parametric model to address overdispersed count response in a longitudinal data setting with missingness
    Zhang, Hui
    He, Hua
    Lu, Naiji
    Zhu, Liang
    Zhang, Bo
    Zhang, Zhiwei
    Tang, Li
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2017, 26 (03) : 1461 - 1475
  • [28] Regression models for overdispersed jejunal surviving crypts data
    Dong Kee Kim
    In Vitro Cellular & Developmental Biology - Animal, 2002, 38 : 242 - 245
  • [29] ANALYSIS OF OVERDISPERSED COUNT DATA: AN APPLICATION ON ACAR (ACARINA) COUNTS
    Akkol, Suna
    Denizhan, Evsel
    COMPTES RENDUS DE L ACADEMIE BULGARE DES SCIENCES, 2016, 69 (08): : 1091 - 1100
  • [30] Mean and Variance Modeling of Under- and Overdispersed Count Data
    Smith, David M.
    Faddy, Malcolm J.
    JOURNAL OF STATISTICAL SOFTWARE, 2016, 69 (06): : 1 - 23