Polya-Aeppli regression model for overdispersed count data

被引:4
|
作者
Borges, Patrick [1 ]
Godoi, Luciana G. [1 ]
机构
[1] Univ Fed Espirito Santo, Dept Estat, Ave Fernando Ferrari 514, BR-29075910 Vitoria, ES, Brazil
关键词
bootstrap; EM algorithm; Generalized linear models (GLM); overdispersion; zero-inflated models;
D O I
10.1177/1471082X18766797
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The log-linear Poisson model, characterized by linear variance function and a logarithmic relation between means and covariates, embedded in the exponential family regression framework provided by generalized linear models (GLM) is still the standard approach for analyzing count data responses with regression models. In practice, however, count data are often overdispersed and, thus, not conducive to Poisson regression. Therefore, the main goal of this article is to introduce a log-linear model based on the P null ' lya-Aeppli (PA) distribution, which is an extension of the Poisson distribution by including a dispersion parameter rho, to address the problem of overdispersion. Maximum likelihood (ML) estimation procedure is discussed as well as a test for determining the need for a PA regression over a standard Poisson regression. In addition, a simple EM-type algorithm for iteratively computing ML estimates is presented. In order to study departures from the error assumption as well as the presence of outliers, we perform residual analysis based on the standardized Pearson residuals. Furthermore, for different parameter settings and sample sizes, various simulations are performed. Finally, we also illustrated the new method on three real datasets, two of them are from biological researches and the other is from a violence study.
引用
收藏
页码:362 / 385
页数:24
相关论文
共 50 条
  • [1] Regression to the mean for overdispersed count data
    Iftikhar, Kiran
    Khan, Manzoor
    Olivier, Jake
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2025, 234
  • [2] A hyper-Poisson regression model for overdispersed and underdispersed count data
    Saez-Castillo, A. J.
    Conde-Sanchez, A.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 61 : 148 - 157
  • [3] Distributions to model overdispersed count data
    Coly, Sylvain
    Yao, Anne-Franoise
    Abrial, David
    Charras-Garrido, Myriam
    JOURNAL OF THE SFDS, 2016, 157 (02): : 39 - 63
  • [4] A generalized model for overdispersed count data
    Okamura, Hiroshi
    Punt, Andre E.
    Amano, Tatsuya
    POPULATION ECOLOGY, 2012, 54 (03) : 467 - 474
  • [5] A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data
    Bertoli, Wesley
    Conceicao, Katiane S.
    Andrade, Marinho G.
    Louzada, Francisco
    ENTROPY, 2021, 23 (06)
  • [6] Structured additive regression for overdispersed and zero-inflated-count data
    Fahrmeir, Ludwig
    Echavarria, Leyre Osuna
    APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2006, 22 (04) : 351 - 369
  • [7] Count Regression and Machine Learning Techniques for Zero-Inflated Overdispersed Count Data: Application to Ecological Data
    Sidumo B.
    Sonono E.
    Takaidza I.
    Annals of Data Science, 2024, 11 (03) : 803 - 817
  • [8] Time Series Regression for Zero-Inflated and Overdispersed Count Data: A Functional Response Model Approach
    M. Ghahramani
    S. S. White
    Journal of Statistical Theory and Practice, 2020, 14
  • [9] Time Series Regression for Zero-Inflated and Overdispersed Count Data: A Functional Response Model Approach
    Ghahramani, M.
    White, S. S.
    JOURNAL OF STATISTICAL THEORY AND PRACTICE, 2020, 14 (02)
  • [10] A marginalized model for zero-inflated, overdispersed and correlated count data
    Iddia, Samuel
    Molenberghs, Geert
    ELECTRONIC JOURNAL OF APPLIED STATISTICAL ANALYSIS, 2013, 6 (02) : 149 - 165