Polya-Aeppli regression model for overdispersed count data

被引:4
作者
Borges, Patrick [1 ]
Godoi, Luciana G. [1 ]
机构
[1] Univ Fed Espirito Santo, Dept Estat, Ave Fernando Ferrari 514, BR-29075910 Vitoria, ES, Brazil
关键词
bootstrap; EM algorithm; Generalized linear models (GLM); overdispersion; zero-inflated models;
D O I
10.1177/1471082X18766797
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The log-linear Poisson model, characterized by linear variance function and a logarithmic relation between means and covariates, embedded in the exponential family regression framework provided by generalized linear models (GLM) is still the standard approach for analyzing count data responses with regression models. In practice, however, count data are often overdispersed and, thus, not conducive to Poisson regression. Therefore, the main goal of this article is to introduce a log-linear model based on the P null ' lya-Aeppli (PA) distribution, which is an extension of the Poisson distribution by including a dispersion parameter rho, to address the problem of overdispersion. Maximum likelihood (ML) estimation procedure is discussed as well as a test for determining the need for a PA regression over a standard Poisson regression. In addition, a simple EM-type algorithm for iteratively computing ML estimates is presented. In order to study departures from the error assumption as well as the presence of outliers, we perform residual analysis based on the standardized Pearson residuals. Furthermore, for different parameter settings and sample sizes, various simulations are performed. Finally, we also illustrated the new method on three real datasets, two of them are from biological researches and the other is from a violence study.
引用
收藏
页码:362 / 385
页数:24
相关论文
共 50 条
  • [41] Exponential dispersion models for overdispersed zero-inflated count data
    Bar-Lev, Shaul K.
    Ridder, Ad
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (07) : 3286 - 3304
  • [42] Pathway-based genetic association analysis for overdispersed count data
    Liu, Yang
    JOURNAL OF APPLIED STATISTICS, 2025,
  • [43] A comparison study on modeling of clustered and overdispersed count data for multiple comparisons
    Kruppa, Jochen
    Hothorn, Ludwig
    JOURNAL OF APPLIED STATISTICS, 2021, 48 (16) : 3220 - 3232
  • [44] Model selection with overdispersed distance sampling data
    Howe, Eric J.
    Buckland, Stephen T.
    Despres-Einspenner, Marie-Lyne
    Kuehl, Hjalmar S.
    METHODS IN ECOLOGY AND EVOLUTION, 2019, 10 (01): : 38 - 47
  • [45] On the Bell distribution and its associated regression model for count data
    Castellares, Fredy
    Ferrari, Silvia L. P.
    Lemonte, Artur J.
    APPLIED MATHEMATICAL MODELLING, 2018, 56 : 172 - 185
  • [46] Zero-inflated Modified Borel-Tanner Regression Model for Count Data
    Hassan, Anwar
    Ahmad, Ishfaq S.
    Ahmad, Peer Bilal
    AUSTRIAN JOURNAL OF STATISTICS, 2022, 51 (02) : 28 - 39
  • [47] Hierarchical Bayesian Models for Small Area Estimation under Overdispersed Count Data
    Wulandari, Ita
    Notodiputro, Khairil Anwar
    Fitrianto, Anwar
    Kurnia, Anang
    ENGINEERING LETTERS, 2023, 31 (04) : 1333 - 1342
  • [48] A TRANSITION MODEL FOR ANALYSIS OF ZERO-INFLATED LONGITUDINAL COUNT DATA USING GENERALIZED POISSON REGRESSION MODEL
    Baghfalaki, Taban
    Ganjali, Mojtaba
    REVSTAT-STATISTICAL JOURNAL, 2020, 18 (01) : 27 - 45
  • [49] A bivariate zero-inflated negative binomial regression model for count data with excess zeros
    Wang, PM
    ECONOMICS LETTERS, 2003, 78 (03) : 373 - 378
  • [50] A Poisson Regression Model For Analysis of Censored Count Data with Excess Zeroes
    Saffari, Seyed Ehsan
    Adnan, Robiah
    Greene, William
    Ahmad, Maizah Hura
    JURNAL TEKNOLOGI, 2013, 63 (02):