R2 measures for zero-inflated regression models for count data with excess zeros

Cited by: 13
Authors
Martin, Jacob [1]
Hall, Daniel B. [1]
Affiliations
[1] Univ Georgia, Dept Stat, Athens, GA 30602 USA
Keywords
Adjusted R2; Poisson regression; negative binomial regression; overdispersion; deviance; zero inflation; LOGISTIC-REGRESSION; POISSON REGRESSION; COEFFICIENTS;
DOI
10.1080/00949655.2016.1186166
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Generalized linear models are often used to analyse discrete data. There are many proposed R2 measures for this class of models. For loglinear models for count data, Cameron and Windmeijer [An R-squared measure of goodness of fit for some common nonlinear regression models. J Econometrics. 1997; 77: 329-342] developed an R2-like measure based on a ratio of deviances. This quantity has since been adjusted to accommodate both overspecification and overdispersion. While these statistics are useful for Poisson and negative binomial regression models, count data often include many zeros, a phenomenon that is often handled via zero-inflated (ZI) regression models. Building on Cameron and Windmeijer's work, we propose R2 statistics for the ZI Poisson and ZI negative binomial regression contexts. We also propose adjusted R2-like versions of these quantities to avoid inflation of these statistics due to the inclusion of irrelevant covariates in the model. The properties of the proposed measures of fit are examined via simulation, and their use is illustrated on two data sets involving counts with excess zeros.
Pages: 3777-3790
Page count: 14
Related papers
50 in total
  • [41] Models for Zero-Inflated and Overdispersed Correlated Count Data: An Application to Cigarette Use
    Pittman, Brian
    Buta, Eugenia
    Garrison, Kathleen
    Gueorguieva, Ralitza
    NICOTINE & TOBACCO RESEARCH, 2023, 25 (05) : 996 - 1003
  • [42] Geographically Weighted Zero-Inflated Negative Binomial Regression: A general case for count data
    da Silva, Alan Ricardo
    de Sousa, Marcos Douglas Rodrigues
    SPATIAL STATISTICS, 2023, 58
  • [43] Zero-inflated count time series models using Gaussian copula
    Alqawba, Mohammed
    Diawara, Norou
    Chaganty, N. Rao
    SEQUENTIAL ANALYSIS-DESIGN METHODS AND APPLICATIONS, 2019, 38 (03) : 342 - 357
  • [44] Multilevel modeling in single-case studies with zero-inflated and overdispersed count data
    Li, Haoran
    Luo, Wen
    Baek, Eunkyeng
    BEHAVIOR RESEARCH METHODS, 2024, 56 (04) : 2765 - 2781
  • [45] Zero-inflated Modified Borel-Tanner Regression Model for Count Data
    Hassan, Anwar
    Ahmad, Ishfaq S.
    Ahmad, Peer Bilal
    AUSTRIAN JOURNAL OF STATISTICS, 2022, 51 (02) : 28 - 39
  • [46] Time Series Regression for Zero-Inflated and Overdispersed Count Data: A Functional Response Model Approach
    Ghahramani, M.
    White, S. S.
    JOURNAL OF STATISTICAL THEORY AND PRACTICE, 2020, 14
  • [47] Semiparametric frailty models for zero-inflated event count data in the presence of informative dropout
    Diao, Guoqing
    Zeng, Donglin
    Hu, Kuolung
    Ibrahim, Joseph G.
    BIOMETRICS, 2019, 75 (04) : 1168 - 1178
  • [48] Bayesian variable selection for multivariate zero-inflated models: Application to microbiome count data
    Lee, Kyu Ha
    Coull, Brent A.
    Moscicki, Anna-Barbara
    Paster, Bruce J.
    Starr, Jacqueline R.
    BIOSTATISTICS, 2020, 21 (03) : 499 - 517
  • [49] Estimating overall exposure effects for zero-inflated regression models with application to dental caries
    Albert, Jeffrey M.
    Wang, Wei
    Nelson, Suchitra
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2014, 23 (03) : 257 - 278
  • [50] Hierarchical Bayesian analysis of correlated zero-inflated count data
    Dagne, GA
    BIOMETRICAL JOURNAL, 2004, 46 (06) : 653 - 663