Regression to the mean for overdispersed count data

Authors
Iftikhar, Kiran [1, 3]
Khan, Manzoor [1]
Olivier, Jake [2]
Affiliations
[1] Quaid I Azam Univ, Dept Stat, Islamabad, Pakistan
[2] Univ New South Wales, Sch Math & Stat, Sydney, Australia
[3] Univ Agr Faisalabad, Dept Math & Stat, Faisalabad, Pakistan
Keywords
Bivariate negative binomial distribution; Regression to the mean; Over-dispersion; Treatment effect; MODEL;
DOI
10.1016/j.jspi.2024.106211
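Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Subject classification codes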
020208 ; 070103 ; 0714 ;
Abstract
In repeated measurements, regression to the mean (RTM) is the tendency of subjects with extreme observed values to move closer to the mean when measured a second time. Not accounting for RTM can lead to incorrect decisions, such as attributing observed natural variation to the effect of a treatment or intervention. One strategy for addressing RTM is to decompose the total effect, the expected difference in paired random variables conditional on the first being in the tail of its distribution, into RTM and unbiased treatment effects. The unbiased treatment effect can then be estimated by subtraction. Formulae are available in the literature to quantify RTM for Poisson distributed data, which are constrained by mean-variance equivalence, although there are many real-life examples of overdispersed count data that are not well approximated by the Poisson. The negative binomial can be considered an explicit overdispersed Poisson process in which the Poisson intensity is chosen from a gamma distribution. In this study, the truncated bivariate negative binomial distribution is used to decompose the total effect formulae into RTM and treatment effects. Maximum likelihood estimators (MLE) and method of moments estimators are developed for the total, RTM, and treatment effects. A simulation study is carried out to investigate the properties of the estimators and to compare them with those developed under the assumption of a Poisson process. Data on the incidence of dengue cases reported from 2007 to 2017 are used to estimate the total, RTM, and treatment effects.
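The decomposition described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' estimator: it assumes a simple shared-gamma construction for the paired counts (one gamma-distributed intensity per subject, two conditionally independent Poisson counts), and it assumes a multiplicative treatment that scales the second-period intensity by `ratio`. The function names, the cut-off, and all parameter values are hypothetical choices made for illustration only.

```python
import numpy as np


def simulate_pairs(n, shape, scale, ratio=1.0, seed=0):
    """Draw n paired counts from a gamma-mixed Poisson model.

    Assumption (not taken from the paper): each subject has one gamma
    intensity shared by both measurements, and a treatment multiplies
    the second-period intensity by `ratio` (ratio=1.0 means no effect).
    """
    rng = np.random.default_rng(seed)
    lam = rng.gamma(shape, scale, size=n)   # subject-level Poisson intensity
    x1 = rng.poisson(lam)                   # first measurement
    x2 = rng.poisson(ratio * lam)           # second measurement
    return x1, x2


def total_effect(x1, x2, cutoff):
    """Mean of X1 - X2 among subjects whose first count exceeds the cut-off."""
    tail = x1 > cutoff
    return float(np.mean(x1[tail] - x2[tail]))


# Hypothetical settings: mean count 10, marginal variance 10 + 2 * 5**2 = 60.
n, shape, scale, cutoff = 200_000, 2.0, 5.0, 20

# No treatment: any change in the selected tail is pure RTM.
x1_null, x2_null = simulate_pairs(n, shape, scale, ratio=1.0)
overall_change = float(np.mean(x1_null - x2_null))
rtm = total_effect(x1_null, x2_null, cutoff)

# With a treatment that lowers the intensity by 20 percent.
x1_trt, x2_trt = simulate_pairs(n, shape, scale, ratio=0.8)
total = total_effect(x1_trt, x2_trt, cutoff)

# Treatment effect obtained by subtraction, as the abstract describes.
treatment = total - rtm

print(f"variance / mean of X1:    {x1_null.var() / x1_null.mean():.2f}")
print(f"overall mean change:      {overall_change:.3f}")
print(f"RTM effect in the tail:   {rtm:.3f}")
print(f"total effect in the tail: {total:.3f}")
print(f"treatment by subtraction: {treatment:.3f}")
```

In this sketch the whole-sample change is close to zero when there is no treatment, yet the subjects selected for a high first count still show a positive average drop, which is the RTM component. In practice the RTM term would come from fitted model parameters rather than from a second simulated untreated sample.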
Pages: 14
Related papers
50 in total
  • [1] Estimation of mean using under-reported and overdispersed count data
    Sengupta, Debjit
    Roy, Surupa
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024,
  • [2] Polya-Aeppli regression model for overdispersed count data
    Borges, Patrick
    Godoi, Luciana G.
    STATISTICAL MODELLING, 2019, 19 (04) : 362 - 385
  • [3] Mean and Variance Modeling of Under- and Overdispersed Count Data
    Smith, David M.
    Faddy, Malcolm J.
    JOURNAL OF STATISTICAL SOFTWARE, 2016, 69 (06): : 1 - 23
  • [4] A hyper-Poisson regression model for overdispersed and underdispersed count data
    Saez-Castillo, A. J.
    Conde-Sanchez, A.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 61 : 148 - 157
  • [5] Structured additive regression for overdispersed and zero-inflated-count data
    Fahrmeir, Ludwig
    Echavarria, Leyre Osuna
    APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2006, 22 (04) : 351 - 369
  • [6] Count Regression and Machine Learning Techniques for Zero-Inflated Overdispersed Count Data: Application to Ecological Data
    Sidumo B.
    Sonono E.
    Takaidza I.
    Annals of Data Science, 2024, 11 (03) : 803 - 817
  • [7] A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data
    Bertoli, Wesley
    Conceicao, Katiane S.
    Andrade, Marinho G.
    Louzada, Francisco
    ENTROPY, 2021, 23 (06)
  • [8] Distributions to model overdispersed count data
    Coly, Sylvain
    Yao, Anne-Françoise
    Abrial, David
    Charras-Garrido, Myriam
    JOURNAL OF THE SFDS, 2016, 157 (02): : 39 - 63
  • [9] A generalized model for overdispersed count data
    Okamura, Hiroshi
    Punt, Andre E.
    Amano, Tatsuya
    POPULATION ECOLOGY, 2012, 54 (03) : 467 - 474
  • [10] Score Tests for Zero-Inflation in Overdispersed Count Data
    Yang, Zhao
    Hardin, James W.
    Addy, Cheryl L.
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2010, 39 (11) : 2008 - 2030