Regression to the mean for overdispersed count data

被引:0
|
作者
Iftikhar, Kiran [1 ,3 ]
Khan, Manzoor [1 ]
Olivier, Jake [2 ]
机构
[1] Quaid I Azam Univ, Dept Stat, Islamabad, Pakistan
[2] Univ New South Wales, Sch Math & Stat, Sydney, Australia
[3] Univ Agr Faisalabad, Dept Math & Stat, Faisalabad, Pakistan
关键词
Bivariate negative binomial distribution; Regression to the mean; Over-dispersion; Treatment effect; MODEL;
D O I
10.1016/j.jspi.2024.106211
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In repeated measurements, regression to the mean (RTM) is a tendency of subjects with observed extreme values to move closer to the mean when measured a second time. Not accounting for RTM could lead to incorrect decisions such as when observed natural variation is incorrectly attributed to the effect of a treatment/intervention. A strategy for addressing RTM is to decompose the total effect , the expected difference in paired random variables conditional on the first being in the tail of its distribution, into regression to the mean and unbiased treatment effects. The unbiased treatment effect can then be estimated by subtraction. Formulae are available in the literature to quantify RTM for Poisson distributed data which are constrained by mean-variance equivalence, although there are many real life examples of overdispersed count data that are not well approximated by the Poisson. The negative binomial can be considered an explicit overdispersed Poisson process where the Poisson intensity is chosen from a gamma distribution. In this study, the truncated bivariate negative binomial distribution is used to decompose the total effect formulae into RTM and treatment effects. Maximum likelihood estimators (MLE) and method of moments estimators are developed for the total, RTM, and treatment effects. A simulation study is carried out to investigate the properties of the estimators and compare them with those developed under the assumption of the Poisson process. Data on the incidence of dengue cases reported from 2007 to 2017 are used to estimate the total, RTM, and treatment effects.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Examples of Computing Power for Zero-Inflated and Overdispersed Count Data
    Doyle, Suzanne R.
    JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2009, 8 (02) : 360 - 376
  • [32] GWRM: An R Package for Identifying Sources of Variation in Overdispersed Count Data
    Vilchez-Lopez, Silverio
    Jose Saez-Castillo, Antonio
    Jose Olmo-Jimenez, Maria
    PLOS ONE, 2016, 11 (12):
  • [33] Pathway-based genetic association analysis for overdispersed count data
    Liu, Yang
    JOURNAL OF APPLIED STATISTICS, 2025,
  • [34] Exponential dispersion models for overdispersed zero-inflated count data
    Bar-Lev, Shaul K.
    Ridder, Ad
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (07) : 3286 - 3304
  • [35] A comparison study on modeling of clustered and overdispersed count data for multiple comparisons
    Kruppa, Jochen
    Hothorn, Ludwig
    JOURNAL OF APPLIED STATISTICS, 2021, 48 (16) : 3220 - 3232
  • [36] A regression model for overdispersed data without too many zeros
    Rodriguez-Avi, Jose
    Jose Olmo-Jimenez, Maria
    STATISTICAL PAPERS, 2017, 58 (03) : 749 - 773
  • [37] A regression model for overdispersed data without too many zeros
    José Rodríguez-Avi
    María José Olmo-Jiménez
    Statistical Papers, 2017, 58 : 749 - 773
  • [38] Overdispersed exponential regression models
    Seeber, GUH
    COMPUTATIONAL STATISTICS, 1997, 12 (02) : 209 - 218
  • [39] Hierarchical Bayesian Models for Small Area Estimation under Overdispersed Count Data
    Wulandari, Ita
    Notodiputro, Khairil Anwar
    Fitrianto, Anwar
    Kurnia, Anang
    ENGINEERING LETTERS, 2023, 31 (04) : 1333 - 1342
  • [40] A joint model for hierarchical continuous and zero-inflated overdispersed count data
    Kassahun, Wondwosen
    Neyens, Thomas
    Molenberghs, Geert
    Faes, Christel
    Verbeke, Geert
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (03) : 552 - 571