Regression to the mean for overdispersed count data

被引:0
|
作者
Iftikhar, Kiran [1 ,3 ]
Khan, Manzoor [1 ]
Olivier, Jake [2 ]
机构
[1] Quaid I Azam Univ, Dept Stat, Islamabad, Pakistan
[2] Univ New South Wales, Sch Math & Stat, Sydney, Australia
[3] Univ Agr Faisalabad, Dept Math & Stat, Faisalabad, Pakistan
关键词
Bivariate negative binomial distribution; Regression to the mean; Over-dispersion; Treatment effect; MODEL;
D O I
10.1016/j.jspi.2024.106211
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In repeated measurements, regression to the mean (RTM) is a tendency of subjects with observed extreme values to move closer to the mean when measured a second time. Not accounting for RTM could lead to incorrect decisions such as when observed natural variation is incorrectly attributed to the effect of a treatment/intervention. A strategy for addressing RTM is to decompose the total effect , the expected difference in paired random variables conditional on the first being in the tail of its distribution, into regression to the mean and unbiased treatment effects. The unbiased treatment effect can then be estimated by subtraction. Formulae are available in the literature to quantify RTM for Poisson distributed data which are constrained by mean-variance equivalence, although there are many real life examples of overdispersed count data that are not well approximated by the Poisson. The negative binomial can be considered an explicit overdispersed Poisson process where the Poisson intensity is chosen from a gamma distribution. In this study, the truncated bivariate negative binomial distribution is used to decompose the total effect formulae into RTM and treatment effects. Maximum likelihood estimators (MLE) and method of moments estimators are developed for the total, RTM, and treatment effects. A simulation study is carried out to investigate the properties of the estimators and compare them with those developed under the assumption of the Poisson process. Data on the incidence of dengue cases reported from 2007 to 2017 are used to estimate the total, RTM, and treatment effects.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] A full likelihood procedure of exchangeable negative binomials for modelling correlated and overdispersed count data
    Tan, Fei
    Rayner, Gibson Johnston
    Wang, Xiaodong
    Peng, Hanxiang
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2010, 140 (10) : 2849 - 2859
  • [42] Comparison of Hierarchical Bayesian Models for Overdispersed Count Data using DIC and Bayes' Factors
    Millar, Russell B.
    BIOMETRICS, 2009, 65 (03) : 962 - 969
  • [43] Analysis of overdispersed count data: application to the Human Papillomavirus Infection in Men (HIM) Study
    Lee, J. -H.
    Han, G.
    Fulp, W. J.
    Giuliano, A. R.
    EPIDEMIOLOGY AND INFECTION, 2012, 140 (06) : 1087 - 1094
  • [44] Semi-parametric approach for modelling overdispersed count data with application to Industry 4.0
    Bonnini, S.
    Borghesi, M.
    Giacalone, M.
    SOCIO-ECONOMIC PLANNING SCIENCES, 2024, 95
  • [45] Simultaneous confidence intervals for comparing biodiversity indices estimated from overdispersed count data
    Scherer, Ralph
    Schaarschmidt, Frank
    Prescher, Sabine
    Priesnitz, Kai U.
    BIOMETRICAL JOURNAL, 2013, 55 (02) : 246 - 263
  • [46] Modelling multivariate, overdispersed count data with correlated and non-normal heterogeneity effects
    Kazemi, Iraj
    Hassanzadeh, Fatemeh
    SORT-STATISTICS AND OPERATIONS RESEARCH TRANSACTIONS, 2020, 44 (02) : 335 - 356
  • [47] A new regression model for overdispersed binomial data accounting for outliers and an excess of zeros
    Ascari, Roberto
    Migliorati, Sonia
    STATISTICS IN MEDICINE, 2021, 40 (17) : 3895 - 3914
  • [48] Multilevel modeling in single-case studies with zero-inflated and overdispersed count data
    Li, Haoran
    Luo, Wen
    Baek, Eunkyeng
    BEHAVIOR RESEARCH METHODS, 2024, 56 (04) : 2765 - 2781
  • [49] A non-parametric model to address overdispersed count response in a longitudinal data setting with missingness
    Zhang, Hui
    He, Hua
    Lu, Naiji
    Zhu, Liang
    Zhang, Bo
    Zhang, Zhiwei
    Tang, Li
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2017, 26 (03) : 1461 - 1475
  • [50] Modeling overdispersed or underdispersed count data with generalized Poisson integer-valued autoregressive processes
    Yang, Kai
    Kang, Yao
    Wang, Dehui
    Li, Han
    Diao, Yajing
    METRIKA, 2019, 82 (07) : 863 - 889