Regression to the mean for overdispersed count data


Authors:

Iftikhar, Kiran [1,3]

Khan, Manzoor [1]

Olivier, Jake [2]

Affiliations:

[1] Quaid I Azam Univ, Dept Stat, Islamabad, Pakistan

[2] Univ New South Wales, Sch Math & Stat, Sydney, Australia

[3] Univ Agr Faisalabad, Dept Math & Stat, Faisalabad, Pakistan

Source:

Journal of Statistical Planning and Inference | 2025 / Vol. 234

Keywords:

Bivariate negative binomial distribution; Regression to the mean; Over-dispersion; Treatment effect; Model

DOI:

10.1016/j.jspi.2024.106211

Chinese Library Classification:

O21 [Probability theory and mathematical statistics]; C8 [Statistics]

Discipline classification codes:

020208; 070103; 0714

Abstract:

In repeated measurements, regression to the mean (RTM) is the tendency of subjects with extreme observed values to move closer to the mean when measured a second time. Not accounting for RTM can lead to incorrect decisions, such as attributing observed natural variation to the effect of a treatment or intervention. One strategy for addressing RTM is to decompose the total effect, the expected difference in paired random variables conditional on the first being in the tail of its distribution, into RTM and unbiased treatment effects. The unbiased treatment effect can then be estimated by subtraction. Formulae are available in the literature to quantify RTM for Poisson distributed data, which are constrained by mean-variance equivalence, although there are many real-life examples of overdispersed count data that are not well approximated by the Poisson. The negative binomial can be considered an explicit overdispersed Poisson process where the Poisson intensity is drawn from a gamma distribution. In this study, the truncated bivariate negative binomial distribution is used to decompose the total effect formulae into RTM and treatment effects. Maximum likelihood estimators (MLE) and method of moments estimators are developed for the total, RTM, and treatment effects. A simulation study is carried out to investigate the properties of the estimators and to compare them with those developed under the assumption of a Poisson process. Data on the incidence of dengue cases reported from 2007 to 2017 are used to estimate the total, RTM, and treatment effects.
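
The RTM effect described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' estimator or their parametrization: it assumes one simple bivariate negative binomial construction, in which each subject has a gamma-distributed intensity (mean `mu`, shape `shape`) and two Poisson counts share that intensity, with no treatment applied. The function names, parameter values, and cutoff are hypothetical. Because there is no treatment, any average drop among subjects selected for a high first count is pure RTM. For comparison, the code also evaluates the value implied by standard gamma-Poisson conjugacy, E[X1 - X2 | X1 = x] = shape (x - mu) / (shape + mu), averaged over the selected subjects.

```python
import math
import random


def sample_poisson(rng, rate):
    """Draw one Poisson variate by Knuth's multiplication method (suitable for small rates)."""
    limit = math.exp(-rate)
    count = 0
    product = rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def simulate_rtm(mu, shape, cutoff, n, seed=0):
    """Simulate paired overdispersed counts and measure RTM above a cutoff.

    Assumed model (illustrative only): intensity ~ Gamma(shape, scale=mu/shape),
    then X1 and X2 are independent Poisson counts given that intensity.
    No treatment effect is applied, so the conditional drop is pure RTM.
    """
    rng = random.Random(seed)
    first_counts = []
    differences = []
    for _ in range(n):
        intensity = rng.gammavariate(shape, mu / shape)  # mean mu
        x1 = sample_poisson(rng, intensity)
        x2 = sample_poisson(rng, intensity)
        if x1 >= cutoff:  # subject selected for an extreme first measurement
            first_counts.append(x1)
            differences.append(x1 - x2)
    if not differences:
        raise ValueError("no subjects at or above the cutoff; lower it or raise n")
    mean_first = sum(first_counts) / len(first_counts)
    observed_rtm = sum(differences) / len(differences)
    # Gamma-Poisson conjugacy: E[X2 | X1 = x] = mu * (shape + x) / (shape + mu),
    # hence E[X1 - X2 | X1 = x] = shape * (x - mu) / (shape + mu).
    conjugacy_rtm = shape / (shape + mu) * (mean_first - mu)
    return {
        "n_selected": len(differences),
        "mean_first": mean_first,
        "observed_rtm": observed_rtm,
        "conjugacy_rtm": conjugacy_rtm,
    }


if __name__ == "__main__":
    summary = simulate_rtm(mu=5.0, shape=2.0, cutoff=10, n=100_000, seed=1)
    for name, value in summary.items():
        print(name, value)
```

Under this assumed model, letting `shape` grow large recovers the Poisson case, where the two counts are independent and the selected subjects regress all the way back to `mu`. A smaller `shape` means more overdispersion and more persistent between-subject differences, so the RTM effect is a smaller fraction of the distance from the mean.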


Pages: 14
