Robust Inference in the Negative Binomial Regression Model with an Application to Falls Data

被引:45
作者
Aeberhard, William H. [1 ,2 ,3 ]
Cantoni, Eva [1 ,2 ]
Heritier, Stephane [3 ,4 ]
机构
[1] Univ Geneva, Res Ctr Stat, Geneva, Switzerland
[2] Univ Geneva, Geneva Sch Econ & Management, Geneva, Switzerland
[3] Univ Sydney, Sydney Sch Publ Hlth, Sydney, NSW 2006, Australia
[4] Macquarie Univ, Dept Stat, Sydney, NSW 2109, Australia
关键词
Bounded influence function; Negative binomial regression; Overdispersed count data; Redescending estimators; Weighted maximum likelihood; POISSON; ESTIMATORS;
D O I
10.1111/biom.12212
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A popular way to model overdispersed count data, such as the number of falls reported during intervention studies, is by means of the negative binomial (NB) distribution. Classical estimating methods are well-known to be sensitive to model misspecifications, taking the form of patients falling much more than expected in such intervention studies where the NB regression model is used. We extend in this article two approaches for building robust M-estimators of the regression parameters in the class of generalized linear models to the NB distribution. The first approach achieves robustness in the response by applying a bounded function on the Pearson residuals arising in the maximum likelihood estimating equations, while the second approach achieves robustness by bounding the unscaled deviance components. For both approaches, we explore different choices for the bounding functions. Through a unified notation, we show how close these approaches may actually be as long as the bounding functions are chosen and tuned appropriately, and provide the asymptotic distributions of the resulting estimators. Moreover, we introduce a robust weighted maximum likelihood estimator for the overdispersion parameter, specific to the NB distribution. Simulations under various settings show that redescending bounding functions yield estimates with smaller biases under contamination while keeping high efficiency at the assumed model, and this for both approaches. We present an application to a recent randomized controlled trial measuring the effectiveness of an exercise program at reducing the number of falls among people suffering from Parkinsons disease to illustrate the diagnostic use of such robust procedures and their need for reliable inference.
引用
收藏
页码:920 / 931
页数:12
相关论文
共 26 条
  • [1] Amiguet M, 2011, THESIS U LAUSANNE SW
  • [2] [Anonymous], 1996, ROBUST STAT DATA ANA
  • [3] [Anonymous], 1985, MATH STAT APPL, V8, P283, DOI DOI 10.1007/978-94-009-5438-0_20
  • [4] [Anonymous], 2011, NEGATIVE BINOMIAL RE
  • [5] [Anonymous], 2009, Wiley Series in Probability and Statistics, DOI DOI 10.1002/9780470434697.CH7
  • [6] [Anonymous], 1983, Generalized Linear Models
  • [7] THE STATISTICAL ANALYSIS OF INSECT COUNTS BASED ON THE NEGATIVE BINOMIAL DISTRIBUTION
    ANSCOMBE, FJ
    [J]. BIOMETRICS, 1949, 5 (02) : 165 - 173
  • [8] Robust tests in generalized linear models with missing responses
    Bianco, Ana M.
    Boente, Graciela
    Rodrigues, Isabel M.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 65 : 80 - 97
  • [9] Resistant estimators in Poisson and Gamma models with missing responses and an application to outlier detection
    Bianco, Ana M.
    Boente, Graciela
    Rodrigues, Isabel M.
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2013, 114 : 209 - 226
  • [10] BRESLOW NE, 1984, J R STAT SOC C-APPL, V33, P38