IMPUTATION-BASED ADJUSTED SCORE EQUATIONS IN GENERALIZED LINEAR MODELS WITH NONIGNORABLE MISSING COVARIATE VALUES

被引:16
作者
Fang, Fang [1 ]
Zhao, Jiwei [2 ]
Shao, Jun [1 ,3 ]
机构
[1] East China Normal Univ, Sch Stat, 500 Dongchuan Rd, Shanghai 200241, Peoples R China
[2] SUNY Buffalo, Dept Biostat, Buffalo, NY 14214 USA
[3] Univ Wisconsin, Dept Stat, 1300 Univ Ave, Madison, WI 53706 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Adjusted likelihood; identifiability; instruments; nonignorable missing covariate data; pseudo-likelihood; semiparametric; LIKELIHOOD; REGRESSION; RESPONSES;
D O I
10.5705/ss.202015.0437
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider the estimation of unknown parameters in a generalized linear model when some covariates have nonignorable missing values. When an instrument, a covariate that helps identifying parameters under nonignorable missingness, is appropriately specified, a pseudo likelihood approach similar to that in Tang, Little and Raghunathan (2003) or Zhao and Shao (2015) can be applied. However, this approach does not work well when the instrument is a weak predictor of the response given other covariates. We show that the asymptotic variances of the pseudo likelihood estimators for the regression coefficients of covariates other than the instrument diverge to infinity as the regression coefficient of the instrument goes to 0. By an imputation-based adjustment for the score equations, we propose a new estimator for the regression coefficients of the covariates other than the instrument. This works well even if the instrument is a weak predictor. It is semiparametric since the propensity of missing covariate data is completely unspecified. To solve the adjusted score equation, we develop an iterative algorithm that can be applied by using standard softwares at each iteration. We establish some theoretical results on the convergence of the proposed iterative algorithm and asymptotic normality of the resulting estimators. A variance estimation formula is also derived. Some simulation results and a data example are presented for illustration.
引用
收藏
页码:1677 / 1701
页数:25
相关论文
共 24 条
[1]   The correction of risk estimates for measurement error [J].
Bashir, SA ;
Duffy, SW .
ANNALS OF EPIDEMIOLOGY, 1997, 7 (02) :154-164
[2]   Maximum likelihood estimation in random effects cure rate models with nonignorable missing covariates [J].
Herring, AH ;
Ibrahim, JG .
BIOSTATISTICS, 2002, 3 (03) :387-405
[3]   Maximum likelihood analysis of logistic regression models with incomplete covariate data and auxiliary information [J].
Horton, NJ ;
Laird, NM .
BIOMETRICS, 2001, 57 (01) :34-42
[4]   Bayesian analysis for generalized linear models with nonignorably missing covariates [J].
Huang, L ;
Chen, MH ;
Ibrahim, JG .
BIOMETRICS, 2005, 61 (03) :767-780
[5]   Missing-data methods for generalized linear models: A comparative review [J].
Ibrahim, JG ;
Chen, MH ;
Lipsitz, SR ;
Herring, AH .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (469) :332-346
[6]   Missing covariates in generalized linear models when the missing data mechanism is non-ignorable [J].
Ibrahim, JG ;
Lipsitz, SR .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1999, 61 :173-190
[7]  
Ibrahim JG, 2009, TEST-SPAIN, V18, P1, DOI 10.1007/s11749-009-0138-x
[8]   A weighted estimating equation for missing covariate data with properties similar to maximum likelihood [J].
Lipsitz, SR ;
Ibrahim, JG ;
Zhao, LP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (448) :1147-1160
[9]  
Lipsitz SR, 1999, STAT MED, V18, P2435, DOI 10.1002/(SICI)1097-0258(19990915/30)18:17/18<2435::AID-SIM267>3.0.CO
[10]  
2-B