Empirical likelihood method for non-ignorable missing data problems

被引:7
作者
Guan, Zhong [1 ]
Qin, Jing [2 ]
机构
[1] Indiana Univ, Dept Math Sci, South Bend, IN 46634 USA
[2] NIAID, Biostat Res Branch, 9000 Rockville Pike, Bethesda, MD 20892 USA
关键词
Constrained estimation; Empirical likelihood; Non-ignorable missing data; Survey sampling; SEMIPARAMETRIC ESTIMATION; AUXILIARY INFORMATION; ESTIMATING EQUATIONS; ROBUST ESTIMATION; REGRESSION; INFERENCE; NONRESPONSE; MODELS; BIAS; IMPUTATION;
D O I
10.1007/s10985-016-9381-0
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Missing response problem is ubiquitous in survey sampling, medical, social science and epidemiology studies. It is well known that non-ignorable missing is the most difficult missing data problem where the missing of a response depends on its own value. In statistical literature, unlike the ignorable missing data problem, not many papers on non-ignorable missing data are available except for the full parametric model based approach. In this paper we study a semiparametric model for non-ignorable missing data in which the missing probability is known up to some parameters, but the underlying distributions are not specified. By employing Owen (1988)'s empirical likelihood method we can obtain the constrained maximum empirical likelihood estimators of the parameters in the missing probability and the mean response which are shown to be asymptotically normal. Moreover the likelihood ratio statistic can be used to test whether the missing of the responses is non-ignorable or completely at random. The theoretical results are confirmed by a simulation study. As an illustration, the analysis of a real AIDS trial data shows that the missing of CD4 counts around two years are non-ignorable and the sample mean based on observed data only is biased.
引用
收藏
页码:113 / 135
页数:23
相关论文
共 38 条
  • [1] ADJUSTING FOR NONRESPONSE BIAS USING LOGISTIC-REGRESSION
    ALHO, JM
    [J]. BIOMETRIKA, 1990, 77 (03) : 617 - 624
  • [2] [Anonymous], 1977, SAMPLING TECHNIQUES
  • [3] Oracle, Multiple Robust and Multipurpose Calibration in a Missing Response Problem
    Chan, Kwun Chuen Gary
    Yam, Sheung Chi Phillip
    [J]. STATISTICAL SCIENCE, 2014, 29 (03) : 380 - 396
  • [4] CHEN JH, 1993, BIOMETRIKA, V80, P107, DOI 10.1093/biomet/80.1.107
  • [5] Parametric models for response-biased sampling
    Chen, KN
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2001, 63 : 775 - 789
  • [6] Semiparametric estimation of treatment effect in a pretest-posttest study with missing data
    Davidian, M
    Tsiatis, AA
    Leon, S
    [J]. STATISTICAL SCIENCE, 2005, 20 (03) : 261 - 282
  • [7] AN OPTIMUM PROPERTY OF REGULAR MAXIMUM-LIKELIHOOD ESTIMATION
    GODAMBE, VP
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1960, 31 (04): : 1208 - 1211
  • [8] GREENLEES JS, 1982, J AM STAT ASSOC, V77, P251
  • [9] METHODOLOGY AND ALGORITHMS OF EMPIRICAL LIKELIHOOD
    HALL, P
    LASCALA, B
    [J]. INTERNATIONAL STATISTICAL REVIEW, 1990, 58 (02) : 109 - 127
  • [10] A trial comparing nucleoside monotherapy with combination therapy in HIV-infected adults with CD4 cell counts from 200 to 500 per cubic millimeter
    Hammer, SM
    Katzenstein, DA
    Hughes, MD
    Gundacker, H
    Schooley, RT
    Haubrich, RH
    Henry, WK
    Lederman, MM
    Phair, JP
    Niu, M
    Hirsch, MS
    Merigan, TC
    Blaschke, TF
    Simpson, D
    McLaren, C
    Rooney, J
    Salgo, M
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 1996, 335 (15) : 1081 - 1090