A robust imputation method for missing responses and covariates in sample selection models

被引:9
|
作者
Ogundimu, Emmanuel O. [1 ]
Collins, Gary S. [2 ]
机构
[1] Northumbria Univ, Dept Math, Newcastle Upon Tyne NE1 8ST, Tyne & Wear, England
[2] Univ Oxford, Ctr Stat Med, Oxford, England
关键词
Student-t distribution; Heckman model; missing data; multiple imputation; robust method; MICE package; INFERENCE; BIAS;
D O I
10.1177/0962280217715663
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Sample selection arises when the outcome of interest is partially observed in a study. Although sophisticated statistical methods in the parametric and non-parametric framework have been proposed to solve this problem, it is yet unclear how to deal with selectively missing covariate data using simple multiple imputation techniques, especially in the absence of exclusion restrictions and deviation from normality. Motivated by the 2003-2004 NHANES data, where previous authors have studied the effect of socio-economic status on blood pressure with missing data on income variable, we proposed the use of a robust imputation technique based on the selection-t sample selection model. The imputation method, which is developed within the frequentist framework, is compared with competing alternatives in a simulation study. The results indicate that the robust alternative is not susceptible to the absence of exclusion restrictions - a property inherited from the parent selection-t model - and performs better than models based on the normal assumption even when the data is generated from the normal distribution. Applications to missing outcome and covariate data further corroborate the robustness properties of the proposed method. We implemented the proposed approach within the MICE environment in R Statistical Software.
引用
收藏
页码:102 / 116
页数:15
相关论文
共 50 条
  • [1] Imputation and variable selection in linear regression models with missing covariates
    Yang, XW
    Belin, TR
    Boscardin, WJ
    BIOMETRICS, 2005, 61 (02) : 498 - 506
  • [2] Sequential BART for imputation of missing covariates
    Xu, Dandan
    Daniels, Michael J.
    Winterstein, Almut G.
    BIOSTATISTICS, 2016, 17 (03) : 589 - 602
  • [3] Robust location estimators in regression models with covariates and responses missing at random
    Bianco, Ana M.
    Boente, Graciela
    Gonzalez-Manteiga, Wenceslao
    Perez-Gonzalez, Ana
    JOURNAL OF NONPARAMETRIC STATISTICS, 2020, 32 (04) : 915 - 939
  • [4] Semiparametric Bayesian multiple imputation for regression models with missing mixed continuous-discrete covariates
    Kato, Ryo
    Hoshino, Takahiro
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2020, 72 (03) : 803 - 825
  • [5] Multiple Imputation of Missing Covariates in NONMEM and Evaluation of the Method’s Sensitivity to η-Shrinkage
    Åsa M. Johansson
    Mats O. Karlsson
    The AAPS Journal, 2013, 15 : 1035 - 1042
  • [6] Multiple Imputation of Missing Covariates in NONMEM and Evaluation of the Method's Sensitivity to η-Shrinkage
    Johansson, Asa M.
    Karlsson, Mats O.
    AAPS JOURNAL, 2013, 15 (04): : 1035 - 1042
  • [7] Semiparametric Bayesian multiple imputation for regression models with missing mixed continuous–discrete covariates
    Ryo Kato
    Takahiro Hoshino
    Annals of the Institute of Statistical Mathematics, 2020, 72 : 803 - 825
  • [8] Application of Multiple Imputation Method in Analyzing Data with Missing Continuous Covariates
    Tamar, S. Ghasemizadeh
    Ganjali, M.
    KOREAN JOURNAL OF APPLIED STATISTICS, 2008, 21 (04) : 659 - 664
  • [9] Robust inference in sample selection models
    Zhelonkin, Mikhail
    Genton, Marc G.
    Ronchetti, Elvezio
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2016, 78 (04) : 805 - 827
  • [10] Variable selection for additive models with missing data via multiple imputation
    Yuta Shimazu
    Takayuki Yamaguchi
    Ibuki A. J. Hoshina
    Hidetoshi Matsui
    Behaviormetrika, 2025, 52 (1) : 163 - 178