A multiple imputation-based sensitivity analysis approach for regression analysis with a missing not at random covariate

Cited by: 1
Authors
Hsu, Chiu-Hsieh [1 ,4 ]
He, Yulei [2 ]
Hu, Chengcheng [1 ]
Zhou, Wei [3 ]
Affiliations
[1] Univ Arizona, Coll Publ Hlth, Dept Epidemiol & Biostat, Tucson, AZ USA
[2] Ctr Dis Control & Prevent, Natl Ctr Hlth Stat, Hyattsville, MD USA
[3] Univ Arizona, Dept Surg, Tucson, AZ USA
[4] Univ Arizona, Dept Epidemiol & Biostat, 1295 N Martin Ave, Tucson, AZ 85724 USA
Funding
National Institutes of Health (USA);
Keywords
missing covariate; missing not at random; multiple imputation; selection model; sensitivity analysis; GENERALIZED LINEAR-MODELS;
DOI
10.1002/sim.9723
Chinese Library Classification
Q [Biological Sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract
Missing covariate problems are common in biomedical and electronic medical record studies that evaluate the relationship between a biomarker and a clinical outcome when biomarker data are not collected for all subjects. However, the missingness mechanism cannot be verified from the observed data. If missing not at random (MNAR) is suspected, researchers often perform a sensitivity analysis to evaluate the impact of various missingness mechanisms. Under the selection modeling framework, we propose a sensitivity analysis approach with a standardized sensitivity parameter using a nonparametric multiple imputation strategy. The proposed approach requires fitting two working models to derive two predictive scores: one for predicting missing covariate values and the other for predicting missingness probabilities. For each missing covariate observation, the two predictive scores, along with the pre-specified sensitivity parameter, are used to define an imputing set. The proposed approach is expected to be robust against mis-specification of the selection model and the sensitivity parameter, since neither is used directly to impute missing covariate values. A simulation study is conducted to assess the performance of the proposed approach when MNAR is induced by Heckman's selection model. Simulation results show that the proposed approach can produce plausible regression coefficient estimates. The proposed sensitivity analysis approach is also applied to evaluate the impact of MNAR on the relationship between post-operative outcomes and incomplete pre-operative Hemoglobin A1c level for patients who underwent carotid intervention for advanced atherosclerotic disease.
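The abstract mentions a simulation study in which MNAR is induced by Heckman's selection model. The following is a minimal sketch of that kind of data-generating mechanism, not the authors' simulation design: the function names `simulate_heckman_mnar` and `observed_mean`, the coefficients (0.5) and the intercept (0.5) are all hypothetical choices made for illustration. The correlation `rho` between the covariate error and the selection error controls how strongly missingness depends on the unobserved covariate value; `rho = 0` leaves missingness dependent on the fully observed `z` only.

```python
import math
import random


def simulate_heckman_mnar(n, rho, seed=0):
    """Simulate a covariate x whose missingness follows a Heckman-type selection model.

    All coefficients here are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    records = []
    for _ in range(n):
        z = rng.gauss(0.0, 1.0)  # fully observed covariate
        g1 = rng.gauss(0.0, 1.0)
        g2 = rng.gauss(0.0, 1.0)
        e1 = g1  # error in the covariate model
        # Error in the selection model, built so that corr(e1, e2) = rho.
        e2 = rho * g1 + math.sqrt(1.0 - rho ** 2) * g2
        x = 0.5 * z + e1  # covariate subject to missingness
        latent = 0.5 + 0.5 * z + e2  # latent selection variable
        observed = latent > 0.0  # x is recorded only when latent > 0
        records.append(
            {"z": z, "x_true": x, "x": x if observed else None, "observed": observed}
        )
    return records


def observed_mean(records):
    """Mean of x over the records where x was observed."""
    xs = [r["x"] for r in records if r["observed"]]
    return sum(xs) / len(xs)


if __name__ == "__main__":
    for rho in (0.0, 0.8):
        recs = simulate_heckman_mnar(20000, rho, seed=1)
        print(rho, round(observed_mean(recs), 3))
```

With a positive `rho`, subjects with larger values of `x` are more likely to be observed, so the observed-case mean of `x` drifts further from the full-data mean than it does under `rho = 0`. That drift is the kind of bias a sensitivity analysis is meant to probe.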
Pages: 2275 - 2292
Number of pages: 18
Related papers
50 in total
  • [31] Missing data analysis and imputation via latent Gaussian Markov random fields
    Gomez-Rubio, Virgilio
    Cameletti, Michela
    Blangiardo, Marta
    SORT-STATISTICS AND OPERATIONS RESEARCH TRANSACTIONS, 2022, 46 (02) : 217 - 244
  • [32] Multiple imputation of missing data for survey data analysis
    Lupo, Coralie
    Le Bouquin, Sophie
    Michel, Virginie
    Colin, Pierre
    Chauvin, Claire
    EPIDEMIOLOGIE ET SANTE ANIMALE, 2008, (53) : 73 - 83
  • [33] Sensitivity analysis for clinical trials with missing continuous outcome data using controlled multiple imputation: A practical guide
    Cro, Suzie
    Morris, Tim P.
    Kenward, Michael G.
    Carpenter, James R.
    STATISTICS IN MEDICINE, 2020, 39 (21) : 2815 - 2842
  • [34] Imputation-based strategies for clinical trial longitudinal data with nonignorable missing values
    Yang, Xiaowei
    Li, Jinhui
    Shoptaw, Steven
    STATISTICS IN MEDICINE, 2008, 27 (15) : 2826 - 2849
  • [35] Imputation of data Missing Not at Random: Artificial generation and benchmark analysis
    Pereira, Ricardo Cardoso
    Abreu, Pedro Henriques
    Rodrigues, Pedro Pereira
    Figueiredo, Mario A. T.
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [36] A NOTE ON THE ANALYSIS OF CENSORED REGRESSION DATA BY MULTIPLE IMPUTATION
    JAMES, IR
    BIOMETRICS, 1995, 51 (01) : 358 - 362
  • [37] Nonparametric multiple imputation for receiver operating characteristics analysis when some biomarker values are missing at random
    Long, Qi
    Zhang, Xiaoxi
    Hsu, Chiu-Hsieh
    STATISTICS IN MEDICINE, 2011, 30 (26) : 3149 - 3161
  • [38] Using multiple imputation to estimate cumulative distribution functions in longitudinal data analysis with data missing at random
    Dinh, Phillip
    PHARMACEUTICAL STATISTICS, 2013, 12 (05) : 260 - 267
  • [39] Approaches for missing covariate data in logistic regression with MNAR sensitivity analyses
    Ward, Ralph C.
    Axon, Robert Neal
    Gebregziabher, Mulugeta
    BIOMETRICAL JOURNAL, 2020, 62 (04) : 1025 - 1037
  • [40] Multiple imputation of missing fMRI data in whole brain analysis
    Vaden, Kenneth I., Jr.
    Gebregziabher, Mulugeta
    Kuchinsky, Stefanie E.
    Eckert, Mark A.
    NEUROIMAGE, 2012, 60 (03) : 1843 - 1855