Accounting for bias due to outcome data missing not at random: comparison and illustration of two approaches to probabilistic bias analysis: a simulation study

被引:1
|
作者
Kawabata, Emily [1 ,2 ]
Major-Smith, Daniel [1 ,2 ]
Clayton, Gemma L. [1 ,2 ]
Shapland, Chin Yang [1 ,2 ]
Morris, Tim P. [3 ]
Carter, Alice R. [1 ,2 ]
Fernandez-Sanles, Alba [4 ]
Borges, Maria Carolina [1 ,2 ]
Tilling, Kate [1 ,2 ]
Griffith, Gareth J. [1 ,2 ]
Millard, Louise A. C. [1 ,2 ]
Smith, George Davey [1 ,2 ]
Lawlor, Deborah A. [1 ,2 ]
Hughes, Rachael A. [1 ,2 ]
机构
[1] Univ Bristol, MRC Integrat Epidemiol Unit, Bristol, England
[2] Univ Bristol, Bristol Med Sch, Populat Hlth Sci, Bristol, England
[3] UCL, MRC Clin Trials Unit, London, England
[4] UCL, MRC Unit Lifelong Hlth & Ageing, London, England
基金
英国医学研究理事会; 英国惠康基金;
关键词
Bayesian bias analysis; Inverse probability weighting; Missing not at random; Monte Carlo bias analysis; Multiple imputation; Probabilistic bias analysis; Sensitivity analysis; UK Biobank; FULLY CONDITIONAL SPECIFICATION; PATTERN-MIXTURE ANALYSIS; MULTIPLE IMPUTATION; SELECTION BIAS; FRAMEWORK; MODELS;
D O I
10.1186/s12874-024-02382-4
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
BackgroundBias from data missing not at random (MNAR) is a persistent concern in health-related research. A bias analysis quantitatively assesses how conclusions change under different assumptions about missingness using bias parameters that govern the magnitude and direction of the bias. Probabilistic bias analysis specifies a prior distribution for these parameters, explicitly incorporating available information and uncertainty about their true values. A Bayesian bias analysis combines the prior distribution with the data's likelihood function whilst a Monte Carlo bias analysis samples the bias parameters directly from the prior distribution. No study has compared a Monte Carlo bias analysis to a Bayesian bias analysis in the context of MNAR missingness.MethodsWe illustrate an accessible probabilistic bias analysis using the Monte Carlo bias analysis approach and a well-known imputation method. We designed a simulation study based on a motivating example from the UK Biobank study, where a large proportion of the outcome was missing and missingness was suspected to be MNAR. We compared the performance of our Monte Carlo bias analysis to a principled Bayesian bias analysis, complete case analysis (CCA) and multiple imputation (MI) assuming missing at random.ResultsAs expected, given the simulation study design, CCA and MI estimates were substantially biased, with 95% confidence interval coverages of 7-48%. Including auxiliary variables (i.e., variables not included in the substantive analysis that are predictive of missingness and the missing data) in MI's imputation model amplified the bias due to assuming missing at random. With reasonably accurate and precise information about the bias parameter, the Monte Carlo bias analysis performed as well as the Bayesian bias analysis. However, when very limited information was provided about the bias parameter, only the Bayesian bias analysis was able to eliminate most of the bias due to MNAR whilst the Monte Carlo bias analysis performed no better than the CCA and MI.ConclusionThe Monte Carlo bias analysis we describe is easy to implement in standard software and, in the setting we explored, is a viable alternative to a Bayesian bias analysis. We caution careful consideration of choice of auxiliary variables when applying imputation where data may be MNAR.
引用
收藏
页数:14
相关论文
共 14 条
  • [1] Attrition Bias Related to Missing Outcome Data: A Longitudinal Simulation Study
    Lewin, Antoine
    Brondeel, Ruben
    Benmarhnia, Tarik
    Thomas, Frederique
    Chaix, Basile
    EPIDEMIOLOGY, 2018, 29 (01) : 87 - 95
  • [2] Evaluation of bias and precision in methods of analysis for pragmatic trials with missing outcome data: a simulation study
    Royes Joseph
    Julius Sim
    Reuben Ogollah
    Martyn Lewis
    Trials, 14 (Suppl 1)
  • [3] A wide range of missing imputation approaches in longitudinal data: a simulation study and real data analysis
    Jahangiri, Mina
    Kazemnejad, Anoshirvan
    Goldfeld, Keith S.
    Daneshpour, Maryam S.
    Mostafaei, Shayan
    Khalili, Davood
    Moghadas, Mohammad Reza
    Akbarzadeh, Mahdi
    BMC MEDICAL RESEARCH METHODOLOGY, 2023, 23 (01)
  • [4] Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation study
    van Kuijk, Sander M. J.
    Viechtbauer, Wolfgang
    Peeters, Louis L.
    Smits, Luc
    EPIDEMIOLOGY BIOSTATISTICS AND PUBLIC HEALTH, 2016, 13 (01)
  • [5] The M-Value: A Simple Sensitivity Analysis for Bias Due to Missing Data in Treatment Effect Estimates
    Mathur, Maya B.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2023, 192 (04) : 612 - 620
  • [6] Multiple imputation using auxiliary imputation variables that only predict missingness can increase bias due to data missing not at random
    Curnow, Elinor
    Cornish, Rosie P.
    Heron, Jon E.
    Carpenter, James R.
    Tilling, Kate
    BMC MEDICAL RESEARCH METHODOLOGY, 2024, 24 (01)
  • [7] Target Trial Emulation and Bias Through Missing Eligibility Data: An Application to a Study of Palivizumab for the Prevention of Hospitalization Due to Infant Respiratory Illness
    Tompsett, Daniel
    Zylbersztejn, Ania
    Hardelid, Pia
    De Stavola, Bianca
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2023, 192 (04) : 600 - 611
  • [8] Bayesian sensitivity analysis methods to evaluate bias due to misclassification and missing data using informative priors and external validation data
    Luta, George
    Ford, Melissa B.
    Bondy, Melissa
    Shields, Peter G.
    Stamey, James D.
    CANCER EPIDEMIOLOGY, 2013, 37 (02) : 121 - 126
  • [9] Multiple imputation using linked proxy outcome data resulted in important bias reduction and efficiency gains: A simulation study
    Cornish R.P.
    Macleod J.
    Carpenter J.R.
    Tilling K.
    Emerging Themes in Epidemiology, 14 (1):
  • [10] Does pattern mixture modelling reduce bias due to informative attrition compared to fitting a mixed effects model to the available cases or data imputed using multiple imputation?: a simulation study
    Welch, Catherine A.
    Sabia, Severine
    Brunner, Eric
    Kivimaki, Mika
    Shipley, Martin J.
    BMC MEDICAL RESEARCH METHODOLOGY, 2018, 18