Bayesian Estimation of Propensity Scores for Integrating Multiple Cohorts with High-Dimensional Covariates

被引:0
|
作者
Guha, Subharup [1 ]
Li, Yi [2 ]
机构
[1] Univ Florida, Dept Biostat, 2004 Mowry Rd, Gainesville, FL 32603 USA
[2] Univ Michigan, Dept Biostat, 1415 Washington Hts, Ann Arbor, MI 48109 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
B-MSC; Data integration; Covariate imbalance; High-dimensional covariates; Hybrid Bayesian frequentist; VARIABLE SELECTION; REGRESSION; LASSO;
D O I
10.1007/s12561-024-09470-5
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Comparative meta-analyses of groups of subjects by integrating multiple observational studies rely on estimated propensity scores (PSs) to mitigate covariate imbalances. However, PS estimation grapples with the theoretical and practical challenges posed by high-dimensional covariates. Motivated by an integrative analysis of breast cancer patients across seven medical centers, this paper tackles the challenges of integrating multiple observational datasets. The proposed inferential technique, called Bayesian Motif Submatrices for Covariates (B-MSC), addresses the curse of dimensionality by a hybrid of Bayesian and frequentist approaches. B-MSC uses nonparametric Bayesian "Chinese restaurant" processes to eliminate redundancy in the high-dimensional covariates and discover latent motifs or lower-dimensional structures. With these motifs as potential predictors, standard regression techniques can be utilized to accurately infer the PSs and facilitate covariate-balanced group comparisons. Simulations and meta-analysis of the motivating cancer investigation demonstrate the efficacy of the B-MSC approach to accurately estimate the propensity scores and efficiently address covariate imbalance when integrating observational health studies with high-dimensional covariates.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Estimation of high-dimensional propensity scores with multiple exposure levels
    Eberg, Maria
    Platt, Robert W.
    Reynier, Pauline
    Filion, Kristian B.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2020, 29 : 53 - 60
  • [2] Regularized calibrated estimation of propensity scores with model misspecification and high-dimensional data
    Tan, Z.
    BIOMETRIKA, 2020, 107 (01) : 137 - 158
  • [3] Improving Variable Selection for High-Dimensional Propensity Scores
    Wyss, Richard
    Schneeweiss, Sebastian
    Eddings, Wesley
    van der Laan, Mark J.
    Franklin, Jessica M.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2016, 25 : 146 - 147
  • [4] Evaluation of propensity score methods for causal inference with high-dimensional covariates
    Gao, Qian
    Zhang, Yu
    Sun, Hongwei
    Wang, Tong
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (04)
  • [5] Penalized estimation for competing risks regression with applications to high-dimensional covariates
    Ambrogi, Federico
    Scheike, Thomas H.
    BIOSTATISTICS, 2016, 17 (04) : 708 - 721
  • [6] Regularized estimation in the accelerated failure time model with high-dimensional covariates
    Huang, Jian
    Ma, Shuangge
    Xie, Huiliang
    BIOMETRICS, 2006, 62 (03) : 813 - 820
  • [7] Mean response estimation with missing response in the presence of high-dimensional covariates
    Li, Yongjin
    Wang, Qihua
    Zhu, Liping
    Ding, Xiaobo
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (02) : 628 - 643
  • [8] Testing covariates in high-dimensional regression
    Lan, Wei
    Wang, Hansheng
    Tsai, Chih-Ling
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2014, 66 (02) : 279 - 301
  • [9] Testing covariates in high-dimensional regression
    Wei Lan
    Hansheng Wang
    Chih-Ling Tsai
    Annals of the Institute of Statistical Mathematics, 2014, 66 : 279 - 301
  • [10] Survival analysis with high-dimensional covariates
    Witten, Daniela M.
    Tibshirani, Robert
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2010, 19 (01) : 29 - 51