The spike-and-slab quantile LASSO for robust variable selection in cancer genomics studies

Cited by: 1
Authors
Liu, Yuwen [1]
Ren, Jie [2]
Ma, Shuangge [3]
Wu, Cen [1]
Affiliations
[1] Kansas State Univ, Dept Stat, Manhattan, KS 66506 USA
[2] Indiana Univ Sch Med, Dept Biostat & Hlth Data Sci, Indianapolis, IN USA
[3] Yale Univ, Dept Biostat, New Haven, CT USA
Funding
National Institutes of Health (USA)
Keywords
expectation-maximization (EM) algorithm; quantile LASSO; regularized Bayesian quantile regression; robust variable selection; spike-and-slab prior; GENERALIZED LINEAR-MODELS; REGRESSION SHRINKAGE; GENE; REGULARIZATION; TRANSPORTER; PREDICTION; LIKELIHOOD; INFERENCE; MELANOMA; DIRC2;
DOI
10.1002/sim.10196
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Data irregularity in cancer genomics studies has been widely observed in the form of outliers and heavy-tailed distributions in complex traits. In the past decade, robust variable selection methods have emerged as powerful alternatives to nonrobust ones for identifying important genes associated with heterogeneous disease traits and for building superior predictive models. In this study, to keep the remarkable features of the quantile LASSO and fully Bayesian regularized quantile regression while overcoming their disadvantages in the analysis of high-dimensional genomics data, we propose the spike-and-slab quantile LASSO through a fully Bayesian spike-and-slab formulation under the robust likelihood obtained by adopting the asymmetric Laplace distribution (ALD). The proposed robust method inherits the prominent properties of selective shrinkage and self-adaptivity to the sparsity pattern from the spike-and-slab LASSO (Rockova and George, J Am Stat Assoc, 2018, 113(521): 431-444). Furthermore, the spike-and-slab quantile LASSO has a computational advantage: it locates the posterior modes via soft-thresholding-rule-guided Expectation-Maximization (EM) steps in the coordinate descent framework, a phenomenon rarely observed for robust regularization with nondifferentiable loss functions. We have conducted comprehensive simulation studies with a variety of heavy-tailed errors in both homogeneous and heterogeneous model settings to demonstrate the superiority of the spike-and-slab quantile LASSO over competing methods. The advantage of the proposed method has been further demonstrated in case studies of the lung adenocarcinoma (LUAD) and skin cutaneous melanoma (SKCM) data from The Cancer Genome Atlas (TCGA).
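As a rough illustration of two ingredients named in the abstract, the sketch below shows the quantile check loss (which, up to scale and an additive constant, is the negative log-likelihood under an ALD working model) and the soft-thresholding operator used in LASSO-type coordinate updates. It is not the authors' implementation: the function names `check_loss` and `soft_threshold` and the parameters `tau` and `lam` are assumptions chosen for illustration, and the EM weighting that the paper derives from the spike-and-slab prior is not reproduced here.

```python
import numpy as np


def check_loss(residual, tau):
    """Quantile check loss rho_tau(u) = u * (tau - 1[u < 0]), elementwise."""
    u = np.asarray(residual, dtype=float)
    # Indicator of a negative residual, cast to float for the arithmetic.
    negative = (u < 0).astype(float)
    return u * (tau - negative)


def soft_threshold(z, lam):
    """Soft-thresholding operator sign(z) * max(|z| - lam, 0), elementwise."""
    z = np.asarray(z, dtype=float)
    return np.sign(z) * np.maximum(np.abs(z) - lam, 0.0)
```

For example, `check_loss([-2.0, 2.0], tau=0.25)` penalizes the negative residual three times as heavily as the positive one, and `soft_threshold([-3.0, 0.5, 2.0], lam=1.0)` shrinks large entries toward zero by `lam` while setting the small entry exactly to zero, which is the mechanism behind sparse selection.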
Pages: 4928-4983
Page count: 56
Related papers
50 in total
  • [1] Simultaneous Variable and Covariance Selection With the Multivariate Spike-and-Slab LASSO
    Deshpande, Sameer K.
    Rockova, Veronika
    George, Edward I.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2019, 28 (04) : 921 - 931
  • [2] The Spike-and-Slab LASSO
    Rockova, Veronika
    George, Edward I.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (521) : 431 - 444
  • [3] SPIKE-AND-SLAB LASSO BICLUSTERING
    Moran, Gemma E.
    Rockova, Veronika
    George, Edward I.
    ANNALS OF APPLIED STATISTICS, 2021, 15 (01) : 148 - 173
  • [4] The spike-and-slab lasso and scalable algorithm to accommodate multinomial outcomes in variable selection problems
    Leach, Justin M.
    Yi, Nengjun
    Aban, Inmaculada
    JOURNAL OF APPLIED STATISTICS, 2024, 51 (11) : 2039 - 2061
  • [5] Bayesian Bootstrap Spike-and-Slab LASSO
    Nie, Lizhen
    Rockova, Veronika
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (543) : 2013 - 2028
  • [6] Dynamic Variable Selection with Spike-and-Slab Process Priors
    Rockova, Veronika
    McAlinn, Kenichiro
    BAYESIAN ANALYSIS, 2021, 16 (01) : 233 - 269
  • [7] Bayesian Joint Spike-and-Slab Graphical Lasso
    Li, Zehang Richard
    McCormick, Tyler H.
    Clark, Samuel J.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [8] Robust variable selection based on the random quantile LASSO
    Wang, Yan
    Jiang, Yunlu
    Zhang, Jiantao
    Chen, Zhongran
    Xie, Baojian
    Zhao, Chengxiang
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (01) : 29 - 39
  • [9] The spike-and-slab lasso Cox model for survival prediction and associated genes detection
    Tang, Zaixiang
    Shen, Yueping
    Zhang, Xinyan
    Yi, Nengjun
    BIOINFORMATICS, 2017, 33 (18) : 2799 - 2807
  • [10] The Spike-and-Slab Lasso Generalized Linear Models for Prediction and Associated Genes Detection
    Tang, Zaixiang
    Shen, Yueping
    Zhang, Xinyan
    Yi, Nengjun
    GENETICS, 2017, 205 (01) : 77 - +