The spike-and-slab quantile LASSO for robust variable selection in cancer genomics studies

被引:1
作者
Liu, Yuwen [1 ]
Ren, Jie [2 ]
Ma, Shuangge [3 ]
Wu, Cen [1 ]
机构
[1] Kansas State Univ, Dept Stat, Manhattan, KS 66506 USA
[2] Indiana Univ Sch Med, Dept Biostat & Hlth Data Sci, Indianapolis, IN USA
[3] Yale Univ, Dept Biostat, New Haven, CT USA
基金
美国国家卫生研究院;
关键词
expectation-maximization (EM) algorithm; quantile LASSO; regularized Bayesian quantile regression; robust variable selection; spike-and-slab prior; GENERALIZED LINEAR-MODELS; REGRESSION SHRINKAGE; GENE; REGULARIZATION; TRANSPORTER; PREDICTION; LIKELIHOOD; INFERENCE; MELANOMA; DIRC2;
D O I
10.1002/sim.10196
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Data irregularity in cancer genomics studies has been widely observed in the form of outliers and heavy-tailed distributions in the complex traits. In the past decade, robust variable selection methods have emerged as powerful alternatives to the nonrobust ones to identify important genes associated with heterogeneous disease traits and build superior predictive models. In this study, to keep the remarkable features of the quantile LASSO and fully Bayesian regularized quantile regression while overcoming their disadvantage in the analysis of high-dimensional genomics data, we propose the spike-and-slab quantile LASSO through a fully Bayesian spike-and-slab formulation under the robust likelihood by adopting the asymmetric Laplace distribution (ALD). The proposed robust method has inherited the prominent properties of selective shrinkage and self-adaptivity to the sparsity pattern from the spike-and-slab LASSO (Rockova and George, J Am Stat Associat, 2018, 113(521): 431-444). Furthermore, the spike-and-slab quantile LASSO has a computational advantage to locate the posterior modes via soft-thresholding rule guided Expectation-Maximization (EM) steps in the coordinate descent framework, a phenomenon rarely observed for robust regularization with nondifferentiable loss functions. We have conducted comprehensive simulation studies with a variety of heavy-tailed errors in both homogeneous and heterogeneous model settings to demonstrate the superiority of the spike-and-slab quantile LASSO over its competing methods. The advantage of the proposed method has been further demonstrated in case studies of the lung adenocarcinomas (LUAD) and skin cutaneous melanoma (SKCM) data from The Cancer Genome Atlas (TCGA).
引用
收藏
页码:4928 / 4983
页数:56
相关论文
共 33 条
[21]   Weighted LAD-LASSO method for robust parameter estimation and variable selection in regression [J].
Arslan, Olcay .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (06) :1952-1965
[22]   A Lasso-type Robust Variable Selection for Time-Course Microarray Data [J].
Kim, Ji Young .
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2015, 44 (07) :1411-1425
[23]   WLAD-LASSO method for robust estimation and variable selection in partially linear models [J].
Yang, Hu ;
Li, Ning .
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2018, 47 (20) :4958-4976
[24]   Outlier detection and robust variable selection via the penalized weighted LAD-LASSO method [J].
Jiang, Yunlu ;
Wang, Yan ;
Zhang, Jiantao ;
Xie, Baojian ;
Liao, Jibiao ;
Liao, Wenhui .
JOURNAL OF APPLIED STATISTICS, 2021, 48 (02) :234-246
[25]   Quantile regression for robust estimation and variable selection in partially linear varying-coefficient models [J].
Yang, Jing ;
Lu, Fang ;
Yang, Hu .
STATISTICS, 2017, 51 (06) :1179-1199
[26]   From mixed effects modeling to spike and slab variable selection: A Bayesian regression model for group testing data [J].
Joyner, Chase N. ;
McMahan, Christopher S. ;
Tebbs, Joshua M. ;
Bilder, Christopher R. .
BIOMETRICS, 2020, 76 (03) :913-923
[27]   Robust Adaptive Lasso method for parameter's estimation and variable selection in high-dimensional sparse models [J].
Wahid, Abdul ;
Khan, Dost Muhammad ;
Hussain, Ijaz .
PLOS ONE, 2017, 12 (08)
[28]   Robust variable selection in high-dimensional varying coefficient models based on weighted composite quantile regression [J].
Guo, Chaohui ;
Yang, Hu ;
Lv, Jing .
STATISTICAL PAPERS, 2017, 58 (04) :1009-1033
[29]   Sparse Regression in Cancer Genomics: Comparing Variable Selection and Predictions in Real World Data [J].
O'Shea, Robert J. ;
Tsoka, Sophia ;
Cook, Gary J. R. ;
Goh, Vicky .
CANCER INFORMATICS, 2021, 20
[30]   Robust Variable Selection Based on Penalized Composite Quantile Regression for High-Dimensional Single-Index Models [J].
Song, Yunquan ;
Li, Zitong ;
Fang, Minglu .
MATHEMATICS, 2022, 10 (12)