NETWORK-REGULARIZED HIGH-DIMENSIONAL COX REGRESSION FOR ANALYSIS OF GENOMIC DATA

被引:55
|
作者
Sun, Hokeun [1 ]
Lin, Wei [2 ]
Feng, Rui [2 ]
Li, Hongzhe [2 ]
机构
[1] Pusan Natl Univ, Dept Stat, Pusan 609735, South Korea
[2] Univ Penn, Perelman Sch Med, Dept Biostat & Epidemiol, Philadelphia, PA 19104 USA
关键词
Laplacian penalty; network analysis; regularization; sparsity; survival data; variable selection; weak oracle property; PROPORTIONAL HAZARDS MODEL; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE SELECTION; DANTZIG SELECTOR; ADAPTIVE LASSO; EXPRESSION; METASTASIS; SHRINKAGE;
D O I
10.5705/ss.2012.317
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider estimation and variable selection in high-dimensional Cox regression when a prior knowledge of the relationships among the covariates, described by a network or graph, is available. A limitation of the existing methodology for survival analysis with high-dimensional genomic data is that a wealth of structural information about many biological processes, such as regulatory networks and pathways, has often been ignored. In order to incorporate such prior network information into the analysis of genomic data, we propose a network-based regularization method for high-dimensional Cox regression; it uses an l(1)-penalty to induce sparsity of the regression coefficients and a quadratic Laplacian penalty to encourage smoothness between the coefficients of neighboring variables on a given network. The proposed method is implemented by an efficient coordinate descent algorithm. In the setting where the dimensionality p can grow exponentially fast with the sample size n, we establish model selection consistency and estimation bounds for the proposed estimators. The theoretical results provide insights into the gain from taking into account the network structural information. Extensive simulation studies indicate that our method outperforms Lasso and elastic net in terms of variable selection accuracy and stability. We apply our method to a breast cancer gene expression study and identify several biologically plausible subnetworks and pathways that are associated with breast cancer distant metastasis.
引用
收藏
页码:1433 / 1459
页数:27
相关论文
共 50 条
  • [1] Prognostic scoring system for osteosarcoma using network-regularized high-dimensional Cox-regression analysis and potential therapeutic targets
    Goh, Tae Sik
    Lee, Jung Sub
    Kim, Jeung Il
    Park, Yong Geon
    Pak, Kyoungjune
    Jeong, Dae Cheon
    Oh, Sae-Ock
    Kim, Yun Hak
    JOURNAL OF CELLULAR PHYSIOLOGY, 2019, 234 (08) : 13851 - 13857
  • [2] Doubly regularized Cox regression for high-dimensional survival data with group structures
    Wu, Tong Tong
    Wang, Sijian
    STATISTICS AND ITS INTERFACE, 2013, 6 (02) : 175 - 186
  • [3] On constrained and regularized high-dimensional regression
    Xiaotong Shen
    Wei Pan
    Yunzhang Zhu
    Hui Zhou
    Annals of the Institute of Statistical Mathematics, 2013, 65 : 807 - 832
  • [4] On constrained and regularized high-dimensional regression
    Shen, Xiaotong
    Pan, Wei
    Zhu, Yunzhang
    Zhou, Hui
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2013, 65 (05) : 807 - 832
  • [5] Partial Cox regression analysis for high-dimensional microarray gene expression data
    Li, Hongzhe
    Gui, Jiang
    BIOINFORMATICS, 2004, 20 : 208 - 215
  • [6] Robust regularized cluster analysis for high-dimensional data
    Kalina, Jan
    Vlckova, Katarina
    MATHEMATICAL METHODS IN ECONOMICS (MME 2014), 2014, : 378 - 383
  • [7] A connected network-regularized logistic regression model for feature selection
    Lingyu Li
    Zhi-Ping Liu
    Applied Intelligence, 2022, 52 : 11672 - 11702
  • [8] The nonparametric Box-Cox model for high-dimensional regression analysis
    Zhou, He
    Zou, Hui
    JOURNAL OF ECONOMETRICS, 2024, 239 (02)
  • [9] A connected network-regularized logistic regression model for feature selection
    Li, Lingyu
    Liu, Zhi-Ping
    APPLIED INTELLIGENCE, 2022, 52 (10) : 11672 - 11702
  • [10] Forward regression for Cox models with high-dimensional covariates
    Hong, Hyokyoung G.
    Zheng, Qi
    Li, Yi
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 173 : 268 - 290