Survival Analysis with High-Dimensional Omics Data Using a Threshold Gradient Descent Regularization-Based Neural Network Approach

Cited by: 1
Authors
Fan, Yu [1, 2]
Zhang, Sanguo [1, 2]
Ma, Shuangge [3 ]
Affiliations
[1] Univ Chinese Acad Sci, Sch Math Sci, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Key Lab Big Data Min & Knowledge Management, Beijing 100190, Peoples R China
[3] Yale Sch Publ Hlth, Dept Biostat, New Haven, CT 06511 USA
Funding
National Natural Science Foundation of China
Keywords
survival analysis; high-dimensional omics data; neural network; TGDR; SELECTION; CLASSIFICATION; SIGNATURE
DOI
10.3390/genes13091674
Chinese Library Classification (CLC) number
Q3 [Genetics]
Discipline classification code
071007; 090102
Abstract
Analysis of data with a censored survival response and high-dimensional omics measurements is now common. Most of the existing analyses are based on specific (semi)parametric models, in particular the Cox model. Such analyses may be limited by not having sufficient flexibility, for example, in accommodating nonlinearity. For categorical and continuous responses, neural networks (NNs) have provided a highly competitive alternative. Comparatively, NNs for censored survival data remain limited. Omics measurements are usually high-dimensional, and only a small subset is expected to be survival-associated. As such, regularized estimation and selection are needed. In the existing NN studies, this is usually achieved via penalization. In this article, we propose adopting the threshold gradient descent regularization (TGDR) technique, which has competitive performance (for example, when compared to penalization) and unique advantages in regression analysis, but has not been adopted with NNs. The TGDR-based NN has a highly sensible formulation and an architecture different from the unregularized and penalization-based ones. Simulations show its satisfactory performance. Its practical effectiveness is further established via the analysis of two cancer omics datasets. Overall, this study can provide a practical and useful new way in the NN paradigm for survival analysis with high-dimensional omics measurements.
Pages: 46
Related papers
50 records in total
  • [21] Filter and Wrapper Stacking Ensemble (FWSE): a robust approach for reliable biomarker discovery in high-dimensional omics data
    Budhraja, Sugam
    Doborjeh, Maryam
    Singh, Balkaran
    Tan, Samuel
    Doborjeh, Zohreh
    Lai, Edmund
    Merkin, Alexander
    Lee, Jimmy
    Goh, Wilson
    Kasabov, Nikola
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
  • [22] Massively parallelization strategy for material simulation using high-dimensional neural network potential
    Shang, Cheng
    Huang, Si-Da
    Liu, Zhi-Pan
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2019, 40 (10) : 1091 - 1096
  • [23] Using cross-validation to evaluate predictive accuracy of survival risk classifiers based on high-dimensional data
    Simon, Richard M.
    Subramanian, Jyothi
    Li, Ming-Chung
    Menezes, Supriya
    BRIEFINGS IN BIOINFORMATICS, 2011, 12 (03) : 203 - 214
  • [24] A Fast Nonnegative Autoencoder-Based Approach to Latent Feature Analysis on High-Dimensional and Incomplete Data
    Bi, Fanghui
    He, Tiantian
    Luo, Xin
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (03) : 733 - 746
  • [25] A kernel-based approach for detecting outliers of high-dimensional biological data
    Oh, Jung Hun
    Gao, Jean
    BMC BIOINFORMATICS, 2009, 10
  • [26] Integration of pathway structure information into a reweighted partial Cox regression approach for survival analysis on high-dimensional gene expression data
    Liu, Wei
    Wang, Qiuyu
    Zhao, Jianmei
    Zhang, Chunlong
    Liu, Yuejuan
    Zhang, Jian
    Bai, Xuefeng
    Li, Xuecang
    Feng, Houming
    Liao, Mingzhi
    Wang, Wei
    Li, Chunquan
    MOLECULAR BIOSYSTEMS, 2015, 11 (07) : 1876 - 1886
  • [27] High-dimensional data structure analysis using Self-Organising Maps
    Hodych, Oles
    Nikolski, Iouri
    Pasichnyk, Volodymyr
    Shcherbyna, Yuri
    2007 PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON THE EXPERIENCE OF DESIGNING AND APPLICATION OF CAD SYSTEMS IN MICROELECTRONICS, 2007, : 218 - 221
  • [28] Instrumental variable-based high-dimensional mediation analysis with unmeasured confounders for survival data in the observational epigenetic study
    Chen, Fangyao
    Hu, Weiwei
    Cai, Jiaxin
    Chen, Shiyu
    Si, Aima
    Zhang, Yuxiang
    Liu, Wei
    FRONTIERS IN GENETICS, 2023, 14
  • [29] A comparison of machine learning methods for survival analysis of high-dimensional clinical data for dementia prediction
    Spooner, Annette
    Chen, Emily
    Sowmya, Arcot
    Sachdev, Perminder
    Kochan, Nicole A.
    Trollor, Julian
    Brodaty, Henry
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [30] ML-Accelerated Yield Analysis Framework Using Regularization for Sparsity in High-Sigma and High-Dimensional Scenarios
    Xu, Haoran
    Fan, Haoran
    Jiang, Bo
    Chen, Jianfei
    Tong, Qiaoling
    Zou, Xuecheng
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (04) : 1161 - 1170