Survival Analysis with High-Dimensional Omics Data Using a Threshold Gradient Descent Regularization-Based Neural Network Approach

被引:1
|
作者
Fan, Yu [1 ,2 ]
Zhang, Sanguo [1 ,2 ]
Ma, Shuangge [3 ]
机构
[1] Univ Chinese Acad Sci, Sch Math Sci, Beijing 100049, Peoples R China
[2] Chinese Acad Sci, Key Lab Big Data Min & Knowledge Management, Beijing 100190, Peoples R China
[3] Yale Sch Publ Hlth, Dept Biostat, New Haven, CT 06511 USA
基金
中国国家自然科学基金;
关键词
survival analysis; high-dimensional omics data; neural network; TGDR; SELECTION; CLASSIFICATION; SIGNATURE;
D O I
10.3390/genes13091674
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Analysis of data with a censored survival response and high-dimensional omics measurements is now common. Most of the existing analyses are based on specific (semi)parametric models, in particular the Cox model. Such analyses may be limited by not having sufficient flexibility, for example, in accommodating nonlinearity. For categorical and continuous responses, neural networks (NNs) have provided a highly competitive alternative. Comparatively, NNs for censored survival data remain limited. Omics measurements are usually high-dimensional, and only a small subset is expected to be survival-associated. As such, regularized estimation and selection are needed. In the existing NN studies, this is usually achieved via penalization. In this article, we propose adopting the threshold gradient descent regularization (TGDR) technique, which has competitive performance (for example, when compared to penalization) and unique advantages in regression analysis, but has not been adopted with NNs. The TGDR-based NN has a highly sensible formulation and an architecture different from the unregularized and penalization-based ones. Simulations show its satisfactory performance. Its practical effectiveness is further established via the analysis of two cancer omics datasets. Overall, this study can provide a practical and useful new way in the NN paradigm for survival analysis with high-dimensional omics measurements.
引用
收藏
页数:46
相关论文
共 50 条
  • [1] Structured sparsity regularization for analyzing high-dimensional omics data
    Vinga, Susana
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (01) : 77 - 87
  • [2] Analysis of high-dimensional genomic data using MapReduce based probabilistic neural network
    Baliarsingh, Santos Kumar
    Vipsita, Swati
    Gandomi, Amir H.
    Panda, Abhijeet
    Bakshi, Sambit
    Ramasubbareddy, Somula
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 195
  • [3] Contrastive learning enhanced deep neural network with serial regularization for high-dimensional tabular data
    Wu, Yao
    Zhu, Donghua
    Wang, Xuefeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228
  • [4] Deep Learning-Based Survival Analysis for High-Dimensional Survival Data
    Hao, Lin
    Kim, Juncheol
    Kwon, Sookhee
    Ha, Il Do
    MATHEMATICS, 2021, 9 (11)
  • [5] AucPR: An AUC-based approach using penalized regression for disease prediction with high-dimensional omics data
    Yu, Wenbao
    Park, Taesung
    BMC GENOMICS, 2014, 15
  • [6] Shrinkage-based regularization tests for high-dimensional data with application to gene set analysis
    Shen, Yanfeng
    Lin, Zhengyan
    Zhu, Jun
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (07) : 2221 - 2233
  • [7] A Normality Test for High-dimensional Data Based on the Nearest Neighbor Approach
    Chen, Hao
    Xia, Yin
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (541) : 719 - 731
  • [8] High-dimensional omics data analysis using a variable screening protocol with prior knowledge integration (SKI)
    Liu, Cong
    Jiang, Jianping
    Gu, Jianlei
    Yu, Zhangsheng
    Wang, Tao
    Lu, Hui
    BMC SYSTEMS BIOLOGY, 2016, 10
  • [9] Adaptive threshold-based classification of sparse high-dimensional data
    Pavlenko, Tatjana
    Stepanova, Natalia
    Thompson, Lee
    ELECTRONIC JOURNAL OF STATISTICS, 2022, 16 (01): : 1952 - 1996
  • [10] A two-stage approach of gene network analysis for high-dimensional heterogeneous data
    Lee, Sangin
    Liang, Faming
    Cai, Ling
    Xiao, Guanghua
    BIOSTATISTICS, 2018, 19 (02) : 216 - 232