Improved survival analysis by learning shared genomic information from pan-cancer data

被引:36
|
作者
Kim, Sunkyu [1 ]
Kim, Keonwoo [1 ]
Choe, Junseok [1 ]
Lee, Inggeol [1 ]
Kang, Jaewoo [1 ,2 ]
机构
[1] Korea Univ, Coll Informat, Dept Comp Sci & Engn, Seoul 02841, South Korea
[2] Korea Univ, Coll Informat, Interdisciplinary Grad Program Bioinformat, Seoul 02841, South Korea
基金
新加坡国家研究基金会;
关键词
PROGNOSTIC MARKER;
D O I
10.1093/bioinformatics/btaa462
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Recent advances in deep learning have offered solutions to many biomedical tasks. However, there remains a challenge in applying deep learning to survival analysis using human cancer transcriptome data. As the number of genes, the input variables of survival model, is larger than the amount of available cancer patient samples, deep-learning models are prone to overfitting. To address the issue, we introduce a new deep-learning architecture called VAECox. VAECox uses transfer learning and fine tuning. Results: We pre-trained a variational autoencoder on all RNA-seq data in 20 TCGA datasets and transferred the trained weights to our survival prediction model. Then we fine-tuned the transferred weights during training the survival model on each dataset. Results show that our model outperformed other previous models such as Cox Proportional Hazard with LASSO and ridge penalty and Cox-nnet on the 7 of 10 TCGA datasets in terms of C-index. The results signify that the transferred information obtained from entire cancer transcriptome data helped our survival prediction model reduce overfitting and show robust performance in unseen cancer patient samples.
引用
收藏
页码:389 / 398
页数:10
相关论文
共 50 条
  • [21] Multi-omic integrated curvature study on pan-cancer genomic data
    Jiening Zhu
    Anh Phong Tran
    Joseph O. Deasy
    Allen Tannenbaum
    Mathematics of Control, Signals, and Systems, 2024, 36 : 101 - 120
  • [22] The molecular and genomic drivers of immunologically cold tumors: Novel discoveries from pan-cancer analysis
    Han, Guangchun
    Wang, Ruiping
    Shen, Jianfeng
    Singh, Pankaj
    Zhang, Shaojun
    Rao, Arvind
    Gao, Jianjun
    Lazar, Alexander
    Futreal, Andrew
    Peng, Guang
    Rai, Kunal
    Wang, Linghua
    CANCER RESEARCH, 2019, 79 (13)
  • [23] Pan-cancer analysis of neoepitopes
    Gabriel N. Teku
    Mauno Vihinen
    Scientific Reports, 8
  • [24] Pan-cancer analysis of neoepitopes
    Teku, Gabriel N.
    Vihinen, Mauno
    SCIENTIFIC REPORTS, 2018, 8
  • [25] Pan-cancer analysis of genomic and transcriptomic data reveals the prognostic relevance of human proteasome genes in different cancer types
    Larsson, Peter
    Pettersson, Daniella
    Engqvist, Hanna
    Ronnerman, Elisabeth Werner
    Forssell-Aronsson, Eva
    Kovacs, Aniko
    Karlsson, Per
    Helou, Khalil
    Parris, Toshima Z.
    BMC CANCER, 2022, 22 (01)
  • [26] Pan-cancer analysis of genomic and transcriptomic data reveals the prognostic relevance of human proteasome genes in different cancer types
    Peter Larsson
    Daniella Pettersson
    Hanna Engqvist
    Elisabeth Werner Rönnerman
    Eva Forssell-Aronsson
    Anikó Kovács
    Per Karlsson
    Khalil Helou
    Toshima Z. Parris
    BMC Cancer, 22
  • [27] A comprehensive genomic pan-cancer classification using The Cancer Genome Atlas gene expression data
    Li, Yuanyuan
    Kang, Kai
    Krahn, Juno M.
    Croutwater, Nicole
    Lee, Kevin
    Umbach, David M.
    Li, Leping
    BMC GENOMICS, 2017, 18
  • [28] A comprehensive genomic pan-cancer classification using The Cancer Genome Atlas gene expression data
    Yuanyuan Li
    Kai Kang
    Juno M. Krahn
    Nicole Croutwater
    Kevin Lee
    David M. Umbach
    Leping Li
    BMC Genomics, 18
  • [29] Machine learning for pan-cancer classification based on RNA sequencing data
    Stancl, Paula
    Karlic, Rosa
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2023, 10
  • [30] Pan-cancer genomic analyses uncover molecular drivers
    David Killock
    Nature Reviews Clinical Oncology, 2018, 15 : 263 - 263