DeepProg: an ensemble of deep-learning and machine-learning models for prognosis prediction using multi-omics data

被引:0
作者
Olivier B. Poirion
Zheng Jing
Kumardeep Chaudhary
Sijia Huang
Lana X. Garmire
机构
[1] The Jackson Laboratory,Current address: Computational Sciences
[2] University of Hawaii Cancer Center,Current address: Department of Computational Medicine and Bioinformatics
[3] University of Michigan,Current address: Department of Genetics and Genomic Sciences
[4] Icahn School of Medicine at Mount Sinai,Current address: Department of Biostatistics, Epidemiology and Informatics
[5] University of Pennsylvania,undefined
来源
Genome Medicine | / 13卷
关键词
Survival; Prognosis; multi-omics; Cancer; Ensemble learning; Deep learning; Machine learning;
D O I
暂无
中图分类号
学科分类号
摘要
Multi-omics data are good resources for prognosis and survival prediction; however, these are difficult to integrate computationally. We introduce DeepProg, a novel ensemble framework of deep-learning and machine-learning approaches that robustly predicts patient survival subtypes using multi-omics data. It identifies two optimal survival subtypes in most cancers and yields significantly better risk-stratification than other multi-omics integration methods. DeepProg is highly predictive, exemplified by two liver cancer (C-index 0.73–0.80) and five breast cancer datasets (C-index 0.68–0.73). Pan-cancer analysis associates common genomic signatures in poor survival subtypes with extracellular matrix modeling, immune deregulation, and mitosis processes. DeepProg is freely available at https://github.com/lanagarmire/DeepProg
引用
收藏
相关论文
共 153 条
[1]  
Anaya J(2016)A pan-cancer analysis of prognostic genes PeerJ. 3 85-10562
[2]  
Reon B(2015)Methods of integrating data to uncover genotype--phenotype interactions Nat Rev Genet. 16 211-2912
[3]  
Chen W-M(2017)Pan-cancer analysis of systematic batch effects on somatic sequence variations BMC Bioinformatics. 18 11305-203
[4]  
Bekiranov S(2016)High-dimensional genomic data bias correction and data integration using MANCIE Nat Commun. 7 46-1212
[5]  
Dutta A(2015)Diagnostic biases in translational bioinformatics BMC Med Genomics. 8 10546-102
[6]  
Ritchie MD(2018)Multi-omic and multi-view clustering algorithms: review and cancer benchmark Nucleic Acids Res. 46 2906-10301
[7]  
Holzinger ER(2009)Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis Bioinformatics. 25 333-13555
[8]  
Li R(2014)Similarity network fusion for aggregating data types on a genomic scale Nat Methods. 11 185-3214
[9]  
Pendergrass SA(2018)Multi-Omics Factor Analysis a framework for unsupervised integration of multi-omics data sets Mol Syst Biol. 14 11966-74
[10]  
Kim D(2017)Integrated genomic characterization of pancreatic ductal adenocarcinoma Cancer Cell. 32 1202-387