Subtype-GAN: a deep learning approach for integrative cancer subtyping of multi-omics data

被引:81
作者
Yang, Hai [1 ]
Chen, Rui [2 ,3 ]
Li, Dongdong [1 ]
Wang, Zhe [1 ]
机构
[1] East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai 200237, Peoples R China
[2] Vanderbilt Univ, Dept Mol Physiol & Biophys, Nashville, TN 37240 USA
[3] Vanderbilt Univ, Vanderbilt Genet Inst, Nashville, TN 37240 USA
关键词
LATENT VARIABLE MODEL; GENOMIC CHARACTERIZATION; EXPRESSION; BREAST; CLASSIFICATION; DISCOVERY;
D O I
10.1093/bioinformatics/btab109
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The discovery of cancer subtyping can help explore cancer pathogenesis, determine clinical actionability in treatment, and improve patients' survival rates. However, due to the diversity and complexity of multi-omics data, it is still challenging to develop integrated clustering algorithms for tumor molecular subtyping. Results: We propose Subtype-GAN, a deep adversarial learning approach based on the multiple-input multiple-output neural network to model the complex omics data accurately. With the latent variables extracted from the neural network, Subtype-GAN uses consensus clustering and the Gaussian Mixture model to identify tumor samples' molecular subtypes. Compared with other state-of-the-art subtyping approaches, Subtype-GAN achieved outstanding performance on the benchmark datasets consisting of 4000 TCGA tumors from 10 types of cancer. We found that on the comparison dataset, the clustering scheme of Subtype-GAN is not always similar to that of the deep learning method AE but is identical to that of NEMO, MCCA, VAE and other excellent approaches. Finally, we applied Subtype-GAN to the BRCA dataset and automatically obtained the number of subtypes and the subtype labels of 1031 BRCA tumors. Through the detailed analysis, we found that the identified subtypes are clinically meaningful and show distinct patterns in the feature space, demonstrating the practicality of Subtype-GAN.
引用
收藏
页码:2231 / 2237
页数:7
相关论文
共 50 条
[41]   Bayesian tensor factorization-drive breast cancer subtyping by integrating multi-omics data [J].
Liu, Qian ;
Cheng, Bowen ;
Jin, Yongwon ;
Hu, Pingzhao .
JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 125
[42]   Capturing the latent space of an Autoencoder for multi-omics integration and cancer subtyping [J].
Madhumita ;
Paul, Sushmita .
COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 148
[43]   OmiEmbed: A Unified Multi-Task Deep Learning Framework for Multi-Omics Data [J].
Zhang, Xiaoyu ;
Xing, Yuting ;
Sun, Kai ;
Guo, Yike .
CANCERS, 2021, 13 (12)
[44]   Comprehensive Evaluation of Multi-Omics Clustering Algorithms for Cancer Molecular Subtyping [J].
Wang, Juan ;
Wang, Lingxiao ;
Liu, Yi ;
Li, Xiao ;
Ma, Jie ;
Li, Mansheng ;
Zhu, Yunping .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2025, 26 (03)
[45]   Interactive gene identification for cancer subtyping based on multi-omics clustering [J].
Ye, Xiucai ;
Shi, Tianyi ;
Cui, Yaxuan ;
Sakurai, Tetsuya .
METHODS, 2023, 211 :61-67
[46]   Integrative Network Fusion: A Multi-Omics Approach in Molecular Profiling [J].
Chierici, Marco ;
Bussola, Nicole ;
Marcolini, Alessia ;
Francescatto, Margherita ;
Zandona, Alessandro ;
Trastulla, Lucia ;
Agostinelli, Claudio ;
Jurman, Giuseppe ;
Furlanello, Cesare .
FRONTIERS IN ONCOLOGY, 2020, 10
[47]   Integrative Multi-Omics Approach for Improving Causal Gene Identification [J].
King, Austin ;
Wu, Chong .
GENETIC EPIDEMIOLOGY, 2025, 49 (01)
[48]   The benefits of smoking cessation on survival in cancer patients by integrative analysis of multi-omics data [J].
Yang, Sheng ;
Liu, Tong ;
Liang, Geyu .
MOLECULAR ONCOLOGY, 2020, 14 (09) :2069-2080
[49]   Functional Analysis of Molecular Subtypes with Deep Similarity Learning Model Based on Multi-omics Data [J].
Liu, Shuhui ;
Zhang Yupei ;
Shang, Xuequn .
INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2022, PT II, 2022, 13394 :126-137
[50]   Generalized Bayesian Factor Analysis for Integrative Clustering with Applications to Multi-Omics Data [J].
Min, Eun Jeong ;
Chang, Changgee ;
Long, Qi .
2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, :109-119