Subtype-GAN: a deep learning approach for integrative cancer subtyping of multi-omics data

被引:81
作者
Yang, Hai [1 ]
Chen, Rui [2 ,3 ]
Li, Dongdong [1 ]
Wang, Zhe [1 ]
机构
[1] East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai 200237, Peoples R China
[2] Vanderbilt Univ, Dept Mol Physiol & Biophys, Nashville, TN 37240 USA
[3] Vanderbilt Univ, Vanderbilt Genet Inst, Nashville, TN 37240 USA
关键词
LATENT VARIABLE MODEL; GENOMIC CHARACTERIZATION; EXPRESSION; BREAST; CLASSIFICATION; DISCOVERY;
D O I
10.1093/bioinformatics/btab109
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The discovery of cancer subtyping can help explore cancer pathogenesis, determine clinical actionability in treatment, and improve patients' survival rates. However, due to the diversity and complexity of multi-omics data, it is still challenging to develop integrated clustering algorithms for tumor molecular subtyping. Results: We propose Subtype-GAN, a deep adversarial learning approach based on the multiple-input multiple-output neural network to model the complex omics data accurately. With the latent variables extracted from the neural network, Subtype-GAN uses consensus clustering and the Gaussian Mixture model to identify tumor samples' molecular subtypes. Compared with other state-of-the-art subtyping approaches, Subtype-GAN achieved outstanding performance on the benchmark datasets consisting of 4000 TCGA tumors from 10 types of cancer. We found that on the comparison dataset, the clustering scheme of Subtype-GAN is not always similar to that of the deep learning method AE but is identical to that of NEMO, MCCA, VAE and other excellent approaches. Finally, we applied Subtype-GAN to the BRCA dataset and automatically obtained the number of subtypes and the subtype labels of 1031 BRCA tumors. Through the detailed analysis, we found that the identified subtypes are clinically meaningful and show distinct patterns in the feature space, demonstrating the practicality of Subtype-GAN.
引用
收藏
页码:2231 / 2237
页数:7
相关论文
共 50 条
[21]   Multi-omics data integration for hepatocellular carcinoma subtyping with multi-kernel learning [J].
Wang, Jiaying ;
Miao, Yuting ;
Li, Lingmei ;
Wu, Yongqing ;
Ren, Yan ;
Cui, Yuehua ;
Cao, Hongyan .
FRONTIERS IN GENETICS, 2022, 13
[22]   Subtype-WGME enables whole-genome-wide multi-omics cancer subtyping [J].
Yang, Hai ;
Zhao, Liang ;
Li, Dongdong ;
An, Congcong ;
Fang, Xiaoyang ;
Chen, Yiwen ;
Liu, Jingping ;
Xiao, Ting ;
Wang, Zhe .
CELL REPORTS METHODS, 2024, 4 (06)
[23]   A Comprehensive Review of Deep Learning Applications with Multi-Omics Data in Cancer Research [J].
Sartori, Flavio ;
Codice, Francesco ;
Caranzano, Isabella ;
Rollo, Cesare ;
Birolo, Giovanni ;
Fariselli, Piero ;
Pancotti, Corrado .
GENES, 2025, 16 (06)
[24]   Supervised graph contrastive learning for cancer subtype identification through multi-omics data integration [J].
Chen, Fangxu ;
Peng, Wei ;
Dai, Wei ;
Wei, Shoulin ;
Fu, Xiaodong ;
Liu, Li ;
Liu, Lijun .
HEALTH INFORMATION SCIENCE AND SYSTEMS, 2024, 12 (01)
[25]   Molecular Subtyping of Cancer Based on Robust Graph Neural Network and Multi-Omics Data Integration [J].
Yin, Chaoyi ;
Cao, Yangkun ;
Sun, Peishuo ;
Zhang, Hengyuan ;
Li, Zhi ;
Xu, Ying ;
Sun, Huiyan .
FRONTIERS IN GENETICS, 2022, 13
[26]   Robust Prognostic Subtyping of Muscle-Invasive Bladder Cancer Revealed by Deep Learning-Based Multi-Omics Data Integration [J].
Zhang, Xiaolong ;
Wang, Jiayin ;
Lu, Jiabin ;
Su, Lili ;
Wang, Changxi ;
Huang, Yuhua ;
Zhang, Xuanping ;
Zhu, Xiaoyan .
FRONTIERS IN ONCOLOGY, 2021, 11
[27]   Sliced inverse regression for integrative multi-omics data analysis [J].
Jain, Yashita ;
Ding, Shanshan ;
Qiu, Jing .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2019, 18 (01)
[28]   HCNM: Heterogeneous Correlation Network Model for Multi-level Integrative Study of Multi-omics Data for Cancer Subtype Prediction [J].
Vangimalla, Reddy Rani ;
Sreevalsan-Nair, Jaya .
2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, :1880-1886
[29]   Integration strategies of multi-omics data for machine learning analysis [J].
Picard, Milan ;
Scott-Boyer, Marie -Pier ;
Bodein, Antoine ;
Perin, Olivier ;
Droit, Arnaud .
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 :3735-3746
[30]   Multi-Omics Data Fusion via a Joint Kernel Learning Model for Cancer Subtype Discovery and Essential Gene Identification [J].
Feng, Jie ;
Jiang, Limin ;
Li, Shuhao ;
Tang, Jijun ;
Wen, Lan .
FRONTIERS IN GENETICS, 2021, 12