Deep learning;
Cancer classification;
Clustering;
Survival analysis;
Multi-omics data;
Contrastive learning;
Cancer analysis;
Dimensionality reduction;
ARTIFICIAL-INTELLIGENCE;
CANCER SUBTYPES;
IDENTIFICATION;
D O I:
10.1016/j.ins.2024.121864
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Cancer is a highly complex and fatal disease that affects various human organs. Early and accurate cancer analysis is crucial for timely treatment, prognosis, and understanding of the disease's development. Recent research utilizes deep learning-based models to combine multi-omics data for tasks such as cancer classification, clustering, and survival prediction. However, these models often overlook interactions between different types of data, which leads to suboptimal performance. In this paper, we present a Contrastive Multi-Modal Encoder (CMME) that integrates and maps multi-omics data into a lower-dimensional latent space, enabling the model to better understand relationships between different data types. The challenging distribution and organization of the data into anchors, positive samples, and negative samples encourage the model to learn synergies among different modalities, pay attention to both strong and weak modalities, and avoid biased learning. The performance of the proposed model is evaluated on downstream tasks such as clustering, classification, and survival prediction. The CMME achieved an accuracy of 98.16% and an F1 score of 98.09% in classifying breast cancer subtypes. For clustering tasks across ten cancer types based on TCGA data, the adjusted Rand index reached 0.966. Additionally, survival analysis results highlighted significant differences in survival rates between different cancer subtypes. The comprehensive qualitative and quantitative results demonstrate that the proposed method outperforms existing methods.
机构:
Chinese Acad Agr Sci, Biotechnol Res Inst, Beijing 100081, Peoples R ChinaChinese Acad Agr Sci, Biotechnol Res Inst, Beijing 100081, Peoples R China
机构:
European Mol Biol Lab, Heidelberg, GermanyEuropean Mol Biol Lab, European Bioinformat Inst, Cambridge, England
Velten, Britta
论文数: 引用数:
h-index:
机构:
Arnol, Damien
Dietrich, Sascha
论文数: 0引用数: 0
h-index: 0
机构:
Heidelberg Univ Hosp, Heidelberg, GermanyEuropean Mol Biol Lab, European Bioinformat Inst, Cambridge, England
Dietrich, Sascha
论文数: 引用数:
h-index:
机构:
Zenz, Thorsten
Marioni, John C.
论文数: 0引用数: 0
h-index: 0
机构:
European Mol Biol Lab, European Bioinformat Inst, Cambridge, England
Univ Cambridge, Canc Res UK Cambridge Inst, Cambridge, England
Wellcome Trust Sanger Inst, Cambridge, EnglandEuropean Mol Biol Lab, European Bioinformat Inst, Cambridge, England
Marioni, John C.
Buettner, Florian
论文数: 0引用数: 0
h-index: 0
机构:
European Mol Biol Lab, European Bioinformat Inst, Cambridge, England
German Res Ctr Environm Hlth, Helmholtz Zentrum Munchen, Inst Computat Biol, Neuherberg, GermanyEuropean Mol Biol Lab, European Bioinformat Inst, Cambridge, England
Buettner, Florian
Huber, Wolfgang
论文数: 0引用数: 0
h-index: 0
机构:
European Mol Biol Lab, Heidelberg, GermanyEuropean Mol Biol Lab, European Bioinformat Inst, Cambridge, England
Huber, Wolfgang
Stegle, Oliver
论文数: 0引用数: 0
h-index: 0
机构:
European Mol Biol Lab, European Bioinformat Inst, Cambridge, England
European Mol Biol Lab, Heidelberg, GermanyEuropean Mol Biol Lab, European Bioinformat Inst, Cambridge, England
机构:
Univ Penn, Perelman Sch Med, Grad Grp Genom & Computat Biol, 3700 Hamilton Walk, Philadelphia, PA 19104 USAUniv Penn, Perelman Sch Med, Grad Grp Genom & Computat Biol, 3700 Hamilton Walk, Philadelphia, PA 19104 USA
Ballard, Jenna L.
Wang, Zexuan
论文数: 0引用数: 0
h-index: 0
机构:
Univ Penn, Grad Grp Appl Math & Computat Sci, 209 S 33rd St, Philadelphia, PA 19104 USAUniv Penn, Perelman Sch Med, Grad Grp Genom & Computat Biol, 3700 Hamilton Walk, Philadelphia, PA 19104 USA
Wang, Zexuan
Li, Wenrui
论文数: 0引用数: 0
h-index: 0
机构:
Univ Connecticut, Dept Stat, 215 Glenbrook Rd, Storrs, CT 06269 USAUniv Penn, Perelman Sch Med, Grad Grp Genom & Computat Biol, 3700 Hamilton Walk, Philadelphia, PA 19104 USA
Li, Wenrui
Shen, Li
论文数: 0引用数: 0
h-index: 0
机构:
Univ Penn, Perelman Sch Med, Dept Biostat Epidemiol & Informat, 423 Guardian Dr, Philadelphia, PA 19104 USAUniv Penn, Perelman Sch Med, Grad Grp Genom & Computat Biol, 3700 Hamilton Walk, Philadelphia, PA 19104 USA
Shen, Li
Long, Qi
论文数: 0引用数: 0
h-index: 0
机构:
Univ Penn, Perelman Sch Med, Dept Biostat Epidemiol & Informat, 423 Guardian Dr, Philadelphia, PA 19104 USAUniv Penn, Perelman Sch Med, Grad Grp Genom & Computat Biol, 3700 Hamilton Walk, Philadelphia, PA 19104 USA
机构:
Key Laboratory of Scientific Computing and Intelligent Information Processing of Guangxi Universities, College of Computer and Information Engineering, Nanning Normal University, Nanning,530100, ChinaKey Laboratory of Scientific Computing and Intelligent Information Processing of Guangxi Universities, College of Computer and Information Engineering, Nanning Normal University, Nanning,530100, China
Zhong, Yating
Lin, Yanmei
论文数: 0引用数: 0
h-index: 0
机构:
Key Laboratory of Scientific Computing and Intelligent Information Processing of Guangxi Universities, College of Computer and Information Engineering, Nanning Normal University, Nanning,530100, ChinaKey Laboratory of Scientific Computing and Intelligent Information Processing of Guangxi Universities, College of Computer and Information Engineering, Nanning Normal University, Nanning,530100, China
Lin, Yanmei
Chen, Dingjia
论文数: 0引用数: 0
h-index: 0
机构:
Key Laboratory of Scientific Computing and Intelligent Information Processing of Guangxi Universities, College of Computer and Information Engineering, Nanning Normal University, Nanning,530100, ChinaKey Laboratory of Scientific Computing and Intelligent Information Processing of Guangxi Universities, College of Computer and Information Engineering, Nanning Normal University, Nanning,530100, China
Chen, Dingjia
Peng, Yuzhong
论文数: 0引用数: 0
h-index: 0
机构:
Key Laboratory of Scientific Computing and Intelligent Information Processing of Guangxi Universities, College of Computer and Information Engineering, Nanning Normal University, Nanning,530100, ChinaKey Laboratory of Scientific Computing and Intelligent Information Processing of Guangxi Universities, College of Computer and Information Engineering, Nanning Normal University, Nanning,530100, China
Peng, Yuzhong
Zeng, Yuanpeng
论文数: 0引用数: 0
h-index: 0
机构:
Key Laboratory of Scientific Computing and Intelligent Information Processing of Guangxi Universities, College of Computer and Information Engineering, Nanning Normal University, Nanning,530100, ChinaKey Laboratory of Scientific Computing and Intelligent Information Processing of Guangxi Universities, College of Computer and Information Engineering, Nanning Normal University, Nanning,530100, China
机构:
Xidian Univ, Sch Telecommun Engn, Xian 710071, Shaanxi, Peoples R China
Res Inst Air Firce, Beijing, Peoples R ChinaXidian Univ, Sch Telecommun Engn, Xian 710071, Shaanxi, Peoples R China
Lu, Yang
Li, Qin
论文数: 0引用数: 0
h-index: 0
机构:
Shenzhen Inst Informat Technol, Sch Software Engn, Shenzhen 518172, Peoples R ChinaXidian Univ, Sch Telecommun Engn, Xian 710071, Shaanxi, Peoples R China
Li, Qin
Zhang, Xiangdong
论文数: 0引用数: 0
h-index: 0
机构:
Xidian Univ, Sch Telecommun Engn, Xian 710071, Shaanxi, Peoples R ChinaXidian Univ, Sch Telecommun Engn, Xian 710071, Shaanxi, Peoples R China
Zhang, Xiangdong
Gao, Quanxue
论文数: 0引用数: 0
h-index: 0
机构:
Xidian Univ, Sch Telecommun Engn, Xian 710071, Shaanxi, Peoples R ChinaXidian Univ, Sch Telecommun Engn, Xian 710071, Shaanxi, Peoples R China
机构:
Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QCMolecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC
Picard M.
Scott-Boyer M.-P.
论文数: 0引用数: 0
h-index: 0
机构:
Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QCMolecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC
Scott-Boyer M.-P.
Bodein A.
论文数: 0引用数: 0
h-index: 0
机构:
Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QCMolecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC
Bodein A.
Périn O.
论文数: 0引用数: 0
h-index: 0
机构:
Digital Sciences Department, L'Oréal Advanced Research, Aulnay-sous-boisMolecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC
Périn O.
Droit A.
论文数: 0引用数: 0
h-index: 0
机构:
Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QCMolecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC