A Contrastive-Learning-Based Deep Neural Network for Cancer Subtyping by Integrating Multi-Omics Data

被引:0
|
作者
Chai, Hua [1 ]
Deng, Weizhen [1 ]
Wei, Junyu [1 ]
Guan, Ting [1 ]
He, Minfan [1 ]
Liang, Yong [3 ]
Li, Le [2 ,3 ]
机构
[1] Foshan Univ, Sch Math & Big Data, Foshan 528000, Peoples R China
[2] Macau Univ Sci & Technol, Fac Innovat Engn, Macau 999078, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Cancer subtype identification; Multi-omics data; Contrastive learning; Bioinformatics; EXPRESSION; POLYMORPHISMS;
D O I
10.1007/s12539-024-00641-y
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Accurate identification of cancer subtypes is crucial for disease prognosis evaluation and personalized patient management. Recent advances in computational methods have demonstrated that multi-omics data provides valuable insights into tumor molecular subtyping. However, the high dimensionality and small sample size of the data may result in ambiguous and overlapping cancer subtypes during clustering. In this study, we propose a novel contrastive-learning-based approach to address this issue. The proposed end-to-end deep learning method can extract crucial information from the multi-omics features by self-supervised learning for patient clustering. Results By applying our method to nine public cancer datasets, we have demonstrated superior performance compared to existing methods in separating patients with different survival outcomes (p < 0.05). To further evaluate the impact of various omics data on cancer survival, we developed an XGBoost classification model and found that mRNA had the highest importance score, followed by DNA methylation and miRNA. In the presented case study, our method successfully clustered subtypes and identified 14 cancer-related genes, of which 12 (85.7%) were validated through literature review. Conclusions Our findings demonstrate that our method is capable of identifying cancer subtypes that are both statistically and biologically significant. The code about COLCS is given at: https://github.com/Mercuriiio/COLCS.
引用
收藏
页码:966 / 975
页数:10
相关论文
共 50 条
  • [41] CLGSDN: Contrastive-Learning-Based Graph Structure Denoising Network for Traffic Prediction
    Peng, Peng
    Chen, Xuewen
    Zhang, Xudong
    Tang, Haina
    Shen, Hanji
    Li, Jun
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (07): : 8638 - 8652
  • [42] SALMON: Survival Analysis Learning With Multi-Omics Neural Networks on Breast Cancer
    Huang, Zhi
    Zhan, Xiaohui
    Xiang, Shunian
    Johnson, Travis S.
    Helm, Bryan
    Yu, Christina Y.
    Zhang, Jie
    Salama, Paul
    Rizkalla, Maher
    Han, Zhi
    Huang, Kun
    FRONTIERS IN GENETICS, 2019, 10
  • [43] Integrating multi-omics data reveals function and therapeutic potential of deubiquitinating enzymes
    Doherty, Laura M.
    Mills, Caitlin E.
    Boswell, Sarah A.
    Liu, Xiaoxi
    Hoyt, Charles Tapley
    Gyori, Benjamin
    Buhrlage, Sara J.
    Sorger, Peter K.
    Hauf, Silke
    ELIFE, 2022, 11
  • [44] MONet: cancer driver gene identification algorithm based on integrated analysis of multi-omics data and network models
    Ren, Yingzan
    Zhang, Tiantian
    Liu, Jian
    Ma, Fubin
    Chen, Jiaxin
    Li, Ponian
    Xiao, Guodong
    Sun, Chuanqi
    Zhang, Yusen
    EXPERIMENTAL BIOLOGY AND MEDICINE, 2025, 250
  • [45] Multi-Omics Data Integration Patient Classification Method Based on Deep Dense Residual Shrinkage Network
    Li, Wenyao
    Lin, Kai
    Hou, Yaqing
    Zhang, Qiang
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1098 - 1104
  • [46] Self-omics: A Self-supervised Learning Framework for Multi-omics Cancer Data
    Hashim, Sayed
    Nandakumar, Karthik
    Yaqub, Mohammad
    BIOCOMPUTING 2023, PSB 2023, 2023, : 263 - 274
  • [47] Moanna: Multi-Omics Autoencoder-Based Neural Network Algorithm for Predicting Breast Cancer Subtypes
    Lupat, Richard
    Perera, Rashindrie
    Loi, Sherene
    Li, Jason
    IEEE ACCESS, 2023, 11 : 10912 - 10924
  • [48] TMODINET: A trustworthy multi-omics dynamic learning integration network for cancer diagnostic
    Du, Ling
    Gao, Peipei
    Liu, Zhuang
    Yin, Nan
    Wang, Xiaochao
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2024, 113
  • [49] Multi-fusion strategy network-guided cancer subtypes discovering based on multi-omics data
    Liu, Jian
    Xue, Xinzheng
    Wen, Pengbo
    Song, Qian
    Yao, Jun
    Ge, Shuguang
    FRONTIERS IN GENETICS, 2024, 15
  • [50] A machine learning framework that integrates multi-omics data predicts cancer-related LncRNAs
    Yuan, Lin
    Zhao, Jing
    Sun, Tao
    Shen, Zhen
    BMC BIOINFORMATICS, 2021, 22 (01)