Multi-fusion strategy network-guided cancer subtypes discovering based on multi-omics data

被引:0
作者
Liu, Jian [1 ]
Xue, Xinzheng [1 ]
Wen, Pengbo [2 ]
Song, Qian [3 ]
Yao, Jun [4 ]
Ge, Shuguang [2 ]
机构
[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou, Peoples R China
[2] Xuzhou Med Univ, Sch Med Informat & Engn, Xuzhou, Peoples R China
[3] Taizhou Canc Hosp, Dept Gynecol & Obstet, Wenling, Peoples R China
[4] Taizhou Canc Hosp, Dept Colorectal Surg, Wenling, Peoples R China
基金
中国国家自然科学基金;
关键词
cancer subtypes discovering; multi-omics data; clustering; deep learning; fusion strategy; BREAST;
D O I
10.3389/fgene.2024.1466825
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Introduction The combination of next-generation sequencing technology and Cancer Genome Atlas (TCGA) data provides unprecedented opportunities for the discovery of cancer subtypes. Through comprehensive analysis and in-depth analysis of the genomic data of a large number of cancer patients, researchers can more accurately identify different cancer subtypes and reveal their molecular heterogeneity.Methods In this paper, we propose the SMMSN (Self-supervised Multi-fusion Strategy Network) model for the discovery of cancer subtypes. SMMSN can not only fuse multi-level data representations of single omics data by Graph Convolutional Network (GCN) and Stacked Autoencoder Network (SAE), but also achieve the organic fusion of multi- -omics data through multiple fusion strategies. In response to the problem of lack label information in multi-omics data, SMMSN propose to use dual self-supervise method to cluster cancer subtypes from the integrated data.Results We conducted experiments on three labeled and five unlabeled multi-omics datasets to distinguish potential cancer subtypes. Kaplan Meier survival curves and other results showed that SMMSN can obtain cancer subtypes with significant differences.Discussion In the case analysis of Glioblastoma Multiforme (GBM) and Breast Invasive Carcinoma (BIC), we conducted survival time and age distribution analysis, drug response analysis, differential expression analysis, functional enrichment analysis on the predicted cancer subtypes. The research results showed that SMMSN can discover clinically meaningful cancer subtypes.
引用
收藏
页数:17
相关论文
共 46 条
  • [1] A pan-cancer proteomic analysis of The Cancer Genome Atlas (TCGA) project
    Akbani, Rehan
    Ng, Kwok-Shing
    Werner, Henrica M.
    Zhang, Fan
    Ju, Zhenlin
    Liu, Wenbin
    Yang, Ji-Yeon
    Lu, Yiling
    Weinstein, John N.
    Mills, Gordon B.
    [J]. CANCER RESEARCH, 2014, 74 (19)
  • [2] GS-TCGA: Gene Set-Based Analysis of The Cancer Genome Atlas
    Baird, Tarrion
    Roychoudhuri, Rahul
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2024, 31 (03) : 229 - 240
  • [3] Bennett DA, 2012, CURR ALZHEIMER RES, V9, P628
  • [4] Bruna J, 2014, Arxiv, DOI arXiv:1312.6203
  • [5] Chen JF, 2018, Arxiv, DOI [arXiv:1710.10568, DOI 10.48550/ARXIV.1710.10568]
  • [6] Deep-learning approach to identifying cancer subtypes using high-dimensional genomic data
    Chen, Runpu
    Yang, Le
    Goodison, Steve
    Sun, Yijun
    [J]. BIOINFORMATICS, 2020, 36 (05) : 1476 - 1483
  • [7] Bioinformatic analysis reveals lysosome-related biomarkers and molecular subtypes in preeclampsia: novel insights into the pathogenesis of preeclampsia
    Chen, Yao
    Liu, Miao
    Wang, Yonghong
    [J]. FRONTIERS IN GENETICS, 2023, 14
  • [8] Dai HJ, 2018, PR MACH LEARN RES, V80
  • [9] A multi-omic atlas of the human frontal cortex for aging and Alzheimer's disease research
    De Jager, Philip L.
    Ma, Yiyi
    McCabe, Cristin
    Xu, Jishu
    Vardarajan, Badri N.
    Felsky, Daniel
    Klein, Hans-Ulrich
    White, Charles C.
    Peters, Mette A.
    Lodgson, Ben
    Nejad, Parham
    Tang, Anna
    Mangravite, Lara M.
    Yu, Lei
    Gaiteri, Chris
    Mostafavi, Sara
    Schneider, Julie A.
    Bennett, David A.
    [J]. SCIENTIFIC DATA, 2018, 5
  • [10] Defferrard M, 2016, ADV NEUR IN, V29