Supervised graph contrastive learning for cancer subtype identification through multi-omics data integration

被引:6
作者
Chen, Fangxu [1 ,2 ]
Peng, Wei [1 ,2 ]
Dai, Wei [1 ,2 ]
Wei, Shoulin [1 ,2 ]
Fu, Xiaodong [1 ,2 ]
Liu, Li [1 ,2 ]
Liu, Lijun [1 ,2 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China
[2] Kunming Univ Sci & Technol, Comp Technol Applicat Key Lab Yunnan Prov, Kunming 650050, Peoples R China
基金
中国国家自然科学基金;
关键词
Cancer-subtype classification; Graph contrastive learning; Multi-omics integration; TUMOR HETEROGENEITY; BREAST;
D O I
10.1007/s13755-024-00274-x
中图分类号
R-058 [];
学科分类号
摘要
Cancer is one of the most deadly diseases in the world. Accurate cancer subtype classification is critical for patient diagnosis, treatment, and prognosis. Ever-increasing multi-omics data describes the characteristics of the patients from different views and serves as complementary information to promote cancer subtype identification. However, omics data generally have different distributions and high dimensions. How to effectively integrate multiple omics data to classify cancer subtypes accurately is a challenge for researchers. This work proposes a method integrating multi-omics data based on supervised graph contrast learning (MCRGCN) to classify cancer subtypes. The method considers the unique feature distribution of each omics data and the interaction of different omics data features to improve the accuracy of cancer subtype classification. To achieve this, MCRGCN first constructs different sample networks based on the multi-omics data of the samples. Then, it puts the omics data and adjacency matrix of the sample into different residual graph convolution models to get multi-omics features of the samples, which are trained with a supervised comparison loss to maintain that the sample features of each omics should be as consistent as possible. Finally, we input the sample features combining multi-omics features into a classifier to obtain the cancer subtypes. We applied MCRGCN to the invasive breast carcinoma (BRCA) and glioblastoma multiforme (GBM) datasets, integrating gene expression, miRNA expression, and DNA methylation data. The results demonstrate that our model is superior to other methods in integrating multi-omics data. Moreover, the results of survival analysis experiments demonstrate that the cancer subtypes identified by our model have significant clinical features. Furthermore, our model can help to identify potential biomarkers and pathways associated with cancer subtypes.
引用
收藏
页数:12
相关论文
共 31 条
[1]   CVAE-GAN: Fine-Grained Image Generation through Asymmetric Training [J].
Bao, Jianmin ;
Chen, Dong ;
Wen, Fang ;
Li, Houqiang ;
Hua, Gang .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2764-2773
[2]   Tumour heterogeneity in the clinic [J].
Bedard, Philippe L. ;
Hansen, Aaron R. ;
Ratain, Mark J. ;
Siu, Lillian L. .
NATURE, 2013, 501 (7467) :355-364
[3]   The emerging clinical relevance of genomics in cancer medicine [J].
Berger, Michael F. ;
Mardis, Elaine R. .
NATURE REVIEWS CLINICAL ONCOLOGY, 2018, 15 (06) :353-365
[4]   Drug-Target Interactions Prediction Based on Signed Heterogeneous Graph Neural Networks [J].
Chen, Ming ;
Jiang, Yajian ;
Lei, Xiujuan ;
Pan, Yi ;
Ji, Chunyan ;
Jiang, Wei .
CHINESE JOURNAL OF ELECTRONICS, 2024, 33 (01) :231-244
[5]   The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups [J].
Curtis, Christina ;
Shah, Sohrab P. ;
Chin, Suet-Feung ;
Turashvili, Gulisa ;
Rueda, Oscar M. ;
Dunning, Mark J. ;
Speed, Doug ;
Lynch, Andy G. ;
Samarajiwa, Shamith ;
Yuan, Yinyin ;
Graef, Stefan ;
Ha, Gavin ;
Haffari, Gholamreza ;
Bashashati, Ali ;
Russell, Roslin ;
McKinney, Steven ;
Langerod, Anita ;
Green, Andrew ;
Provenzano, Elena ;
Wishart, Gordon ;
Pinder, Sarah ;
Watson, Peter ;
Markowetz, Florian ;
Murphy, Leigh ;
Ellis, Ian ;
Purushotham, Arnie ;
Borresen-Dale, Anne-Lise ;
Brenton, James D. ;
Tavare, Simon ;
Caldas, Carlos ;
Aparicio, Samuel .
NATURE, 2012, 486 (7403) :346-352
[6]   Identifying Cancer Subtypes Using a Residual Graph Convolution Model on a Sample Similarity Network [J].
Dai, Wei ;
Yue, Wenhao ;
Peng, Wei ;
Fu, Xiaodong ;
Liu, Li ;
Liu, Lijun .
GENES, 2022, 13 (01)
[7]   Cancer heterogeneity: implications for targeted therapeutics [J].
Fisher, R. ;
Pusztai, L. ;
Swanton, C. .
BRITISH JOURNAL OF CANCER, 2013, 108 (03) :479-485
[8]   Performance Comparison of Deep Learning Autoencoders for Cancer Subtype Detection Using Multi-Omics Data [J].
Franco, Edian F. ;
Rana, Pratip ;
Cruz, Aline ;
Calderon, Victor V. ;
Azevedo, Vasco ;
Ramos, Rommel T. J. ;
Ghosh, Preetam .
CANCERS, 2021, 13 (09)
[9]   An Encoding-Decoding Framework Based on CNN for circRNA-RBP Binding Sites Prediction [J].
Guo, Yajing ;
Lei, Xiujuan ;
Pan, Yi .
CHINESE JOURNAL OF ELECTRONICS, 2024, 33 (01) :256-263
[10]  
Hamilton WL, 2017, ADV NEUR IN, V30