MoCLIM: Towards Accurate Cancer Subtyping via Multi-Omics Contrastive Learning with Omics-Inference Modeling

被引:1
作者
Yang, Ziwei [1 ]
Chen, Zheng [2 ]
Matsubara, Yasuko [2 ]
Sakurai, Yasushi [2 ]
机构
[1] Kyoto Univ, Bioinformat Ctr, Kyoto, Japan
[2] Osaka Univ, SANKEN, Suita, Osaka, Japan
来源
PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023 | 2023年
关键词
Cancer subtypes; Multi-omics data; Contrastive learning; EPIGENOMICS; EXPRESSION; NETWORK;
D O I
10.1145/3583780.3614970
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Precision medicine fundamentally aims to establish causality between dysregulated biochemical mechanisms and cancer subtypes. Omics-based cancer subtyping has emerged as a revolutionary approach, as different level of omics records the biochemical products of multistep processes in cancers. This paper focuses on fully exploiting the potential of multi-omics data to improve cancer subtyping outcomes, and hence developed MoCLIM, a representation learning framework. MoCLIM independently extracts the informative features from distinct omics modalities. Using a unified representation informed by contrastive learning of different omics modalities, we can well-cluster the subtypes, given cancer, into a lower latent space. This contrast can be interpreted as a projection of inter-omics inference observed in biological networks. Experimental results on six cancer datasets demonstrate that our approach significantly improves data fit and subtyping performance in fewer high-dimensional cancer instances. Moreover, our framework incorporates various medical evaluations as the final component, providing high interpretability in medical analysis.
引用
收藏
页码:2895 / 2905
页数:11
相关论文
共 76 条
[1]  
Alexe G, 2007, J BIOSCIENCES, P1027
[2]   Clinical manifestations, risk factors, and maternal and perinatal outcomes of coronavirus disease 2019 in pregnancy: living systematic review and meta-analysis [J].
Allotey, John ;
Stallings, Elena ;
Bonet, Mercedes ;
Yap, Magnus ;
Chatterjee, Shaunak ;
Kew, Tania ;
Debenham, Luke ;
Llavall, Anna Clave ;
Dixit, Anushka ;
Zhou, Dengyi ;
Balaji, Rishab ;
Lee, Siang Ing ;
Qiu, Xiu ;
Yuan, Mingyang ;
Coomar, Dyuti ;
van Wely, Madelon ;
van Leeuwen, Elizabeth ;
Kostova, Elena ;
Kunst, Heinke ;
Khalil, Asma ;
Tiberi, Simon ;
Brizuela, Vanessa ;
Broutet, Nathalie ;
Kara, Edna ;
Kim, Caron Rahn ;
Thorson, Anna ;
Oladapo, Olufemi T. ;
Mofenson, Lynne ;
Zamora, Javier ;
Thangaratinam, Shakila .
BMJ-BRITISH MEDICAL JOURNAL, 2020, 370
[3]  
[Anonymous], 2019, 7 INT C LEARN REPR I
[4]  
Bachman P, 2019, ADV NEUR IN, V32
[5]   The genetics and genomics of cancer [J].
Balmain, A ;
Gray, J ;
Ponder, B .
NATURE GENETICS, 2003, 33 (Suppl 3) :238-244
[6]  
Bo Jiang, 2014, Machine Learning and Knowledge Discovery in Databases. European Conference, ECML PKDD 2014. Proceedings: LNCS 8724, P595, DOI 10.1007/978-3-662-44848-9_38
[7]   Prediction of Dynamical Properties of Biochemical Pathways with Graph Neural Networks [J].
Bove, Pasquale ;
Micheli, Alessio ;
Milazzo, Paolo ;
Podda, Marco .
PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 3: BIOINFORMATICS, 2020, :32-43
[8]  
Bray F, 2018, CA-CANCER J CLIN, V68, P394, DOI [10.3322/caac.21492, 10.3322/caac.21609]
[9]  
Cancer World Health Organization, About us
[10]   Multi-omics single-cell data integration and regulatory inference with graph-linked embedding [J].
Cao, Zhi-Jie ;
Gao, Ge .
NATURE BIOTECHNOLOGY, 2022, 40 (10) :1458-+