Integration of multiomics data with graph convolutional networks to identify new cancer genes and their associated molecular mechanisms

被引:109
|
作者
Schulte-Sasse, Roman [1 ]
Budach, Stefan [1 ]
Hnisz, Denes [1 ]
Marsico, Annalisa [1 ,2 ]
机构
[1] Max Planck Inst Mol Genet, Berlin, Germany
[2] German Res Ctr Environm Hlth, Helmholtz Zentrum Munich, Inst Computat Biol, Munich, Germany
关键词
Convolution - Diseases - Alkylation - Learning systems - Backpropagation - Proteins;
D O I
10.1038/s42256-021-00325-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identifying cancer driver genes from high-throughput genomic data is an important task to understand the molecular basis of cancer and to develop new treatments including precision medicine. To tackle this challenge, EMOGI, a new deep learning method based on graph convolutional networks is developed, which combines protein-protein interaction networks with multiomics datasets. The increase in available high-throughput molecular data creates computational challenges for the identification of cancer genes. Genetic as well as non-genetic causes contribute to tumorigenesis, and this necessitates the development of predictive models to effectively integrate different data modalities while being interpretable. We introduce EMOGI, an explainable machine learning method based on graph convolutional networks to predict cancer genes by combining multiomics pan-cancer data-such as mutations, copy number changes, DNA methylation and gene expression-together with protein-protein interaction (PPI) networks. EMOGI was on average more accurate than other methods across different PPI networks and datasets. We used layer-wise relevance propagation to stratify genes according to whether their classification was driven by the interactome or any of the omics levels, and to identify important modules in the PPI network. We propose 165 novel cancer genes that do not necessarily harbour recurrent alterations but interact with known cancer genes, and we show that they correspond to essential genes from loss-of-function screens. We believe that our method can open new avenues in precision oncology and be applied to predict biomarkers for other complex diseases.
引用
收藏
页码:513 / +
页数:16
相关论文
共 50 条
  • [41] Machine Learning-Assisted Network Inference Approach to Identify a New Class of Genes that Coordinate the Functionality of Cancer Networks
    Bari, Mehrab Ghanat
    Ung, Choong Yong
    Zhang, Cheng
    Zhu, Shizhen
    Li, Hu
    SCIENTIFIC REPORTS, 2017, 7
  • [42] Clustering and machine learning-based integration identify cancer associated fibroblasts genes' signature in head and neck squamous cell carcinoma
    Wang, Qiwei
    Zhao, Yinan
    Wang, Fang
    Tan, Guolin
    FRONTIERS IN GENETICS, 2023, 14
  • [43] Machine Learning-Assisted Network Inference Approach to Identify a New Class of Genes that Coordinate the Functionality of Cancer Networks
    Mehrab Ghanat Bari
    Choong Yong Ung
    Cheng Zhang
    Shizhen Zhu
    Hu Li
    Scientific Reports, 7
  • [44] Differential Proteomics Data Integration Reveals Anxiety-associated Molecular and Cellular Mechanisms in Cingulate Cortex Synapses.
    Iris, F.
    Filiou, M.
    Turck, C.
    EUROPEAN PSYCHIATRY, 2015, 30
  • [45] The integration of transcriptome-wide association study and mRNA expression profiling data to identify candidate genes and gene sets associated with dental caries
    Tong, Xiangyao
    Hou, Siyu
    Ma, Mei
    Zhang, Lu
    Zou, Rui
    Hou, Tiezhou
    Niu, Lin
    ARCHIVES OF ORAL BIOLOGY, 2020, 118
  • [46] Molecular changes in the EGFR pathway and RAS/RAF genes identify subtypes of metastatic and nonmetastatic colorectal cancer associated with different outcomes
    Bahnassy, Abeer A.
    Salem, Salem E.
    Hussein, Nehal
    Yousif, Hend F.
    Al-Desouky, Marwa Iman
    Zekri, Abdel-Rahman N.
    CANCER RESEARCH, 2015, 75
  • [47] Endometriosis and endometriosis-associated cancers: new insights into the molecular mechanisms of ovarian cancer development
    Dawson, Amy
    Llaurado Fernandez, Marta
    Anglesio, Michael
    Yong, Paul J.
    Carey, Mark S.
    ECANCERMEDICALSCIENCE, 2018, 12
  • [48] Multi-omics data integration and modeling unravels new mechanisms for pancreatic cancer and improves prognostic prediction
    Fraunhoffer, Nicolas A.
    Abuelafia, Analia Meilerman
    Bigonnet, Martin
    Gayet, Odile
    Roques, Julie
    Nicolle, Remy
    Lomberk, Gwen
    Urrutia, Raul
    Dusetti, Nelson
    Iovanna, Juan
    NPJ PRECISION ONCOLOGY, 2022, 6 (01)
  • [49] Multi-omics data integration and modeling unravels new mechanisms for pancreatic cancer and improves prognostic prediction
    Nicolas A. Fraunhoffer
    Analía Meilerman Abuelafia
    Martin Bigonnet
    Odile Gayet
    Julie Roques
    Remy Nicolle
    Gwen Lomberk
    Raul Urrutia
    Nelson Dusetti
    Juan Iovanna
    npj Precision Oncology, 6
  • [50] A semi-supervised approach for the integration of multi-omics data based on transformer multi-head self-attention mechanism and graph convolutional networks
    Wang, Jiahui
    Liao, Nanqing
    Du, Xiaofei
    Chen, Qingfeng
    Wei, Bizhong
    BMC GENOMICS, 2024, 25 (01)