Integration of multiomics data with graph convolutional networks to identify new cancer genes and their associated molecular mechanisms

被引:109
|
作者
Schulte-Sasse, Roman [1 ]
Budach, Stefan [1 ]
Hnisz, Denes [1 ]
Marsico, Annalisa [1 ,2 ]
机构
[1] Max Planck Inst Mol Genet, Berlin, Germany
[2] German Res Ctr Environm Hlth, Helmholtz Zentrum Munich, Inst Computat Biol, Munich, Germany
关键词
Convolution - Diseases - Alkylation - Learning systems - Backpropagation - Proteins;
D O I
10.1038/s42256-021-00325-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identifying cancer driver genes from high-throughput genomic data is an important task to understand the molecular basis of cancer and to develop new treatments including precision medicine. To tackle this challenge, EMOGI, a new deep learning method based on graph convolutional networks is developed, which combines protein-protein interaction networks with multiomics datasets. The increase in available high-throughput molecular data creates computational challenges for the identification of cancer genes. Genetic as well as non-genetic causes contribute to tumorigenesis, and this necessitates the development of predictive models to effectively integrate different data modalities while being interpretable. We introduce EMOGI, an explainable machine learning method based on graph convolutional networks to predict cancer genes by combining multiomics pan-cancer data-such as mutations, copy number changes, DNA methylation and gene expression-together with protein-protein interaction (PPI) networks. EMOGI was on average more accurate than other methods across different PPI networks and datasets. We used layer-wise relevance propagation to stratify genes according to whether their classification was driven by the interactome or any of the omics levels, and to identify important modules in the PPI network. We propose 165 novel cancer genes that do not necessarily harbour recurrent alterations but interact with known cancer genes, and we show that they correspond to essential genes from loss-of-function screens. We believe that our method can open new avenues in precision oncology and be applied to predict biomarkers for other complex diseases.
引用
收藏
页码:513 / +
页数:16
相关论文
共 50 条
  • [1] Integration of multiomics data with graph convolutional networks to identify new cancer genes and their associated molecular mechanisms
    Roman Schulte-Sasse
    Stefan Budach
    Denes Hnisz
    Annalisa Marsico
    Nature Machine Intelligence, 2021, 3 : 513 - 526
  • [2] SUPREME: multiomics data integration using graph convolutional networks
    Kesimoglu, Ziynet Nesibe
    Bozdag, Serdar
    NAR GENOMICS AND BIOINFORMATICS, 2023, 5 (02)
  • [3] Identification of Cancer Driver Genes by Integrating Multiomics Data with Graph Neural Networks
    Song, Hongzhi
    Yin, Chaoyi
    Li, Zhuopeng
    Feng, Ke
    Cao, Yangkun
    Gu, Yujie
    Sun, Huiyan
    METABOLITES, 2023, 13 (03)
  • [4] Integration Of Multi-Omics Data With Protein-Protein Interaction Networks To Identify New Osteoporosis Related Genes And Their Associated Molecular Mechanisms
    Liu, Anqi
    Su, Kuan-Jui
    Greenbaum, Jonathan
    Zhang, Xiao
    Qiu, Chuan
    Tian, Qing
    Shen, Hui
    Deng, Hong-Wen
    JOURNAL OF BONE AND MINERAL RESEARCH, 2023, 38 : 233 - 233
  • [5] A Multiomics Graph Database System for Biological Data Integration and Cancer Informatics
    Thapa, Ishwor
    Ali, Hesham
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2021, 28 (02) : 209 - 219
  • [6] Graph Convolutional Networks Improve the Prediction of Cancer Driver Genes
    Schulte-Sasse, Roman
    Budach, Stefan
    Hnisz, Denes
    Marsico, Annalisa
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 : 658 - 668
  • [7] Cancer Molecular Subtype Classification by Graph Convolutional Networks on Multi-omics Data
    Li, Bingjun
    Wang, Tianyu
    Nabavi, Sheida
    12TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS (ACM-BCB 2021), 2021,
  • [8] Graph Convolutional Networks Based Multi-modal Data Integration for Breast Cancer Survival Prediction
    Hu, Hongbin
    Liang, Wenbin
    Zou, Xitao
    Zou, Xianchun
    ADVANCED INTELLIGENT COMPUTING IN BIOINFORMATICS, PT I, ICIC 2024, 2024, 14881 : 85 - 98
  • [9] DeepMoIC: multi-omics data integration via deep graph convolutional networks for cancer subtype classification
    Wu, Jiecheng
    Chen, Zhaoliang
    Xiao, Shunxin
    Liu, Genggeng
    Wu, Wenjie
    Wang, Shiping
    BMC GENOMICS, 2024, 25 (01):
  • [10] A universal framework for single-cell multi-omics data integration with graph convolutional networks
    Gao, Hongli
    Zhang, Bin
    Liu, Long
    Li, Shan
    Gao, Xin
    Yu, Bin
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (03)