Integration of multiomics data with graph convolutional networks to identify new cancer genes and their associated molecular mechanisms

被引:109
|
作者
Schulte-Sasse, Roman [1 ]
Budach, Stefan [1 ]
Hnisz, Denes [1 ]
Marsico, Annalisa [1 ,2 ]
机构
[1] Max Planck Inst Mol Genet, Berlin, Germany
[2] German Res Ctr Environm Hlth, Helmholtz Zentrum Munich, Inst Computat Biol, Munich, Germany
关键词
Convolution - Diseases - Alkylation - Learning systems - Backpropagation - Proteins;
D O I
10.1038/s42256-021-00325-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identifying cancer driver genes from high-throughput genomic data is an important task to understand the molecular basis of cancer and to develop new treatments including precision medicine. To tackle this challenge, EMOGI, a new deep learning method based on graph convolutional networks is developed, which combines protein-protein interaction networks with multiomics datasets. The increase in available high-throughput molecular data creates computational challenges for the identification of cancer genes. Genetic as well as non-genetic causes contribute to tumorigenesis, and this necessitates the development of predictive models to effectively integrate different data modalities while being interpretable. We introduce EMOGI, an explainable machine learning method based on graph convolutional networks to predict cancer genes by combining multiomics pan-cancer data-such as mutations, copy number changes, DNA methylation and gene expression-together with protein-protein interaction (PPI) networks. EMOGI was on average more accurate than other methods across different PPI networks and datasets. We used layer-wise relevance propagation to stratify genes according to whether their classification was driven by the interactome or any of the omics levels, and to identify important modules in the PPI network. We propose 165 novel cancer genes that do not necessarily harbour recurrent alterations but interact with known cancer genes, and we show that they correspond to essential genes from loss-of-function screens. We believe that our method can open new avenues in precision oncology and be applied to predict biomarkers for other complex diseases.
引用
收藏
页码:513 / +
页数:16
相关论文
共 50 条
  • [31] Human tumor-associated viruses and new insights into the molecular mechanisms of cancer
    Martin D.
    Gutkind J.S.
    Oncogene, 2008, 27 (Suppl 2) : S31 - S42
  • [32] Spatial-Temporal Synchronous Graph Convolutional Networks: A New Framework for Spatial-Temporal Network Data Forecasting
    Song, Chao
    Lin, Youfang
    Guo, Shengnan
    Wan, Huaiyu
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 914 - 921
  • [33] Knowledge graph integration of germline, primary cancer and cancer genetic dependency data prioritizes new target candidates.
    Dutkowski, Janusz
    Bielecki, Radoslaw
    Nienaltowski, Karol
    Kukielka, Michal
    Ronen, Roy
    CANCER RESEARCH, 2022, 82 (12)
  • [34] Explaining decisions of graph convolutional neural networks: patient-specific molecular subnetworks responsible for metastasis prediction in breast cancer
    Hryhorii Chereda
    Annalen Bleckmann
    Kerstin Menck
    Júlia Perera-Bel
    Philip Stegmaier
    Florian Auer
    Frank Kramer
    Andreas Leha
    Tim Beißbarth
    Genome Medicine, 13
  • [35] Explaining decisions of graph convolutional neural networks: patient-specific molecular subnetworks responsible for metastasis prediction in breast cancer
    Chereda, Hryhorii
    Bleckmann, Annalen
    Menck, Kerstin
    Perera-Bel, Julia
    Stegmaier, Philip
    Auer, Florian
    Kramer, Frank
    Leha, Andreas
    Beissbarth, Tim
    GENOME MEDICINE, 2021, 13 (01)
  • [36] Integrated microRNA and mRNA data sets identify molecular pathways associated with ovarian cancer platinum response
    Bansal, N.
    Cragun, J.
    Xiong, Y.
    Humphrey, M. M.
    Karnath, S.
    Wenham, R. M.
    Apte, S. M.
    Hakam, A.
    Chen, D.
    Lancaster, J. M.
    GYNECOLOGIC ONCOLOGY, 2009, 112 (02) : S113 - S113
  • [37] Identifying new cancer genes based on the integration of annotated gene sets via hypergraph neural networks
    Deng, Chao
    Li, Hong-Dong
    Zhang, Li-Shen
    Liu, Yiwei
    Li, Yaohang
    Wang, Jianxin
    BIOINFORMATICS, 2024, 40 : i511 - i520
  • [38] Integration of Graph Neural Networks and multi-omics analysis identify the predictive factor and key gene for immunotherapy response and prognosis of bladder cancer
    Shuai Ren
    Yongjian Lu
    Guangping Zhang
    Ke Xie
    Danni Chen
    Xiangna Cai
    Maodong Ye
    Journal of Translational Medicine, 22 (1)
  • [39] Multiomics Data Integration Identifies New Molecular Signatures for Abdominal Aortic Aneurysm and Aortic Occlusive Disease: Implications for Early Diagnosis, Prognosis, and Therapeutic Targets
    Kori, Medi
    Cig, Defne
    Arga, Kazim Yalcin
    Kasavi, Ceyda
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2022, 26 (05) : 290 - 304
  • [40] MOHGCN: A trustworthy multi-omics data integration framework based on specificity-aware heterogeneous graph convolutional neural networks for disease diagnosis
    Wu, Wenhao
    Wang, Shudong
    Zhang, Yuanyuan
    Zhang, Kuijie
    Yin, Wenjing
    Pang, Shanchen
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 263