Improving cancer driver gene identification using multi-task learning on graph convolutional network

被引:69
作者
Peng, Wei [1 ]
Tang, Qi [1 ]
Dai, Wei [1 ]
Chen, Tielin [1 ]
机构
[1] Kunming Univ Sci & Technol, Kunming, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
cancer driver genes; cancer genes; graph convolutional neural network; multi-task learning; EXPRESSION; MUTATIONS; PATHWAYS;
D O I
10.1093/bib/bbab432
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Cancer is thought to be caused by the accumulation of driver genetic mutations. Therefore, identifying cancer driver genes plays a crucial role in understanding the molecular mechanism of cancer and developing precision therapies and biomarkers. In this work, we propose a Multi-Task learning method, called MTGCN, based on the Graph Convolutional Network to identify cancer driver genes. First, we augment gene features by introducing their features on the protein-protein interaction (PPI) network. After that, the multi-task learning framework propagates and aggregates nodes and graph features from input to next layer to learn node embedding features, simultaneously optimizing the node prediction task and the link prediction task. Finally, we use a Bayesian task weight learner to balance the two tasks automatically. The outputs of MTGCN assign each gene a probability of being a cancer driver gene. Our method and the other four existing methods are applied to predict cancer drivers for pan-cancer and some single cancer types. The experimental results show that our model shows outstanding performance compared with the state-of-the-art methods in terms of the area under the Receiver Operating Characteristic (ROC) curves and the area under the precision-recall curves. The MTGCN is freely available via https://github.com/weiba/MTGCN.
引用
收藏
页数:12
相关论文
共 47 条
[11]  
Hamilton WL, 2017, ADV NEUR IN, V30
[12]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[13]   DawnRank: discovering personalized driver genes in cancer [J].
Hou, Jack P. ;
Ma, Jian .
GENOME MEDICINE, 2014, 6
[14]  
Huang W, 2020, INT C LEARN REPR
[15]   Adjusting batch effects in microarray expression data using empirical Bayes methods [J].
Johnson, W. Evan ;
Li, Cheng ;
Rabinovic, Ariel .
BIOSTATISTICS, 2007, 8 (01) :118-127
[16]   ConsensusPathDB: toward a more complete picture of cell biology [J].
Kamburov, Atanas ;
Pentchev, Konstantin ;
Galicka, Hanna ;
Wierling, Christoph ;
Lehrach, Hans ;
Herwig, Ralf .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D712-D717
[17]   A systematic view of computational methods for identifying driver genes based on somatic mutation data [J].
Kan, Yingxin ;
Jiang, Limin ;
Tang, Jijun ;
Guo, Yan ;
Guo, Fei .
BRIEFINGS IN FUNCTIONAL GENOMICS, 2021, 20 (05) :333-343
[18]   Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics [J].
Kendall, Alex ;
Gal, Yarin ;
Cipolla, Roberto .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7482-7491
[19]  
Kipf Thomas N., 2017, P 5 INT C LEARNING
[20]   Discovery and saturation analysis of cancer genes across 21 tumour types [J].
Lawrence, Michael S. ;
Stojanov, Petar ;
Mermel, Craig H. ;
Robinson, James T. ;
Garraway, Levi A. ;
Golub, Todd R. ;
Meyerson, Matthew ;
Gabriel, Stacey B. ;
Lander, Eric S. ;
Getz, Gad .
NATURE, 2014, 505 (7484) :495-+