Improving cancer driver gene identification using multi-task learning on graph convolutional network

被引:62
作者
Peng, Wei [1 ]
Tang, Qi [1 ]
Dai, Wei [1 ]
Chen, Tielin [1 ]
机构
[1] Kunming Univ Sci & Technol, Kunming, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
cancer driver genes; cancer genes; graph convolutional neural network; multi-task learning; EXPRESSION; MUTATIONS; PATHWAYS;
D O I
10.1093/bib/bbab432
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Cancer is thought to be caused by the accumulation of driver genetic mutations. Therefore, identifying cancer driver genes plays a crucial role in understanding the molecular mechanism of cancer and developing precision therapies and biomarkers. In this work, we propose a Multi-Task learning method, called MTGCN, based on the Graph Convolutional Network to identify cancer driver genes. First, we augment gene features by introducing their features on the protein-protein interaction (PPI) network. After that, the multi-task learning framework propagates and aggregates nodes and graph features from input to next layer to learn node embedding features, simultaneously optimizing the node prediction task and the link prediction task. Finally, we use a Bayesian task weight learner to balance the two tasks automatically. The outputs of MTGCN assign each gene a probability of being a cancer driver gene. Our method and the other four existing methods are applied to predict cancer drivers for pan-cancer and some single cancer types. The experimental results show that our model shows outstanding performance compared with the state-of-the-art methods in terms of the area under the Receiver Operating Characteristic (ROC) curves and the area under the precision-recall curves. The MTGCN is freely available via https://github.com/weiba/MTGCN.
引用
收藏
页数:12
相关论文
共 47 条
[1]   Signatures of mutational processes in human cancer [J].
Alexandrov, Ludmil B. ;
Nik-Zainal, Serena ;
Wedge, David C. ;
Aparicio, Samuel A. J. R. ;
Behjati, Sam ;
Biankin, Andrew V. ;
Bignell, Graham R. ;
Bolli, Niccolo ;
Borg, Ake ;
Borresen-Dale, Anne-Lise ;
Boyault, Sandrine ;
Burkhardt, Birgit ;
Butler, Adam P. ;
Caldas, Carlos ;
Davies, Helen R. ;
Desmedt, Christine ;
Eils, Roland ;
Eyfjord, Jorunn Erla ;
Foekens, John A. ;
Greaves, Mel ;
Hosoda, Fumie ;
Hutter, Barbara ;
Ilicic, Tomislav ;
Imbeaud, Sandrine ;
Imielinsk, Marcin ;
Jaeger, Natalie ;
Jones, David T. W. ;
Jones, David ;
Knappskog, Stian ;
Kool, Marcel ;
Lakhani, Sunil R. ;
Lopez-Otin, Carlos ;
Martin, Sancha ;
Munshi, Nikhil C. ;
Nakamura, Hiromi ;
Northcott, Paul A. ;
Pajic, Marina ;
Papaemmanuil, Elli ;
Paradiso, Angelo ;
Pearson, John V. ;
Puente, Xose S. ;
Raine, Keiran ;
Ramakrishna, Manasa ;
Richardson, Andrea L. ;
Richter, Julia ;
Rosenstiel, Philip ;
Schlesner, Matthias ;
Schumacher, Ton N. ;
Span, Paul N. ;
Teague, Jon W. .
NATURE, 2013, 500 (7463) :415-+
[2]   Epigenetic Determinants of Cancer [J].
Baylin, Stephen B. ;
Jones, Peter A. .
COLD SPRING HARBOR PERSPECTIVES IN BIOLOGY, 2016, 8 (09)
[3]  
Chakravarty D, 2017, JCO PRECIS ONCOL, V1
[4]   Advances in computational approaches for prioritizing driver mutations and significantly mutated genes in cancer genomes [J].
Cheng, Feixiong ;
Zhao, Junfei ;
Zhao, Zhongming .
BRIEFINGS IN BIOINFORMATICS, 2016, 17 (04) :642-656
[5]   MUFFINN: cancer gene discovery via network analysis of somatic mutation data [J].
Cho, Ara ;
Shim, Jung Eun ;
Kim, Eiru ;
Supek, Fran ;
Lehner, Ben ;
Lee, Insuk .
GENOME BIOLOGY, 2016, 17
[6]   Network Embedding the Protein-Protein Interaction Network for Human Essential Genes Identification [J].
Dai, Wei ;
Chang, Qi ;
Peng, Wei ;
Zhong, Jiancheng ;
Li, Yongjiang .
GENES, 2020, 11 (02)
[7]  
Defferrard M, 2016, ADV NEUR IN, V29
[8]   Lessons from the Cancer Genome [J].
Garraway, Levi A. ;
Lander, Eric S. .
CELL, 2013, 153 (01) :17-37
[9]   node2vec: Scalable Feature Learning for Networks [J].
Grover, Aditya ;
Leskovec, Jure .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :855-864
[10]   Accurate prediction of human essential genes using only nucleotide composition and association information [J].
Guo, Feng-Biao ;
Dong, Chuan ;
Hua, Hong-Li ;
Liu, Shuo ;
Luo, Hao ;
Zhang, Hong-Wan ;
Jin, Yan-Ting ;
Zhang, Kai-Yue .
BIOINFORMATICS, 2017, 33 (12) :1758-1764