Using deep mutational scanning to benchmark variant effect predictors and identify disease mutations

被引:98
作者
Livesey, Benjamin J. [1 ]
Marsh, Joseph A. [1 ]
机构
[1] Univ Edinburgh, Inst Genet & Mol Med, Human Genet Unit, MRC, Edinburgh, Midlothian, Scotland
基金
英国医学研究理事会;
关键词
missense mutations; phenotype prediction; protein structure; saturation mutagenesis; variant effect; THIAMINE PYROPHOSPHOKINASE DEFICIENCY; MISSENSE VARIANTS; NOVO MUTATIONS; CONSEQUENCES; ANNOTATIONS; SPECTRUM; TOOLS;
D O I
10.15252/msb.20199380
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
To deal with the huge number of novel protein-coding variants identified by genome and exome sequencing studies, many computational variant effect predictors (VEPs) have been developed. Such predictors are often trained and evaluated using different variant data sets, making a direct comparison between VEPs difficult. In this study, we use 31 previously published deep mutational scanning (DMS) experiments, which provide quantitative, independent phenotypic measurements for large numbers of single amino acid substitutions, in order to benchmark and compare 46 different VEPs. We also evaluate the ability of DMS measurements and VEPs to discriminate between pathogenic and benign missense variants. We find that DMS experiments tend to be superior to the top-ranking predictors, demonstrating the tremendous potential of DMS for identifying novel human disease mutations. Among the VEPs, DeepSequence clearly stood out, showing both the strongest correlations with DMS data and having the best ability to predict pathogenic mutations, which is especially remarkable given that it is an unsupervised method. We further recommend SNAP2, DEOGEN2, SNPs&GO, SuSPect and REVEL based upon their performance in these analyses.
引用
收藏
页数:12
相关论文
共 66 条
[1]   Protein Model Discrimination Using Mutational Sensitivity Derived from Deep Sequencing [J].
Adkar, Bharat V. ;
Tripathi, Arti ;
Sahoo, Anusmita ;
Bajaj, Kanika ;
Goswami, Devrishi ;
Chakrabarti, Purbani ;
Swarnkar, Mohit K. ;
Gokhale, Rajesh S. ;
Varadarajan, Raghavan .
STRUCTURE, 2012, 20 (02) :371-381
[2]   A method and server for predicting damaging missense mutations [J].
Adzhubei, Ivan A. ;
Schmidt, Steffen ;
Peshkin, Leonid ;
Ramensky, Vasily E. ;
Gerasimova, Anna ;
Bork, Peer ;
Kondrashov, Alexey S. ;
Sunyaev, Shamil R. .
NATURE METHODS, 2010, 7 (04) :248-249
[3]   De novo mutations in epileptic encephalopathies [J].
Allen, Andrew S. ;
Berkovic, Samuel F. ;
Cossette, Patrick ;
Delanty, Norman ;
Dlugos, Dennis ;
Eichler, Evan E. ;
Epstein, Michael P. ;
Glauser, Tracy ;
Goldstein, David B. ;
Han, Yujun ;
Heinzen, Erin L. ;
Hitomi, Yuki ;
Howell, Katherine B. ;
Johnson, Michael R. ;
Kuzniecky, Ruben ;
Lowenstein, Daniel H. ;
Lu, Yi-Fan ;
Madou, Maura R. Z. ;
Marson, Anthony G. ;
Mefford, Heather C. ;
Nieh, Sahar Esmaeeli ;
O'Brien, Terence J. ;
Ottman, Ruth ;
Petrovski, Slave ;
Poduri, Annapurna ;
Ruzzo, Elizabeth K. ;
Scheffer, Ingrid E. ;
Sherr, Elliott H. ;
Yuskaitis, Christopher J. ;
Abou-Khalil, Bassel ;
Alldredge, Brian K. ;
Bautista, Jocelyn F. ;
Berkovic, Samuel F. ;
Boro, Alex ;
Cascino, Gregory D. ;
Consalvo, Damian ;
Crumrine, Patricia ;
Devinsky, Orrin ;
Dlugos, Dennis ;
Epstein, Michael P. ;
Fiol, Miguel ;
Fountain, Nathan B. ;
French, Jacqueline ;
Friedman, Daniel ;
Geller, Eric B. ;
Glauser, Tracy ;
Glynn, Simon ;
Haut, Sheryl R. ;
Hayward, Jean ;
Helmers, Sandra L. .
NATURE, 2013, 501 (7466) :217-+
[4]  
[Anonymous], 2014, J MACHINE LEARNING R
[5]   Deconstruction of the Ras switching cycle through saturation mutagenesis [J].
Bandaru, Pradeep ;
Shah, Neel H. ;
Bhattacharyya, Moitrayee ;
Barton, John P. ;
Kondo, Yasushi ;
Cofsky, Joshua C. ;
Gee, Christine L. ;
Chakraborty, Arup K. ;
Kortemme, Tanja ;
Ranganathan, Rama ;
Kuriyan, John .
ELIFE, 2017, 6
[6]   Expanding the clinical and molecular spectrum of thiamine pyrophosphokinase deficiency: A treatable neurological disorder caused by TPK1 mutations [J].
Banka, Siddharth ;
de Goede, Christian ;
Yue, Wyatt W. ;
Morris, Andrew A. M. ;
Von Bremen, Beate ;
Chandler, Kate E. ;
Feichtinger, Rene G. ;
Hart, Claire ;
Khan, Nasaim ;
Lunzer, Verena ;
Matakovic, Lavinija ;
Marquardt, Thorsten ;
Makowski, Christine ;
Prokisch, Holger ;
Debus, Otfried ;
Nosaka, Kazuto ;
Sonwalkar, Hemant ;
Zimmermann, Franz A. ;
Sperl, Wolfgang ;
Mayr, Johannes A. .
MOLECULAR GENETICS AND METABOLISM, 2014, 113 (04) :301-306
[7]   The role of protein complexes in human genetic disease [J].
Bergendahl, L. Therese ;
Gerasimavicius, Lukas ;
Miles, Jamilla ;
Macdonald, Lewis ;
Wells, Jonathan N. ;
Welburn, Julie P., I ;
Marsh, Joseph A. .
PROTEIN SCIENCE, 2019, 28 (08) :1400-1411
[8]   WS-SNPs& GO: a web server for predicting the deleterious effect of human protein variants using functional annotation [J].
Capriotti, Emidio ;
Calabrese, Remo ;
Fariselli, Piero ;
Martelli, Pier Luigi ;
Altman, Russ B. ;
Casadio, Rita .
BMC GENOMICS, 2013, 14
[9]   Improving the prediction of disease-related variants using protein three-dimensional structure [J].
Capriotti, Emidio ;
Altman, Russ B. .
BMC BIOINFORMATICS, 2011, 12
[10]   Performance of in silico tools for the evaluation of p16INK4a (CDKN2A) variants in CAGI [J].
Carraro, Marco ;
Minervini, Giovanni ;
Giollo, Manuel ;
Bromberg, Yana ;
Capriotti, Emidio ;
Casadio, Rita ;
Dunbrack, Roland ;
Elefanti, Lisa ;
Fariselli, Pietro ;
Ferrari, Carlo ;
Gough, Julian ;
Katsonis, Panagiotis ;
Leonardi, Emanuela ;
Lichtarge, Olivier ;
Menin, Chiara ;
Martelli, Pier Luigi ;
Niroula, Abhishek ;
Pal, Lipika R. ;
Repo, Susanna ;
Scaini, Maria Chiara ;
Vihinen, Mauno ;
Wei, Qiong ;
Xu, Qifang ;
Yang, Yuedong ;
Yin, Yizhou ;
Zaucha, Jan ;
Zhao, Huiying ;
Zhou, Yaoqi ;
Brenner, Steven E. ;
Moult, John ;
Tosatto, Silvio C. E. .
HUMAN MUTATION, 2017, 38 (09) :1042-1050