UMD-Predictor: A High-Throughput Sequencing Compliant System for Pathogenicity Prediction of any Human cDNA Substitution

被引:105
作者
Salgado, David [1 ,2 ]
Desvignes, Jean-Pierre [1 ,2 ]
Rai, Ghadi [1 ,2 ]
Blanchard, Arnaud [1 ,2 ]
Miltgen, Morgane [1 ,2 ]
Pinard, Amelie [1 ,2 ]
Levy, Nicolas [1 ,2 ,3 ]
Collod-Beroud, Gwenaelle [1 ,2 ]
Beroud, Christophe [1 ,2 ,3 ]
机构
[1] Aix Marseille Univ, GMGF, F-13385 Marseille, France
[2] Hop Enfants La Timone, INSERM, UMR S 910, F-13385 Marseille, France
[3] Hop Enfants La Timone, APHM, Mol Genet Lab, F-13385 Marseille, France
关键词
pathogenicity prediction; mutation; bioinformatics; NGS; substitution; synonymous; nonsynonymous; nonsense; GENOME; DATABASE; INHERITANCE; GENES; TOOL;
D O I
10.1002/humu.22965
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Whole-exome sequencing (WES) is increasingly applied to research and clinical diagnosis of human diseases. It typically results in large amounts of genetic variations. Depending on the mode of inheritance, only one or two correspond to pathogenic mutations responsible for the disease and present in affected individuals. Therefore, it is crucial to filter out nonpathogenic variants and limit downstream analysis to a handful of candidate mutations. We have developed a new computational combinatorial system UMD-Predictor () to efficiently annotate cDNA substitutions of all human transcripts for their potential pathogenicity. It combines biochemical properties, impact on splicing signals, localization in protein domains, variation frequency in the global population, and conservation through the BLOSUM62 global substitution matrix and a protein-specific conservation among 100 species. We compared its accuracy with the seven most used and reliable prediction tools, using the largest reference variation datasets including more than 140,000 annotated variations. This system consistently demonstrated a better accuracy, specificity, Matthews correlation coefficient, diagnostic odds ratio, speed, and provided the shortest list of candidate mutations for WES. Webservices allow its implementation in any bioinformatics pipeline for next-generation sequencing analysis. It could benefit to a wide range of users and applications varying from gene discovery to clinical diagnosis.
引用
收藏
页码:439 / 446
页数:8
相关论文
共 33 条
[1]   A method and server for predicting damaging missense mutations [J].
Adzhubei, Ivan A. ;
Schmidt, Steffen ;
Peshkin, Leonid ;
Ramensky, Vasily E. ;
Gerasimova, Anna ;
Bork, Peer ;
Kondrashov, Alexey S. ;
Sunyaev, Shamil R. .
NATURE METHODS, 2010, 7 (04) :248-249
[2]  
Agarwala R, 2015, NUCLEIC ACIDS RES, V43, pD6, DOI [10.1093/nar/gku1130, 10.1093/nar/gkv1290]
[3]   Activities at the Universal Protein Resource (UniProt) [J].
Apweiler, Rolf ;
Bateman, Alex ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Alam-Faruque, Yasmin ;
Alpi, Emanuele ;
Antunes, Ricardo ;
Arganiska, Joanna ;
Casanova, Elisabet Barrera ;
Bely, Benoit ;
Bingley, Mark ;
Bonilla, Carlos ;
Britto, Ramona ;
Bursteinas, Borisas ;
Chan, Wei Mun ;
Chavali, Gayatri ;
Cibrian-Uhalte, Elena ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Fazzini, Francesco ;
Gane, Paul ;
Castro, Leyla Garcia ;
Garmiri, Penelope ;
Hatton-Ellis, Emma ;
Hieta, Reija ;
Huntley, Rachael ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
MacDougall, Alistair ;
Mutowo, Prudence ;
Nightingale, Andrew ;
Orchard, Sandra ;
Pichler, Klemens ;
Poggioli, Diego ;
Pundir, Sangya ;
Pureza, Luis ;
Qi, Guoying ;
Rosanoff, Steven ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Turner, Edward ;
Volynkin, Vladimir ;
Wardell, Tony ;
Watkins, Xavier ;
Zellner, Hermann ;
Corbett, Matt .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D191-D198
[4]   PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations [J].
Bendl, Jaroslav ;
Stourac, Jan ;
Salanda, Ondrej ;
Pavelka, Antonin ;
Wieben, Eric D. ;
Zendulka, Jaroslav ;
Brezovsky, Jan ;
Damborsky, Jiri .
PLOS COMPUTATIONAL BIOLOGY, 2014, 10 (01)
[5]   SNAP: predict effect of non-synonymous polymorphisms on function [J].
Bromberg, Yana ;
Rost, Burkhard .
NUCLEIC ACIDS RESEARCH, 2007, 35 (11) :3823-3835
[6]  
Choi YH, 2012, PLOS ONE, V7, DOI [10.1371/journal.pone.0039927, 10.1371/journal.pone.0046688]
[7]   Ensembl 2015 [J].
Cunningham, Fiona ;
Amode, M. Ridwan ;
Barrell, Daniel ;
Beal, Kathryn ;
Billis, Konstantinos ;
Brent, Simon ;
Carvalho-Silva, Denise ;
Clapham, Peter ;
Coates, Guy ;
Fitzgerald, Stephen ;
Gil, Laurent ;
Giron, Carlos Garcia ;
Gordon, Leo ;
Hourlier, Thibaut ;
Hunt, Sarah E. ;
Janacek, Sophie H. ;
Johnson, Nathan ;
Juettemann, Thomas ;
Kaehaeri, Andreas K. ;
Keenan, Stephen ;
Martin, Fergal J. ;
Maurel, Thomas ;
McLaren, William ;
Murphy, Daniel N. ;
Nag, Rishi ;
Overduin, Bert ;
Parker, Anne ;
Patricio, Mateus ;
Perry, Emily ;
Pignatelli, Miguel ;
Riat, Harpreet Singh ;
Sheppard, Daniel ;
Taylor, Kieron ;
Thormann, Anja ;
Vullo, Alessandro ;
Wilder, Steven P. ;
Zadissa, Amonida ;
Aken, Bronwen L. ;
Birney, Ewan ;
Harrow, Jennifer ;
Kinsella, Rhoda ;
Muffato, Matthieu ;
Ruffier, Magali ;
Searle, Stephen M. J. ;
Spudich, Giulietta ;
Trevanion, Stephen J. ;
Yates, Andy ;
Zerbino, Daniel R. ;
Flicek, Paul .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D662-D669
[8]   SNPeffect 4.0: on-line prediction of molecular and structural effects of protein-coding variants [J].
De Baets, Greet ;
Van Durme, Joost ;
Reumers, Joke ;
Maurer-Stroh, Sebastian ;
Vanhee, Peter ;
Dopazo, Joaquin ;
Schymkowitz, Joost ;
Rousseau, Frederic .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D935-D939
[9]   Human Splicing Finder: an online bioinformatics tool to predict splicing signals [J].
Desmet, Francois-Olivier ;
Hamroun, Dalil ;
Lalande, Marine ;
Collod-Beroud, Gwenaelle ;
Claustres, Mireille ;
Beroud, Christophe .
NUCLEIC ACIDS RESEARCH, 2009, 37 (09)
[10]   UMD-Predictor, a New Prediction Tool for Nucleotide Substitution Pathogenicity-Application to Four Genes: FBN1, FBN2, TGFBR1, and TGFBR2 [J].
Frederic, Melissa Yana ;
Lalande, Marine ;
Boileau, Catherine ;
Hamroun, Dalil ;
Claustres, Mireille ;
Beroud, Christophe ;
Collod-Beroud, Gwenaelle .
HUMAN MUTATION, 2009, 30 (06) :952-959