Ranking non-synonymous single nucleotide polymorphisms based on disease concepts

被引:150
作者
Shihab, Hashem A. [1 ,2 ]
Gough, Julian [3 ]
Mort, Matthew [4 ]
Cooper, David N. [4 ]
Day, Ian N. M. [1 ,2 ]
Gaunt, Tom R. [1 ,2 ]
机构
[1] Univ Bristol, Bristol Ctr Syst Biomed, Bristol BS8 2BN, Avon, England
[2] Univ Bristol, MRC Integrat Epidemiol Unit, Sch Social & Community Med, Bristol BS8 2BN, Avon, England
[3] Univ Bristol, Dept Comp Sci, Bristol BS8 1UB, Avon, England
[4] Cardiff Univ, Sch Med, Inst Med Genet, Cardiff CF14 4XN, S Glam, Wales
基金
英国生物技术与生命科学研究理事会; 英国医学研究理事会;
关键词
SNV; nsSNPs; Disease causing; Disease specific; FATHMM; HMMs; SIFT; PolyPhen; Bioinformatics; FUNCTIONAL IMPACT; MUTATIONS; PREDICTION; DATABASE; TOOL; CONSEQUENCES;
D O I
10.1186/1479-7364-8-11
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
As the number of non-synonymous single nucleotide polymorphisms (nsSNPs) identified through whole-exome/whole- genome sequencing programs increases, researchers and clinicians are becoming increasingly reliant upon computational prediction algorithms designed to prioritize potential functional variants for further study. A large proportion of existing prediction algorithms are 'disease agnostic' but are nevertheless quite capable of predicting when a mutation is likely to be deleterious. However, most clinical and research applications of these algorithms relate to specific diseases and would therefore benefit from an approach that discriminates between functional variants specifically related to that disease from those which are not. In a whole-exome/whole- genome sequencing context, such an approach could substantially reduce the number of false positive candidate mutations. Here, we test this postulate by incorporating a disease-specific weighting scheme into the Functional Analysis through Hidden Markov Models (FATHMM) algorithm. When compared to traditional prediction algorithms, we observed an overall reduction in the number of false positives identified using a disease-specific approach to functional prediction across 17 distinct disease concepts/categories. Our results illustrate the potential benefits of making disease-specific predictions when prioritizing candidate variants in relation to specific diseases.
引用
收藏
页数:6
相关论文
共 26 条
[1]   A method and server for predicting damaging missense mutations [J].
Adzhubei, Ivan A. ;
Schmidt, Steffen ;
Peshkin, Leonid ;
Ramensky, Vasily E. ;
Gerasimova, Anna ;
Bork, Peer ;
Kondrashov, Alexey S. ;
Sunyaev, Shamil R. .
NATURE METHODS, 2010, 7 (04) :248-249
[2]   Classification of mismatch repair gene missense variants with PON-MMR [J].
Ali, Heidi ;
Olatubosun, Ayodeji ;
Vihinen, Mauno .
HUMAN MUTATION, 2012, 33 (04) :642-650
[3]  
Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkh131, 10.1093/nar/gkw1099]
[4]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[5]   Exome sequencing as a tool for Mendelian disease gene discovery [J].
Bamshad, Michael J. ;
Ng, Sarah B. ;
Bigham, Abigail W. ;
Tabor, Holly K. ;
Emond, Mary J. ;
Nickerson, Deborah A. ;
Shendure, Jay .
NATURE REVIEWS GENETICS, 2011, 12 (11) :745-755
[6]   Cancer-Specific High-Throughput Annotation of Somatic Mutations: Computational Prediction of Driver Missense Mutations [J].
Carter, Hannah ;
Chen, Sining ;
Isik, Leyla ;
Tyekucheva, Svitlana ;
Velculescu, Victor E. ;
Kinzler, Kenneth W. ;
Vogelstein, Bert ;
Karchin, Rachel .
CANCER RESEARCH, 2009, 69 (16) :6660-6667
[7]  
Eddy Sean R, 2009, Genome Inform, V23, P205
[8]   Improving the prediction of the functional impact of cancer mutations by baseline tolerance transformation [J].
Gonzalez-Perez, Abel ;
Deu-Pons, Jordi ;
Lopez-Bigas, Nuria .
GENOME MEDICINE, 2012, 4
[9]   Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure [J].
Gough, J ;
Karplus, K ;
Hughey, R ;
Chothia, C .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 313 (04) :903-919
[10]   CanPredict: a computational tool for predicting cancer-associated missense mutations [J].
Kaminker, Joshua S. ;
Zhang, Yan ;
Watanabe, Colin ;
Zhang, Zemin .
NUCLEIC ACIDS RESEARCH, 2007, 35 :W595-W598