Prediction of Functional Regulatory SNPs in Monogenic and Complex Disease

被引:19
作者
Zhao, Yiqiang [1 ,2 ]
Clark, Wyatt T. [3 ]
Mort, Matthew [4 ]
Cooper, David N. [4 ]
Radivojac, Predrag [3 ]
Mooney, Sean D. [1 ,2 ]
机构
[1] Buck Inst Res Aging, Novato, CA 94945 USA
[2] Indiana Univ Sch Med, Dept Med & Mol Genet, Indianapolis, IN USA
[3] Indiana Univ, Sch Informat & Comp, Bloomington, IN USA
[4] Cardiff Univ, Sch Med, Inst Med Genet, Cardiff, S Glam, Wales
关键词
regulatory mutations; machine learning; monogenic disease; complex disease; single nucleotide polymorphisms; SNP; SINGLE-NUCLEOTIDE POLYMORPHISMS; INHERITED DISEASE; PROMOTER REGIONS; HUMAN GENOME; GENE DOSAGE; SEQUENCE; DATABASE; IDENTIFICATION; MUTATIONS; BIOINFORMATICS;
D O I
10.1002/humu.21559
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Next-generation sequencing (NGS) technologies are yielding ever higher volumes of human genome sequence data. Given this large amount of data, it has become both a possibility and a priority to determine how disease-causing single nucleotide polymorphisms (SNPs) detected within gene regulatory regions (rSNPs) exert their effects on gene expression. Recently, several studies have explored whether disease-causing polymorphisms have attributes that can distinguish them from those that are neutral, attaining moderate success at discriminating between functional and putatively neutral regulatory SNPs. Here, we have extended this work by assessing the utility of both SNP-based features (those associated only with the polymorphism site and the surrounding DNA) and gene-based features (those derived from the associated gene in whose regulatory region the SNP lies) in the identification of functional regulatory polymorphisms involved in either monogenic or complex disease. Gene-based features were found to be capable of both augmenting and enhancing the utility of SNP-based features in the prediction of known regulatory mutations. Adopting this approach, we achieved an AUC of 0.903 for predicting regulatory SNPs. Finally, our tool predicted 225 new regulatory SNPs with a high degree of confidence, with 105 of the 225 falling into linkage disequilibrium blocks of reported disease-associated genome-wide association studies SNPs. Hum Mutat 32:1183-1190, 2011. (C) 2011 Wiley-Liss, Inc.
引用
收藏
页码:1183 / 1190
页数:8
相关论文
共 48 条
[1]   Modulating influence on HIV/AIDS by interacting RANTES gene variants [J].
An, P ;
Nelson, GW ;
Wang, LH ;
Donfield, S ;
Goedert, JJ ;
Phair, J ;
Vlahov, D ;
Buchbinder, S ;
Farrar, WL ;
Modi, W ;
O'Brien, SJ ;
Winkler, CA .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (15) :10002-10007
[2]   In silico detection of sequence variations modifying transcriptional regulation [J].
Andersen, Malin C. ;
Engstrom, Par R. G. ;
Lithwick, Stuart ;
Arenillas, David ;
Eriksson, Per ;
Lenhard, Boris ;
Wasserman, Wyeth W. ;
Odeberg, Jacob .
PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (01) :0043-0054
[3]  
ANNEREN G, 1993, APMIS, V101, P71
[4]   The importance and identification of regulatory polymorphisms and their mechanisms of action [J].
Buckland, PR .
BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR BASIS OF DISEASE, 2006, 1762 (01) :17-28
[5]   Strong bias the location of functional promoter polymorphisms [J].
Buckland, PR ;
Hoogendoorn, B ;
Coleman, SL ;
Guy, CA ;
Smith, SK ;
O'Donovan, MC .
HUMAN MUTATION, 2005, 26 (03) :214-223
[6]   A high proportion of polymorphisms in the promoters of brain expressed genes influences transcriptional activity [J].
Buckland, PR ;
Hoogendoom, B ;
Guy, CA ;
Coleman, SL ;
Smith, SK ;
Buxbaum, JD ;
Haroutunian, V ;
O'Donovan, MC .
BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR BASIS OF DISEASE, 2004, 1690 (03) :238-249
[7]   Validating Discovered Cis-Acting Regulatory Genetic Variants: Application of an Allele Specific Expression Approach to HapMap Populations [J].
Campino, Susana ;
Forton, Julian ;
Raj, Srilakshmi ;
Mohr, Bert ;
Auburn, Sarah ;
Fry, Andrew ;
Mangano, Valentina D. ;
Vandiedonck, Claire ;
Richardson, Anna ;
Rockett, Kirk ;
Clark, Taane G. ;
Kwiatkowski, Dominic P. .
PLOS ONE, 2008, 3 (12)
[8]   Discovery and verification of functional single nucleotide polymorphisms in regulatory genomic regions: Current and developing technologies [J].
Chorley, Brian N. ;
Wang, Xuting ;
Campbell, Michelle R. ;
Pittman, Gary S. ;
Noureddine, Maher A. ;
Bell, Douglas A. .
MUTATION RESEARCH-REVIEWS IN MUTATION RESEARCH, 2008, 659 (1-2) :147-157
[9]   Analysis of protein function and its prediction from amino acid sequence [J].
Clark, Wyatt T. ;
Radivojac, Predrag .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2011, 79 (07) :2086-2096
[10]   PupaSNP Finder: a web tool for finding SNPs with putative effect at transcriptional level [J].
Conde, L ;
Vaquerizas, JM ;
Santoyo, J ;
Al-Shahrour, F ;
Ruiz-Llorente, S ;
Robledo, M ;
Dopazo, J .
NUCLEIC ACIDS RESEARCH, 2004, 32 :W242-W248