Using positional distribution to identify splicing elements and predict pre-mRNA processing defects in human genes

被引:215
作者
Lim, Kian Huat [1 ]
Ferraris, Luciana [1 ]
Filloux, Madeleine E. [1 ]
Raphael, Benjamin J. [2 ,3 ]
Fairbrother, William G. [1 ,3 ,4 ]
机构
[1] Brown Univ, Dept Mol & Cellular Biol & Biochem, Providence, RI 02903 USA
[2] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA
[3] Ctr Computat Mol Biol, Providence, RI 02912 USA
[4] Brown Univ, Ctr Genom & Prote, Providence, RI 02903 USA
关键词
VIRUS NEGATIVE REGULATOR; BINDING SPECIFICITY; SR PROTEIN; PROTEOMIC ANALYSIS; MUTATIONS; IDENTIFICATION; SPLICEOSOMES; ENHANCERS; SEQUENCE; DISTINCT;
D O I
10.1073/pnas.1101135108
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We present an intuitive strategy for predicting the effect of sequence variation on splicing. In contrast to transcriptional elements, splicing elements appear to be strongly position dependent. We demonstrated that exonic binding of the normally intronic splicing factor, U2AF65, inhibits splicing. Reasoning that the positional distribution of a splicing element is a signature of its function, we developed a method for organizing all possible sequence motifs into clusters based on the genomic profile of their positional distribution around splice sites. Binding sites for serine/arginine rich (SR) proteins tended to be exonic whereas heterogeneous ribonucleoprotein (hnRNP) recognition elements were mostly intronic. In addition to the known elements, novel motifs were returned and validated. This method was also predictive of splicing mutations. A mutation in a motif creates a new motif that sometimes has a similar distribution shape to the original motif and sometimes has a different distribution. We created an intraallelic distance measure to capture this property and found that mutations that created large intraallelic distances disrupted splicing in vivo whereas mutations with small distances did not alter splicing. Analyzing the dataset of human disease alleles revealed known splicing mutants to have high intraallelic distances and suggested that 22% of disease alleles that were originally classified as missense mutations may also affect splicing. This category together with mutations in the canonical splicing signals suggest that approximately one third of all disease-causing mutations alter pre-mRNA splicing.
引用
收藏
页码:11093 / 11098
页数:6
相关论文
共 46 条
[1]   Mutations affecting mRNA splicing are the most common molecular defects in patients with neurofibromatosis type 1 [J].
Ars, E ;
Serra, E ;
García, J ;
Kruyer, H ;
Gaona, A ;
Lázaro, C ;
Estivill, X .
HUMAN MOLECULAR GENETICS, 2000, 9 (02) :237-247
[2]   RNA-BINDING SPECIFICITY OF HNRNP A1 - SIGNIFICANCE OF HNRNP A1 HIGH-AFFINITY BINDING-SITES IN PRE-MESSENGER-RNA SPLICING [J].
BURD, CG ;
DREYFUSS, G .
EMBO JOURNAL, 1994, 13 (05) :1197-1204
[3]  
Calinski T., 1974, Communications in Statistics-theory and Methods, V3, P1, DOI [10.1080/03610927408827101, DOI 10.1080/03610927408827101]
[4]   Determination of the RNA binding specificity of the heterogeneous nuclear ribonucleoprotein (hnRNP) H/H′/F/2H9 family [J].
Caputi, M ;
Zahler, AM .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2001, 276 (47) :43850-43859
[5]   The splicing factors 9G8 and SRp20 transactivate splicing through different and specific enhancers [J].
Cavaloc, Y ;
Bourgeois, CF ;
Kister, L ;
Stévenin, J .
RNA, 1999, 5 (03) :468-483
[6]   DIRECT SELECTION FOR MUTATIONS AFFECTING SPECIFIC SPLICE SITES IN A HAMSTER DIHYDROFOLATE-REDUCTASE MINIGENE [J].
CHEN, IT ;
CHASIN, LA .
MOLECULAR AND CELLULAR BIOLOGY, 1993, 13 (01) :289-300
[7]   Proteomic analysis of in vivo-assembled pre-mRNA splicing complexes expands the catalog of participating factors [J].
Chen, Yen-I G. ;
Moore, Roger E. ;
Ge, Helen Y. ;
Young, Mary K. ;
Lee, Terry D. ;
Stevens, Scott W. .
NUCLEIC ACIDS RESEARCH, 2007, 35 (12) :3928-3944
[8]   SR protein and snRNP requirements for assembly of the Rous sarcoma virus negative regulator of splicing complex in vitro [J].
Cook, CR ;
McNally, MT .
VIROLOGY, 1998, 242 (01) :211-220
[9]   Predictive identification of exonic splicing enhancers in human genes [J].
Fairbrother, WG ;
Yeh, RF ;
Sharp, PA ;
Burge, CB .
SCIENCE, 2002, 297 (5583) :1007-1013
[10]   Comparative analysis identifies exonic splicing regulatory sequences - The complex definition of enhancers and silencers [J].
Goren, Amir ;
Ram, Oren ;
Amit, Maayan ;
Keren, Hadas ;
Lev-Maor, Galit ;
Vig, Ida ;
Pupko, Tal ;
Ast, Gil .
MOLECULAR CELL, 2006, 22 (06) :769-781