Aberrant 5′ splice sites in human disease genes:: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization

被引:159
作者
Buratti, Emanuele
Chivers, Martin
Kralovicova, Jana
Romano, Maurizio
Baralle, Marco
Krainer, Adrian R.
Vorechovsky, Igor [1 ]
机构
[1] Int Ctr Genet Engn & Biotechnol, Trieste 34012, Italy
[2] Univ Southampton, Sch Med, Div Human Genet, Southampton SO16 6YD, Hants, England
[3] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
关键词
D O I
10.1093/nar/gkm402
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Despite a growing number of splicing mutations found in hereditary diseases, utilization of aberrant splice sites and their effects on gene expression remain challenging to predict. We compiled sequences of 346 aberrant 5'splice sites (5'ss) that were activated by mutations in 166 human disease genes. Mutations within the 5'ss consensus accounted for 254 cryptic 5'ss and mutations elsewhere activated 92 de novo 5'ss. Point mutations leading to cryptic 5'ss activation were most common in the first intron nucleotide, followed by the fifth nucleotide. Substitutions at position +5 were exclusively G>A transitions, which was largely attributable to high mutability rates of C/G>T/A. However, the frequency of point mutations at position +5 was significantly higher than that observed in the Human Gene Mutation Database, suggesting that alterations of this position are particularly prone to aberrant splicing, possibly due to a requirement for sequential interactions with U1 and U6 snRNAs. Cryptic 5'ss were best predicted by computational algorithms that accommodate nucleotide dependencies and not by weight-matrix models. Discrimination of intronic 5'ss from their authentic counterparts was less effective than for exonic sites, as the former were intrinsically stronger than the latter. Computational prediction of exonic de novo 5'ss was poor, suggesting that their activation critically depends on exonic splicing enhancers or silencers. The authentic counterparts of aberrant 5'ss were significantly weaker than the average human 5'ss. The development of an online database of aberrant 5'ss will be useful for studying basic mechanisms of splice-site selection, identifying splicing mutations and optimizing splice-site prediction algorithms.
引用
收藏
页码:4250 / 4263
页数:14
相关论文
共 95 条
  • [1] 5' CLEAVAGE SITE IN EUKARYOTIC PRE-MESSENGER-RNA SPLICING IS DETERMINED BY THE OVERALL 5' SPLICE REGION, NOT BY THE CONSERVED 5' GU
    AEBI, M
    HORNIG, H
    WEISSMANN, C
    [J]. CELL, 1987, 50 (02) : 237 - 246
  • [2] Activation of a cryptic 5′ splice site by U1 snRNA
    Alvarez, CJ
    Wise, JA
    [J]. RNA, 2001, 7 (03) : 342 - 350
  • [3] [Anonymous], 1993, Human gene mutation
  • [4] Familial adenomatous polyposis:: Aberrant splicing due to missense or silent mutations in the APC gene
    Aretz, S
    Uhlhaas, S
    Sun, Y
    Pagenstecher, C
    Mangold, E
    Caspari, R
    Möslein, G
    Schulmann, K
    Propping, P
    Friedl, W
    [J]. HUMAN MUTATION, 2004, 24 (05) : 370 - 380
  • [5] Mutations affecting mRNA splicing are the most common molecular defects in patients with neurofibromatosis type 1
    Ars, E
    Serra, E
    García, J
    Kruyer, H
    Gaona, A
    Lázaro, C
    Estivill, X
    [J]. HUMAN MOLECULAR GENETICS, 2000, 9 (02) : 237 - 247
  • [6] How did alternative splicing evolve?
    Ast, G
    [J]. NATURE REVIEWS GENETICS, 2004, 5 (10) : 773 - 782
  • [7] Activation of multiple cryptic donor splice sites by the common congenital afibrinogenemia mutation, FGA IVS4+1 G→T
    Attanasio, C
    de Moerloose, P
    Antonarakis, SE
    Morris, MA
    Neerman-Arbez, M
    [J]. BLOOD, 2001, 97 (06) : 1879 - 1881
  • [8] Splicing in action: assessing disease causing sequence changes
    Baralle, D
    Baralle, M
    [J]. JOURNAL OF MEDICAL GENETICS, 2005, 42 (10) : 737 - 748
  • [9] Efficient use of a 'dead-end' GA 5′ splice site in the human fibroblast growth factor receptor genes
    Brackenridge, S
    Wilkie, AOM
    Screaton, GR
    [J]. EMBO JOURNAL, 2003, 22 (07) : 1620 - 1631
  • [10] PREDICTION OF HUMAN MESSENGER-RNA DONOR AND ACCEPTOR SITES FROM THE DNA-SEQUENCE
    BRUNAK, S
    ENGELBRECHT, J
    KNUDSEN, S
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1991, 220 (01) : 49 - 65