Patterns of variant polyadenylation signal usage in human genes

Cited by: 548
Authors
Beaudoing, E [1]
Freier, S [1]
Wyatt, JR [1]
Claverie, JM [1]
Gautheret, D [1]
Affiliations
[1] Struct & Genet Informat Lab, CNRS, UMR 1889, F-13402 Marseille 20, France
DOI
10.1101/gr.10.7.1001
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification code
071010; 081704
Abstract
The formation of mature mRNAs in vertebrates involves the cleavage and polyadenylation of the pre-mRNA, 10-30 nt downstream of an AAUAAA or AUUAAA signal sequence. The extensive cDNA data now available shows that these hexamers are not strictly conserved. In order to identify variant polyadenylation signals on a large scale, we compared over 8700 human 3' untranslated sequences to 157,775 polyadenylated expressed sequence tags (ESTs), used as markers of actual mRNA 3' ends. About 5600 EST-supported putative mRNA 3' ends were collected and analyzed for significant hexameric sequences. Known polyadenylation signals were found in only 73% of the 3' fragments. Ten single-base variants of the AAUAAA sequence were identified with a highly significant occurrence rate, potentially representing 14.9% of the actual polyadenylation signals. Of the mRNAs, 28.6% displayed two or more polyadenylation sites. In these mRNAs, the poly(A) sites proximal to the coding sequence tend to use variant signals more often, while the 3'-most site tends to use a canonical signal. The average number of ESTs associated with each signal type suggests that variant signals (including the common AUUAAA) are processed less efficiently than the canonical signal and could therefore be selected for regulatory purposes. However, the position of the site in the untranslated region may also play a role in polyadenylation rate.
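The hexamer search described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the EST matching and the significance testing are omitted, and the function names, the conversion of T to U, and the way distance is measured (from the last base of the hexamer to the end of the input sequence, taken as the cleavage site) are assumptions made for illustration only.

```python
CANONICAL = "AAUAAA"


def single_base_variants(hexamer, alphabet="ACGU"):
    """Return every sequence that differs from `hexamer` at exactly one position."""
    variants = set()
    for i, base in enumerate(hexamer):
        for alt in alphabet:
            if alt != base:
                variants.add(hexamer[:i] + alt + hexamer[i + 1:])
    return variants


def find_signals(utr, window=(10, 30)):
    """Scan the region upstream of the 3' end of `utr` for candidate signals.

    `utr` is assumed to end at the cleavage site. A hexamer is reported when
    the number of nucleotides between its last base and the end of `utr`
    falls inside `window` (inclusive). Returns (hexamer, distance, kind)
    tuples, where kind is "canonical" or "variant".
    """
    rna = utr.upper().replace("T", "U")  # accept cDNA-style input
    variants = single_base_variants(CANONICAL)
    hits = []
    for start in range(len(rna) - 5):
        hexamer = rna[start:start + 6]
        distance = len(rna) - (start + 6)
        if not (window[0] <= distance <= window[1]):
            continue
        if hexamer == CANONICAL:
            hits.append((hexamer, distance, "canonical"))
        elif hexamer in variants:
            hits.append((hexamer, distance, "variant"))
    return hits
```

Calling `find_signals` on a 3' fragment returns the canonical and single-base-variant hexamers found in the window. AUUAAA is reported as a variant here because it differs from AAUAAA at one position.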
Pages: 1001-1010
Page count: 10