SAGE2Splice:: Unmapped SAGE tags reveal novel splice junctions

被引:8
作者
Kuo, Byron Yu-Lin
Chen, Ying
Bohacec, Slavita
Johansson, Ojvind
Wasserman, Wyeth W.
Simpson, Elizabeth M. [1 ]
机构
[1] Univ British Columbia, Grad Program Genet, Vancouver, BC V5Z 1M9, Canada
[2] Univ British Columbia, Child & Family Res Inst, Ctr Mol Med & Therapeut, Dept Med Genet, Vancouver, BC V5Z 1M9, Canada
[3] Stockholm Bioinformat Ctr, Kunliga Tekniska Hogskolan, Stockholm, Sweden
关键词
D O I
10.1371/journal.pcbi.0020034
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Serial analysis of gene expression ( SAGE) not only is a method for profiling the global expression of genes, but also offers the opportunity for the discovery of novel transcripts. SAGE tags are mapped to known transcripts to determine the gene of origin. Tags that map neither to a known transcript nor to the genome were hypothesized to span a splice junction, for which the exon combination or exon(s) are unknown. To test this hypothesis, we have developed an algorithm, SAGE2Splice, to efficiently map SAGE tags to potential splice junctions in a genome. The algorithm consists of three search levels. A scoring scheme was designed based on position weight matrices to assess the quality of candidates. Using optimized parameters for SAGE2Splice analysis and two sets of SAGE data, candidate junctions were discovered for 5%-6% of unmapped tags. Candidates were classified into three categories, reflecting the previous annotations of the putative splice junctions. Analysis of predicted tags extracted from EST sequences demonstrated that candidate junctions having the splice junction located closer to the center of the tags are more reliable. Nine of these 12 candidates were validated by RT-PCR and sequencing, and among these, four revealed previously uncharacterized exons. Thus, SAGE2Splice provides a new functionality for the identification of novel transcripts and exons.
引用
收藏
页码:276 / 287
页数:12
相关论文
共 33 条
[11]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[12]   A SAGE approach to discovery of genes involved in autophagic cell death [J].
Gorski, SM ;
Chittaranjan, S ;
Pleasance, ED ;
Freeman, JD ;
Anderson, CL ;
Varhol, RJ ;
Coughlin, SM ;
Zuyderduyn, SD ;
Jones, SJM ;
Marra, MA .
CURRENT BIOLOGY, 2003, 13 (04) :358-363
[13]   Changes in gene expression associated with developmental arrest and longevity in Caenorhabditis elegans [J].
Jones, SJM ;
Riddle, DL ;
Pouzyrev, AT ;
Velculescu, VE ;
Hillier, L ;
Eddy, SR ;
Stricklin, SL ;
Baillie, DL ;
Waterston, R ;
Marra, MA .
GENOME RESEARCH, 2001, 11 (08) :1346-1352
[14]   The UCSC Genome Browser Database [J].
Karolchik, D ;
Baertsch, R ;
Diekhans, M ;
Furey, TS ;
Hinrichs, A ;
Lu, YT ;
Roskin, KM ;
Schwartz, M ;
Sugnet, CW ;
Thomas, DJ ;
Weber, RJ ;
Haussler, D ;
Kent, WJ .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :51-54
[15]   Identitag, a relational database for SAGE tag identification and interspecies comparison of SAGE libraries [J].
Keime, C ;
Damiola, F ;
Mouchiroud, D ;
Duret, L ;
Gandrillon, O .
BMC BIOINFORMATICS, 2004, 5 (1)
[16]  
Korf I, 2001, Bioinformatics, V17 Suppl 1, pS140
[17]   SAGEmap: A public gene expression resource [J].
Lash, AE ;
Tolstoshev, CM ;
Wagner, L ;
Schuler, GD ;
Strausberg, RL ;
Riggins, GJ ;
Altschul, SF .
GENOME RESEARCH, 2000, 10 (07) :1051-1060
[18]   Serial analysis of gene expression: from gene discovery to target identification [J].
Madden, SL ;
Wang, CJ ;
Landes, G .
DRUG DISCOVERY TODAY, 2000, 5 (09) :415-425
[19]   Frequent alternative splicing of human genes [J].
Mironov, AA ;
Fickett, JW ;
Gelfand, MS .
GENOME RESEARCH, 1999, 9 (12) :1288-1293
[20]   A genomic view of alternative splicing [J].
Modrek, B ;
Lee, C .
NATURE GENETICS, 2002, 30 (01) :13-19