Prediction of alternative isoforms from exon expression levels in RNA-Seq experiments

被引:107
作者
Richard, Hugues [1 ]
Schulz, Marcel H. [1 ,2 ]
Sultan, Marc [3 ]
Nuernberger, Asja [3 ]
Schrinner, Sabine [3 ]
Balzereit, Daniela [3 ]
Dagand, Emilie [3 ]
Rasche, Axel [3 ]
Lehrach, Hans [3 ]
Vingron, Martin [1 ]
Haas, Stefan A. [1 ]
Yaspo, Marie-Laure [3 ]
机构
[1] Max Planck Inst Mol Genet, Dept Computat Mol Biol, D-14195 Berlin, Germany
[2] Int Max Planck Res Sch Computat Biol & Sci Comp, D-14195 Berlin, Germany
[3] Max Planck Inst Mol Genet, Dept Vertebrate Genom, D-14195 Berlin, Germany
关键词
GENE-EXPRESSION; TRANSCRIPTOME; IDENTIFICATION; ALGORITHM; DISCOVERY; ARRAYS; CELLS;
D O I
10.1093/nar/gkq041
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Alternative splicing, polyadenylation of pre-messenger RNA molecules and differential promoter usage can produce a variety of transcript isoforms whose respective expression levels are regulated in time and space, thus contributing specific biological functions. However, the repertoire of mammalian alternative transcripts and their regulation are still poorly understood. Second-generation sequencing is now opening unprecedented routes to address the analysis of entire transcriptomes. Here, we developed methods that allow the prediction and quantification of alternative isoforms derived solely from exon expression levels in RNA-Seq data. These are based on an explicit statistical model and enable the prediction of alternative isoforms within or between conditions using any known gene annotation, as well as the relative quantification of known transcript structures. Applying these methods to a human RNA-Seq dataset, we validated a significant fraction of the predictions by RT-PCR. Data further showed that these predictions correlated well with information originating from junction reads. A direct comparison with exon arrays indicated improved performances of RNA-Seq over microarrays in the prediction of skipped exons. Altogether, the set of methods presented here comprehensively addresses multiple aspects of alternative isoform analysis. The software is available as an open-source R-package called Solas at http://cmb.molgen.mpg.de/2ndGenerationSequencing/Solas/.
引用
收藏
页数:15
相关论文
共 53 条
[11]   Stem cell transcriptome profiling via massive-scale mRNA sequencing [J].
Cloonan, Nicole ;
Forrest, Alistair R. R. ;
Kolle, Gabriel ;
Gardiner, Brooke B. A. ;
Faulkner, Geoffrey J. ;
Brown, Mellissa K. ;
Taylor, Darrin F. ;
Steptoe, Anita L. ;
Wani, Shivangi ;
Bethel, Graeme ;
Robertson, Alan J. ;
Perkins, Andrew C. ;
Bruce, Stephen J. ;
Lee, Clarence C. ;
Ranade, Swati S. ;
Peckham, Heather E. ;
Manning, Jonathan M. ;
McKernan, Kevin J. ;
Grimmond, Sean M. .
NATURE METHODS, 2008, 5 (07) :613-619
[12]   Alternative splicing and the progesterone receptor in breast cancer [J].
Cork, David M. W. ;
Lennard, Thomas W. J. ;
Tyson-Capper, Alison J. .
BREAST CANCER RESEARCH, 2008, 10 (03)
[13]   Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data [J].
Dai, MH ;
Wang, PL ;
Boyd, AD ;
Kostov, G ;
Athey, B ;
Jones, EG ;
Bunney, WE ;
Myers, RM ;
Speed, TP ;
Akil, H ;
Watson, SJ ;
Meng, F .
NUCLEIC ACIDS RESEARCH, 2005, 33 (20) :e175.1-e175.9
[14]   A correlation with exon expression approach to identify cis-regulatory elements for tissue-specific alternative splicing [J].
Das, Debopriya ;
Clark, Tyson A. ;
Schweitzer, Anthony ;
Yamamoto, Miki ;
Marr, Henry ;
Arribere, Josh ;
Minovitsky, Simon ;
Poliakov, Alexander ;
Dubchak, Inna ;
Blume, John E. ;
Conboy, John G. .
NUCLEIC ACIDS RESEARCH, 2007, 35 (14) :4845-4857
[15]   The functional consequences of alternative promoter use in mammalian genomes [J].
Davuluri, Ramana V. ;
Suzuki, Yutaka ;
Sugano, Sumio ;
Plass, Christoph ;
Huang, Tim H. -M. .
TRENDS IN GENETICS, 2008, 24 (04) :167-177
[16]   Substantial biases in ultra-short read data sets from high-throughput DNA sequencing [J].
Dohm, Juliane C. ;
Lottaz, Claudio ;
Borodina, Tatiana ;
Himmelbauer, Heinz .
NUCLEIC ACIDS RESEARCH, 2008, 36 (16)
[17]   Identification of differentially regulated splice variants and novel exons in glial brain tumors using exon expression arrays [J].
French, Pim J. ;
Peeters, Justine ;
Horsman, Sebastiaan ;
Duijm, Elza ;
Siccama, Ivar ;
van den Bent, Martin J. ;
Luider, Theo M. ;
Kros, Johan M. ;
van der Spek, Peter ;
Smitt, Peter A. Sillevis .
CANCER RESEARCH, 2007, 67 (12) :5635-5642
[18]   Alternative splicing and differential gene expression in colon cancer detected by a whole genome exon array [J].
Gardina, Paul J. ;
Clark, Tyson A. ;
Shimada, Brian ;
Staples, Michelle K. ;
Yang, Qing ;
Veitch, James ;
Schweitzer, Anthony ;
Awad, Tarif ;
Sugnet, Charles ;
Dee, Suzanne ;
Davies, Christopher ;
Williams, Alan ;
Turpaz, Yaron .
BMC GENOMICS, 2006, 7 (1)
[19]   Bioconductor: open software development for computational biology and bioinformatics [J].
Gentleman, RC ;
Carey, VJ ;
Bates, DM ;
Bolstad, B ;
Dettling, M ;
Dudoit, S ;
Ellis, B ;
Gautier, L ;
Ge, YC ;
Gentry, J ;
Hornik, K ;
Hothorn, T ;
Huber, W ;
Iacus, S ;
Irizarry, R ;
Leisch, F ;
Li, C ;
Maechler, M ;
Rossini, AJ ;
Sawitzki, G ;
Smith, C ;
Smyth, G ;
Tierney, L ;
Yang, JYH ;
Zhang, JH .
GENOME BIOLOGY, 2004, 5 (10)
[20]   Genome wide identification and classification of alternative splicing based on EST data [J].
Gupta, S ;
Zink, D ;
Korn, B ;
Vingron, M ;
Haas, SA .
BIOINFORMATICS, 2004, 20 (16) :2579-2585