Background: High-throughput RNA sequencing (RNA-Seq) enables quantification and identification of transcripts at single-base resolution. Recently, longer sequence reads become available thanks to the development of new types of sequencing technologies as well as improvements in chemical reagents for the Next Generation Sequencers. Although several computational methods have been proposed for quantifying gene expression levels from RNA-Seq data, they are not sufficiently optimized for longer reads (e.g. > 250 bp). Results: We propose TIGAR2, a statistical method for quantifying transcript isoforms from fixed and variable length RNA-Seq data. Our method models substitution, deletion, and insertion errors of sequencers based on gapped-alignments of reads to the reference cDNA sequences so that sensitive read-aligners such as Bowtie2 and BWA-MEM are effectively incorporated in our pipeline. Also, a heuristic algorithm is implemented in variational Bayesian inference for faster computation. We apply TIGAR2 to both simulation data and real data of human samples and evaluate performance of transcript quantification with TIGAR2 in comparison to existing methods. Conclusions: TIGAR2 is a sensitive and accurate tool for quantifying transcript isoform abundances from RNA-Seq data. Our method performs better than existing methods for the fixed-length reads (100 bp, 250 bp, 500 bp, and 1000 bp of both single-end and paired-end) and variable-length reads, especially for reads longer than 250 bp.
机构:
Walter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, AustraliaWalter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
Oshlack, Alicia
Robinson, Mark D.
论文数: 0引用数: 0
h-index: 0
机构:
Walter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
Garvan Inst Med Res, Canc Program, Epigenet Lab, Darlinghurst, NSW 2010, AustraliaWalter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
Robinson, Mark D.
Young, Matthew D.
论文数: 0引用数: 0
h-index: 0
机构:
Walter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, AustraliaWalter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
机构:
Peking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R ChinaPeking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R China
Wang, Zengmiao
Wang, Jun
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R ChinaPeking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R China
Wang, Jun
Wu, Changjing
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Sch Math Sci, Beijing 100871, Peoples R ChinaPeking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R China
Wu, Changjing
Deng, Minghua
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R China
Peking Univ, Sch Math Sci, LMAM, Beijing 100871, Peoples R China
Peking Univ, Ctr Stat Sci, Beijing 100871, Peoples R ChinaPeking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R China
机构:
Peking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R ChinaPeking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R China
Wang, Zengmiao
Wang, Jun
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R ChinaPeking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R China
Wang, Jun
Deng, Minghua
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R China
Sch Math Sci, Beijing 100871, Peoples R ChinaPeking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R China
Deng, Minghua
2015 34TH CHINESE CONTROL CONFERENCE (CCC),
2015,
: 8554
-
8558