Estimation of data-specific constitutive exons with RNA-Seq data

被引:4
|
作者
Patrick, Ellis [1 ,2 ]
Buckley, Michael [2 ]
Yang, Yee Hwa [1 ]
机构
[1] Univ Sydney, Sch Math & Stat, Sydney, NSW 2006, Australia
[2] CSIRO Math & Informat Sci, Clayton, Vic 3168, Australia
来源
BMC BIOINFORMATICS | 2013年 / 14卷
基金
澳大利亚研究理事会;
关键词
DIFFERENTIAL EXPRESSION ANALYSIS; PRE-MESSENGER-RNA; GENE-EXPRESSION; NORMALIZATION; MECHANISMS; SEQUENCES; TOPHAT; TOOL;
D O I
10.1186/1471-2105-14-31
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: RNA-Seq has the potential to answer many diverse and interesting questions about the inner workings of cells. Estimating changes in the overall transcription of a gene is not straightforward. Changes in overall gene transcription can easily be confounded with changes in exon usage which alter the lengths of transcripts produced by a gene. Measuring the expression of constitutive exons-exons which are consistently conserved after splicing-offers an unbiased estimation of the overall transcription of a gene. Results: We propose a clustering-based method, exClust, for estimating the exons that are consistently conserved after splicing in a given data set. These are considered as the exons which are "constitutive" in this data. The method utilises information from both annotation and the dataset of interest. The method is implemented in an openly available R function package, sydSeq. Conclusion: When used on two real datasets exClust includes more than three times as many reads as the standard UI method, and improves concordance with qRT-PCR data. When compared to other methods, our method is shown to produce robust estimates of overall gene transcription.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Removing technical variability in RNA-seq data using conditional quantile normalization
    Hansen, Kasper D.
    Irizarry, Rafael A.
    WU, Zhijin
    BIOSTATISTICS, 2012, 13 (02) : 204 - 216
  • [22] iReckon: Simultaneous isoform discovery and abundance estimation from RNA-seq data
    Mezlini, Aziz M.
    Smith, Eric J. M.
    Fiume, Marc
    Buske, Orion
    Savich, Gleb L.
    Shah, Sohrab
    Aparicio, Sam
    Chiang, Derek Y.
    Goldenberg, Anna
    Brudno, Michael
    GENOME RESEARCH, 2013, 23 (03) : 519 - 529
  • [23] Estimation of isoform expression in RNA-seq data using a hierarchical Bayesian model
    Wang, Zengmiao
    Wang, Jun
    Wu, Changjing
    Deng, Minghua
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2015, 13 (06)
  • [24] A comparison of methods for differential expression analysis of RNA-seq data
    Soneson, Charlotte
    Delorenzi, Mauro
    BMC BIOINFORMATICS, 2013, 14
  • [25] Parametric analysis of RNA-seq expression data
    Konishi, Tomokazu
    GENES TO CELLS, 2016, 21 (06) : 639 - 647
  • [26] EPIG-Seq: extracting patterns and identifying co-expressed genes from RNA-Seq data
    Li, Jianying
    Bushel, Pierre R.
    BMC GENOMICS, 2016, 17
  • [27] RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome
    Li, Bo
    Dewey, Colin N.
    BMC BIOINFORMATICS, 2011, 12
  • [28] Impact of RNA-seq data analysis algorithms on gene expression estimation and downstream prediction
    Tong, Li
    Wu, Po-Yen
    Phan, John H.
    Hassazadeh, Hamid R.
    Tong, Weida
    Wang, May D.
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [29] A Framework for Comparison and Assessment of Synthetic RNA-Seq Data
    Shakola, Felitsiya
    Palejev, Dean
    Ivanov, Ivan
    GENES, 2022, 13 (12)
  • [30] Bias and Correction in RNA-seq Data for Marine Species
    Kai Song
    Li Li
    Guofan Zhang
    Marine Biotechnology, 2017, 19 : 541 - 550