Using non-uniform read distribution models to improve isoform expression inference in RNA-Seq

被引:76
|
作者
Wu, Zhengpeng
Wang, Xi
Zhang, Xuegong [1 ]
机构
[1] Tsinghua Univ, TNLIST Dept Automat, MOE Key Lab Bioinformat, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
MESSENGER-RNA; TRANSCRIPTOME; DISEASE; PARKIN; CHIP;
D O I
10.1093/bioinformatics/btq696
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: RNA-Seq technology based on next-generation sequencing provides the unprecedented ability of studying transcriptomes at high resolution and accuracy, and the potential of measuring expression of multiple isoforms from the same gene at high precision. Solved by maximum likelihood estimation, isoform expression can be inferred in RNA-Seq using statistical models based on the assumption that sequenced reads are distributed uniformly along transcripts. Modification of the model is needed when considering situations where RNA-Seq data do not follow uniform distribution. Results: We proposed two curves, the global bias curve (GBC) and the local bias curves (LBCs), to describe the non-uniformity of read distributions for all genes in a transcriptome and for each gene, respectively. Incorporating the bias curves into the uniform read distribution (URD) model, we introduced non-URD (N-URD) models to infer isoform expression levels. On a series of systematic simulation studies, the proposed models outperform the original model in recovering major isoforms and the expression ratio of alternative isoforms. We also applied the new model to real RNA-Seq datasets and found that its inferences on expression ratios of alternative isoforms are more reasonable. The experiments indicate that incorporating N-URD information can improve the accuracy in modeling and inferring isoform expression in RNA-Seq.
引用
收藏
页码:502 / 508
页数:7
相关论文
共 46 条
  • [41] Transcription profiling using RNA-Seq demonstrates expression differences in the body walls of juvenile albino and normal sea cucumbers Apostichopus japonicus
    Ma Deyou
    Yang Hongsheng
    Sun Lina
    Chen Muyan
    CHINESE JOURNAL OF OCEANOLOGY AND LIMNOLOGY, 2014, 32 (01) : 34 - 46
  • [42] Using RNA-seq to Profile Gene Expression of Spikelet Development in Response to Temperature and Nitrogen during Meiosis in Rice (Oryza sativa L.)
    Yang, Jun
    Chen, Xiaorong
    Zhu, Changlan
    Peng, Xiaosong
    He, Xiaopeng
    Fu, Junru
    Ouyang, Linjuan
    Bian, Jianmin
    Hu, Lifang
    Sun, Xiaotang
    Xu, Jie
    He, Haohua
    PLOS ONE, 2015, 10 (12):
  • [43] Genome-wide gene expression analysis of amphioxus (Branchiostoma belcheri) following lipopolysaccharide challenge using strand-specific RNA-seq
    Zhang, Qi-Lin
    Zhu, Qian-Hua
    Xie, Zheng-Qing
    Xu, Bin
    Wang, Xiu-Qiang
    Chen, Jun-Yuan
    RNA BIOLOGY, 2017, 14 (12) : 1799 - 1809
  • [44] Allele Specific Expression (ASE) analysis between Bos Taurus and Bos Indicus cows using RNA-Seq data at SNP level and gene level
    Varkoohi, Sheida
    Banabazi, Mohammad Hossein
    Ghsemi-Siab, Mojgan
    ANAIS DA ACADEMIA BRASILEIRA DE CIENCIAS, 2021, 93 (03):
  • [45] Evaluating Gene Expression in C57BL/6J and DBA/2J Mouse Striatum Using RNA-Seq and Microarrays
    Bottomly, Daniel
    Walter, Nicole A. R.
    Hunter, Jessica Ezzell
    Darakjian, Priscila
    Kawane, Sunita
    Buck, Kari J.
    Searles, Robert P.
    Mooney, Michael
    McWeeney, Shannon K.
    Hitzemann, Robert
    PLOS ONE, 2011, 6 (03):
  • [46] Tolerance to dietary linalool primarily involves co-expression of cytochrome P450s and cuticular proteins in Pagiophloeus tsushimanus (Coleoptera: Curculionidae) larvae using SMRT sequencing and RNA-seq
    Li, Shouyin
    Li, Hui
    Chen, Cong
    Hao, Dejun
    BMC GENOMICS, 2023, 24 (01)