Using non-uniform read distribution models to improve isoform expression inference in RNA-Seq

Cited by: 76
Authors
Wu, Zhengpeng
Wang, Xi
Zhang, Xuegong [1]
Affiliations
[1] Tsinghua Univ, TNLIST Dept Automat, MOE Key Lab Bioinformat, Beijing 100084, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
MESSENGER-RNA; TRANSCRIPTOME; DISEASE; PARKIN; CHIP;
DOI
10.1093/bioinformatics/btq696
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: RNA-Seq technology based on next-generation sequencing provides the unprecedented ability of studying transcriptomes at high resolution and accuracy, and the potential of measuring expression of multiple isoforms from the same gene at high precision. Solved by maximum likelihood estimation, isoform expression can be inferred in RNA-Seq using statistical models based on the assumption that sequenced reads are distributed uniformly along transcripts. Modification of the model is needed when considering situations where RNA-Seq data do not follow uniform distribution.
Results: We proposed two curves, the global bias curve (GBC) and the local bias curves (LBCs), to describe the non-uniformity of read distributions for all genes in a transcriptome and for each gene, respectively. Incorporating the bias curves into the uniform read distribution (URD) model, we introduced non-URD (N-URD) models to infer isoform expression levels. On a series of systematic simulation studies, the proposed models outperform the original model in recovering major isoforms and the expression ratio of alternative isoforms. We also applied the new model to real RNA-Seq datasets and found that its inferences on expression ratios of alternative isoforms are more reasonable. The experiments indicate that incorporating N-URD information can improve the accuracy in modeling and inferring isoform expression in RNA-Seq.
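The idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation. It assumes a Poisson read-count model per exon, a hypothetical per-exon bias weight standing in for a bias curve, and a standard EM-style multiplicative update for the maximum likelihood estimate. The function names (`rate_matrix`, `fit_expression`), the exon lengths, the bias weights, and the isoform structures are all made up for illustration.

```python
def rate_matrix(isoform_exons, exon_lengths, exon_bias=None):
    """rates[i][j]: expected reads in exon j per unit expression of isoform i.

    exon_bias=None corresponds to the uniform assumption (all weights 1).
    The per-exon weights are a hypothetical stand-in for a bias curve.
    """
    n_exons = len(exon_lengths)
    if exon_bias is None:
        exon_bias = [1.0] * n_exons
    return [
        [exon_lengths[j] * exon_bias[j] if j in exons else 0.0
         for j in range(n_exons)]
        for exons in isoform_exons
    ]


def fit_expression(counts, rates, n_iter=2000):
    """Maximum likelihood isoform expression under an assumed Poisson model.

    counts[j] ~ Poisson(sum_i rates[i][j] * theta[i]); the update below is
    the usual EM iteration for this kind of Poisson linear model.
    """
    n_iso, n_exons = len(rates), len(counts)
    theta = [1.0] * n_iso
    totals = [sum(row) for row in rates]
    for _ in range(n_iter):
        # Expected read count in each exon under the current estimate.
        lam = [sum(rates[i][j] * theta[i] for i in range(n_iso))
               for j in range(n_exons)]
        ratio = [counts[j] / lam[j] if lam[j] > 0 else 0.0
                 for j in range(n_exons)]
        # Multiplicative EM update; keeps every theta[i] non-negative.
        theta = [
            theta[i] * sum(rates[i][j] * ratio[j] for j in range(n_exons))
            / totals[i]
            for i in range(n_iso)
        ]
    return theta


# Toy gene (illustrative values only): three exons of equal length,
# isoform 0 uses all exons, isoform 1 skips the last one.
exon_lengths = [100, 100, 100]
isoform_exons = [{0, 1, 2}, {0, 1}]
exon_bias = [0.5, 1.0, 1.5]   # assumed read enrichment toward the last exon
true_theta = [2.0, 1.0]

biased_rates = rate_matrix(isoform_exons, exon_lengths, exon_bias)
# Noise-free expected counts generated under the biased model.
counts = [sum(biased_rates[i][j] * true_theta[i] for i in range(2))
          for j in range(3)]

theta_nurd = fit_expression(counts, biased_rates)
theta_urd = fit_expression(counts, rate_matrix(isoform_exons, exon_lengths))
print("bias-aware fit:", theta_nurd)
print("uniform fit:   ", theta_urd)
```

In this toy setup the bias-aware fit recovers the 2:1 ratio used to generate the counts, while the uniform fit assigns nearly all expression to the first isoform. That is only a qualitative illustration of why a non-uniform read distribution can distort isoform ratios; it says nothing about the size of the effect on real data.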
Pages: 502-508
Number of pages: 7