Efficient RNA isoform identification and quantification from RNA-Seq data with network flows

被引:49
|
作者
Bernard, Elsa [1 ,2 ,3 ]
Jacob, Laurent [4 ]
Mairal, Julien [5 ]
Vert, Jean-Philippe [1 ,2 ,3 ]
机构
[1] Mines ParisTech, Ctr Computat Biol, F-77300 Fontainebleau, France
[2] Inst Curie, F-75248 Paris, France
[3] INSERM, U900, F-75248 Paris, France
[4] Univ Lyon 1, INRA, CNRS, Lab Biometrie & Biol Evolut,UMR5558, Villeurbanne, France
[5] INRIA Grenoble Rhone Alpes, LEAR Project Team, F-38330 Montbonnot St Martin, France
基金
欧洲研究理事会; 美国国家科学基金会;
关键词
ABUNDANCE ESTIMATION; TRANSCRIPTOME; EXPRESSION; SELECTION; ALGORITHM; DISCOVERY; GENOME; GRAPHS; LASSO;
D O I
10.1093/bioinformatics/btu317
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Several state-of-the-art methods for isoform identification and quantification are based on l(1)-regularized regression, such as the Lasso. However, explicitly listing the-possibly exponentially-large set of candidate transcripts is intractable for genes with many exons. For this reason, existing approaches using the l(1)-penalty are either restricted to genes with few exons or only run the regression algorithm on a small set of preselected isoforms. Results: We introduce a new technique called FlipFlop, which can efficiently tackle the sparse estimation problem on the full set of candidate isoforms by using network flow optimization. Our technique removes the need of a preselection step, leading to better isoform identification while keeping a low computational cost. Experiments with synthetic and real RNA-Seq data confirm that our approach is more accurate than alternative methods and one of the fastest available.
引用
收藏
页码:2447 / 2455
页数:9
相关论文
共 50 条
  • [1] Acfs: accurate circRNA identification and quantification from RNA-Seq data
    You, Xintian
    Conrad, Tim O. F.
    SCIENTIFIC REPORTS, 2016, 6
  • [2] A convex formulation for joint RNA isoform detection and quantification from multiple RNA-seq samples
    Bernard, Elsa
    Jacob, Laurent
    Mairal, Julien
    Viara, Eric
    Vert, Jean-Philippe
    BMC BIOINFORMATICS, 2015, 16
  • [3] WemIQ: an accurate and robust isoform quantification method for RNA-seq data
    Zhang, Jing
    Kuo, C. -C. Jay
    Chen, Liang
    BIOINFORMATICS, 2015, 31 (06) : 878 - 885
  • [4] Towards reliable isoform quantification using RNA-SEQ data
    Howard, Brian E.
    Heber, Steffen
    BMC BIOINFORMATICS, 2010, 11
  • [5] Alternating EM algorithm for a bilinear model in isoform quantification from RNA-seq data
    Deng, Wenjiang
    Mou, Tian
    Kalari, Krishna R.
    Niu, Nifang
    Wang, Liewei
    Pawitan, Yudi
    Trung Nghia Vu
    BIOINFORMATICS, 2020, 36 (03) : 805 - 812
  • [6] Estimation of alternative splicing isoform frequencies from RNA-Seq data
    Nicolae, Marius
    Mangul, Serghei
    Mandoiu, Ion I.
    Zelikovsky, Alex
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2011, 6
  • [7] Reliable Identification of Genomic Variants from RNA-Seq Data
    Piskol, Robert
    Ramaswami, Gokul
    Li, Jin Billy
    AMERICAN JOURNAL OF HUMAN GENETICS, 2013, 93 (04) : 641 - 651
  • [8] Evaluation and comparison of computational tools for RNA-seq isoform quantification
    Zhang, Chi
    Zhang, Baohong
    Lin, Lih-Ling
    Zhao, Shanrong
    BMC GENOMICS, 2017, 18
  • [9] Prediction and Quantification of Splice Events from RNA-Seq Data
    Goldstein, Leonard D.
    Cao, Yi
    Pau, Gregoire
    Lawrence, Michael
    Wu, Thomas D.
    Seshagiri, Somasekar
    Gentleman, Robert
    PLOS ONE, 2016, 11 (05):
  • [10] Estimation of Alternative Splicing isoform Frequencies from RNA-Seq Data
    Nicolae, Marius
    Mangul, Serghei
    Mandoiu, Ion
    Zelikovsky, Alex
    ALGORITHMS IN BIOINFORMATICS, 2010, 6293 : 202 - +