Efficient RNA isoform identification and quantification from RNA-Seq data with network flows

被引:49
|
作者
Bernard, Elsa [1 ,2 ,3 ]
Jacob, Laurent [4 ]
Mairal, Julien [5 ]
Vert, Jean-Philippe [1 ,2 ,3 ]
机构
[1] Mines ParisTech, Ctr Computat Biol, F-77300 Fontainebleau, France
[2] Inst Curie, F-75248 Paris, France
[3] INSERM, U900, F-75248 Paris, France
[4] Univ Lyon 1, INRA, CNRS, Lab Biometrie & Biol Evolut,UMR5558, Villeurbanne, France
[5] INRIA Grenoble Rhone Alpes, LEAR Project Team, F-38330 Montbonnot St Martin, France
基金
欧洲研究理事会; 美国国家科学基金会;
关键词
ABUNDANCE ESTIMATION; TRANSCRIPTOME; EXPRESSION; SELECTION; ALGORITHM; DISCOVERY; GENOME; GRAPHS; LASSO;
D O I
10.1093/bioinformatics/btu317
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Several state-of-the-art methods for isoform identification and quantification are based on l(1)-regularized regression, such as the Lasso. However, explicitly listing the-possibly exponentially-large set of candidate transcripts is intractable for genes with many exons. For this reason, existing approaches using the l(1)-penalty are either restricted to genes with few exons or only run the regression algorithm on a small set of preselected isoforms. Results: We introduce a new technique called FlipFlop, which can efficiently tackle the sparse estimation problem on the full set of candidate isoforms by using network flow optimization. Our technique removes the need of a preselection step, leading to better isoform identification while keeping a low computational cost. Experiments with synthetic and real RNA-Seq data confirm that our approach is more accurate than alternative methods and one of the fastest available.
引用
收藏
页码:2447 / 2455
页数:9
相关论文
共 50 条
  • [21] Improved RNA-Seq Partitions in Linear Models for Isoform Quantification
    Howard, Brian E.
    Veronese, Paola
    Heber, Steffen
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, : 151 - 154
  • [22] Evaluation and comparison of computational tools for RNA-seq isoform quantification
    Chi Zhang
    Baohong Zhang
    Lih-Ling Lin
    Shanrong Zhao
    BMC Genomics, 18
  • [23] Evaluation and comparison of computational tools for RNA-seq isoform quantification
    Zhang, Chi
    Zhang, Baohong
    Lin, Lih-Ling
    Zhao, Shanrong
    BMC GENOMICS, 2017, 18
  • [24] Comparative evaluation of full-length isoform quantification from RNA-Seq
    Sarantopoulou, Dimitra
    Brooks, Thomas G.
    Nayak, Soumyashant
    Mrcela, Antonijo
    Lahens, Nicholas F.
    Grant, Gregory R.
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [25] Comparative evaluation of full-length isoform quantification from RNA-Seq
    Dimitra Sarantopoulou
    Thomas G. Brooks
    Soumyashant Nayak
    Antonijo Mrčela
    Nicholas F. Lahens
    Gregory R. Grant
    BMC Bioinformatics, 22
  • [26] Prediction and Quantification of Splice Events from RNA-Seq Data
    Goldstein, Leonard D.
    Cao, Yi
    Pau, Gregoire
    Lawrence, Michael
    Wu, Thomas D.
    Seshagiri, Somasekar
    Gentleman, Robert
    PLOS ONE, 2016, 11 (05):
  • [27] Estimation of alternative splicing isoform frequencies from RNA-Seq data
    Marius Nicolae
    Serghei Mangul
    Ion I Măndoiu
    Alex Zelikovsky
    Algorithms for Molecular Biology, 6
  • [28] SplAdder: identification, quantification and testing of alternative splicing events from RNA-Seq data
    Kahles, Andre
    Ong, Cheng Soon
    Zhong, Yi
    Ratsch, Gunnar
    BIOINFORMATICS, 2016, 32 (12) : 1840 - 1847
  • [29] Estimation of Alternative Splicing isoform Frequencies from RNA-Seq Data
    Nicolae, Marius
    Mangul, Serghei
    Mandoiu, Ion
    Zelikovsky, Alex
    ALGORITHMS IN BIOINFORMATICS, 2010, 6293 : 202 - +
  • [30] Estimation of alternative splicing isoform frequencies from RNA-Seq data
    Nicolae, Marius
    Mangul, Serghei
    Mandoiu, Ion I.
    Zelikovsky, Alex
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2011, 6