Simulation-based comprehensive benchmarking of RNA-seq aligners

被引:0
作者
Baruzzo, Giacomo [1 ]
Hayer, Katharina E. [2 ]
Kim, Eun Ji [2 ]
Di Camillo, Barbara [1 ]
FitzGerald, Garret A. [2 ,3 ]
Grant, Gregory R. [2 ,4 ]
机构
[1] Univ Padua, Dept Informat Engn, Padua, Italy
[2] Univ Penn, Inst Translat Med & Therapeut, Philadelphia, PA 19104 USA
[3] Univ Penn, Dept Syst Pharmacol & Translat Therapeut, Philadelphia, PA 19104 USA
[4] Univ Penn, Dept Genet, Philadelphia, PA 19104 USA
关键词
ACCURATE; ALIGNMENT; READS;
D O I
10.1038/NMETH.4106
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Alignment is the first step in most RNA-seq analysis pipelines, and the accuracy of downstream analyses depends heavily on it. Unlike most steps in the pipeline, alignment is particularly amenable to benchmarking with simulated data. We performed a comprehensive benchmarking of 14 common splice-aware aligners for base, read, and exon junction-level accuracy and compared default with optimized parameters. We found that performance varied by genome complexity, and accuracy and popularity were poorly correlated. The most widely cited tool underperforms for most metrics, particularly when using default settings.
引用
收藏
页码:135 / 139
页数:5
相关论文
共 19 条
  • [1] PlasmoDB: a functional genomic database for malaria parasites
    Aurrecoechea, Cristina
    Brestelli, John
    Brunk, Brian P.
    Dommer, Jennifer
    Fischer, Steve
    Gajria, Bindu
    Gao, Xin
    Gingle, Alan
    Grant, Greg
    Harb, Omar S.
    Heiges, Mark
    Innamorato, Frank
    Iodice, John
    Kissinger, Jessica C.
    Kraemer, Eileen
    Li, Wei
    Miller, John A.
    Nayak, Vishal
    Pennington, Cary
    Pinney, Deborah F.
    Roos, David S.
    Ross, Chris
    Stoeckert, Christian J., Jr.
    Treatman, Charles
    Wang, Haiming
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 : D539 - D543
  • [2] Comparing reference-based RNA-Seq mapping methods for non-human primate data
    Benjamin, Ashlee M.
    Nichols, Marshall
    Burke, Thomas W.
    Ginsburg, Geoffrey S.
    Lucas, Joseph E.
    [J]. BMC GENOMICS, 2014, 15
  • [3] ContextMap 2: fast and accurate context-based RNA-seq mapping
    Bonfert, Thomas
    Kirner, Evelyn
    Csaba, Gergely
    Zimmer, Ralf
    Friedel, Caroline C.
    [J]. BMC BIOINFORMATICS, 2015, 16
  • [4] STAR: ultrafast universal RNA-seq aligner
    Dobin, Alexander
    Davis, Carrie A.
    Schlesinger, Felix
    Drenkow, Jorg
    Zaleski, Chris
    Jha, Sonali
    Batut, Philippe
    Chaisson, Mark
    Gingeras, Thomas R.
    [J]. BIOINFORMATICS, 2013, 29 (01) : 15 - 21
  • [5] Engström PG, 2013, NAT METHODS, V10, P1185, DOI [10.1038/NMETH.2722, 10.1038/nmeth.2722]
  • [6] RNA-Seq Gene Profiling - A Systematic Empirical Comparison
    Fonseca, Nuno A.
    Marioni, John
    Brazma, Alvis
    [J]. PLOS ONE, 2014, 9 (09):
  • [7] Tools for mapping high-throughput sequencing data
    Fonseca, Nuno A.
    Rung, Johan
    Brazma, Alvis
    Marioni, John C.
    [J]. BIOINFORMATICS, 2012, 28 (24) : 3169 - 3177
  • [8] Field guide to next-generation DNA sequencers
    Glenn, Travis C.
    [J]. MOLECULAR ECOLOGY RESOURCES, 2011, 11 (05) : 759 - 769
  • [9] Comparative analysis of RNA-Seq alignment algorithms and the RNA-Seq unified mapper (RUM)
    Grant, Gregory R.
    Farkas, Michael H.
    Pizarro, Angel D.
    Lahens, Nicholas F.
    Schug, Jonathan
    Brunk, Brian P.
    Stoeckert, Christian J.
    Hogenesch, John B.
    Pierce, Eric A.
    [J]. BIOINFORMATICS, 2011, 27 (18) : 2518 - 2528
  • [10] Benchmark analysis of algorithms for determining and quantifying full-length mRNA splice forms from RNA-seq data
    Hayer, Katharina E.
    Pizarro, Angel
    Lahens, Nicholas F.
    Hogenesch, John B.
    Grant, Gregory R.
    [J]. BIOINFORMATICS, 2015, 31 (24) : 3938 - 3945