Comparisons and performance evaluations of RNA-seq alignment tools

被引:0
作者
Wang, Wei-An [1 ]
Tsai, Mong-Hsun [2 ]
Wu, Chin-Ting [1 ]
Lai, Liang-Chuan [3 ]
Lu, Tzu-Pin [4 ]
Chuang, Eric Y. [1 ]
机构
[1] Natl Taiwan Univ, Grad Inst Biomed Elect & Bioinformat, Taipei, Taiwan
[2] Natl Taiwan Univ, Inst Biotechnol, Taipei, Taiwan
[3] Natl Taiwan Univ, Grad Inst Physiol, Taipei, Taiwan
[4] Natl Taiwan Univ, Grad Inst Epidemiol & Prevent Med, Dept Publ Hlth, Taipei, Taiwan
来源
2014 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND COMPUTER SCIENCE (ICEECS) | 2014年
关键词
next generation sequencing; alignment; RNA-seq; ULTRAFAST;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Next generation sequencing (NGS) has been widely used in biological and medical researches. NGS facilitates identifying mutations and differentially expressed genes that are causative to diseases. However, its high-throughput feature poses major challenges in performing the analysis, especially the alignment step. The alignment step is critical in the NGS gene expression analysis since the correctness of the advanced steps heavily depends on it. Therefore, we evaluated the performances of the four popular alignment algorithms including Tophat, STAR, MapSplice, and GNSAP based on the simulated data generated by Flux-Simulator. Shorter and longer reference genomes and read lengths were evaluated in the four scenarios. Three indices including time cost, alignment accuracy, and junction detection were utilized to consider the performances of different algorithms. The result shows that the Tophat algorithm has highest alignment accuracy and the STAR algorithm is the fastest one with a little lower accuracy. The MapSplice and GNSAP algorithms are not stable as the STAR and Tophat algorithms, and might encounter more problems in doing the alignment.
引用
收藏
页码:215 / 218
页数:4
相关论文
共 8 条
[1]  
[Anonymous], NUCL ACIDS RES
[2]   STAR: ultrafast universal RNA-seq aligner [J].
Dobin, Alexander ;
Davis, Carrie A. ;
Schlesinger, Felix ;
Drenkow, Jorg ;
Zaleski, Chris ;
Jha, Sonali ;
Batut, Philippe ;
Chaisson, Mark ;
Gingeras, Thomas R. .
BIOINFORMATICS, 2013, 29 (01) :15-21
[3]   Gene Expression Omnibus: NCBI gene expression and hybridization array data repository [J].
Edgar, R ;
Domrachev, M ;
Lash, AE .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :207-210
[4]  
Engström PG, 2013, NAT METHODS, V10, P1185, DOI [10.1038/nmeth.2722, 10.1038/NMETH.2722]
[5]   Modelling and simulating generic RNA-Seq experiments with the flux simulator [J].
Griebel, Thasso ;
Zacher, Benedikt ;
Ribeca, Paolo ;
Raineri, Emanuele ;
Lacroix, Vincent ;
Guigo, Roderic ;
Sammeth, Michael .
NUCLEIC ACIDS RESEARCH, 2012, 40 (20) :10073-10083
[6]   TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions [J].
Kim, Daehwan ;
Pertea, Geo ;
Trapnell, Cole ;
Pimentel, Harold ;
Kelley, Ryan ;
Salzberg, Steven L. .
GENOME BIOLOGY, 2013, 14 (04)
[7]   Ultrafast and memory-efficient alignment of short DNA sequences to the human genome [J].
Langmead, Ben ;
Trapnell, Cole ;
Pop, Mihai ;
Salzberg, Steven L. .
GENOME BIOLOGY, 2009, 10 (03)
[8]   Fast and SNP-tolerant detection of complex variants and splicing in short reads [J].
Wu, Thomas D. ;
Nacu, Serban .
BIOINFORMATICS, 2010, 26 (07) :873-881