parSRA: A framework for the parallel execution of short read aligners on compute clusters

被引:2
作者
Gonzalez-Dominguez, Jorge [1 ]
Hundt, Christian [2 ]
Schmidt, Bertil [2 ]
机构
[1] Univ A Coruna, Comp Architecture Grp, La Coruna, Spain
[2] Johannes Gutenberg Univ Mainz, Parallel & Distributed Architectures Grp, Mainz, Germany
关键词
Short read alignment; High performance computing; Multicore clusters; Bioinformatics; PGAS; ALIGNMENT; ACCURATE; TOOL;
D O I
10.1016/j.jocs.2017.01.008
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The growth of next generation sequencing datasets poses as a challenge to the alignment of reads to reference genomes in terms of both accuracy and speed. In this work we present parSRA, a parallel framework to accelerate the execution of existing short read aligners on distributed-memory systems. parSRA can be used to parallelize a variety of short read alignment tools installed in the system without any modification to their source code. We show that our framework provides good scalability on a compute cluster for accelerating the popular BWA-MEM and Bowtie2 aligners. On average, it is able to accelerate sequence alignments on 16 64-core nodes (in total, 1024 cores) with speedup of 10.48 compared to the original multithreaded tools running with 64 threads on one node. It is also faster and more scalable than the pMap and BigBWA frameworks. Source code of parSRA in C++ and UPC++ running on Linux systems with support for FUSE is freely available at https://sourceforge.net/projects/parsra/. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:134 / 139
页数:6
相关论文
共 35 条
  • [11] Kassens J. C., 2014, P 15 IEEE INT C CLUS
  • [12] BarraCUDA - A fast short read sequence aligner using graphics processing units
    Klus P.
    Lam S.
    Lyberg D.
    Cheung M.
    Pullan G.
    McFarlane I.
    Yeo G.S.H.
    Lam B.Y.H.
    [J]. BMC Research Notes, 5 (1)
  • [13] Langmead B, 2012, NAT METHODS, V9, P357, DOI [10.1038/NMETH.1923, 10.1038/nmeth.1923]
  • [14] Li H, 2013, PREPRINT, P3997, DOI DOI 10.48550/ARXIV.1303.3997
  • [15] Li H, 2009, BIOINFORMATICS, V25, P1094, DOI [10.1093/bioinformatics/btp100, 10.1093/bioinformatics/btp324]
  • [16] SOAP2: an improved ultrafast tool for short read alignment
    Li, Ruiqiang
    Yu, Chang
    Li, Yingrui
    Lam, Tak-Wah
    Yiu, Siu-Ming
    Kristiansen, Karsten
    Wang, Jun
    [J]. BIOINFORMATICS, 2009, 25 (15) : 1966 - 1967
  • [17] SOAP3: ultra-fast GPU-based parallel alignment tool for short reads
    Liu, Chi-Man
    Wong, Thomas
    Wu, Edward
    Luo, Ruibang
    Yiu, Siu-Ming
    Li, Yingrui
    Wang, Bingqiang
    Yu, Chang
    Chu, Xiaowen
    Zhao, Kaiyong
    Li, Ruiqiang
    Lam, Tak-Wah
    [J]. BIOINFORMATICS, 2012, 28 (06) : 878 - 879
  • [18] Liu YR, 2014, PLOS ONE, V9, DOI [10.1371/journal.pone.0110611, 10.1371/journal.pone.0095043, 10.1371/journal.pone.0086026]
  • [19] CUSHAW2-GPU: Empowering Faster Gapped Short-Read Alignment Using GPU Computing
    Liu, Yongchao
    Schmidt, Bertil
    [J]. IEEE DESIGN & TEST, 2014, 31 (01) : 31 - 39
  • [20] CUSHAW: a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform
    Liu, Yongchao
    Schmidt, Bertil
    Maskell, Douglas L.
    [J]. BIOINFORMATICS, 2012, 28 (14) : 1830 - 1837