SEAL: a distributed short read mapping and duplicate removal tool

Cited by: 85
Authors
Pireddu, Luca [1]
Leo, Simone [1]
Zanetti, Gianluigi [1]
Affiliations
[1] Polaris, CRS4, I-09010 Pula, Italy
Keywords
ALIGNMENT;
DOI
10.1093/bioinformatics/btr325
Chinese Library Classification
Q5 [Biochemistry];
Subject classification codes
071010; 081704;
Abstract
SEAL is a scalable tool for short read pair mapping and duplicate removal. It computes mappings that are consistent with those produced by BWA and removes duplicates according to the same criteria employed by Picard MarkDuplicates. On a 16-node Hadoop cluster, it is capable of processing about 13 GB per hour in map+rmdup mode, while reaching a throughput of 19 GB per hour in mapping-only mode.
Pages: 2159-2160
Page count: 2