SparkEC: speeding up alignment-based DNA error correction tools

被引:0
作者
Roberto R. Expósito
Marco Martínez-Sánchez
Juan Touriño
机构
[1] Universidade da Coruña,
[2] CITIC,undefined
[3] Computer Architecture Group,undefined
来源
BMC Bioinformatics | / 23卷
关键词
Error correction; Big data; Distributed processing; Apache Spark;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 86 条
  • [1] van Dijk EL(2014)Ten years of next-generation sequencing technology Trends Genet 30 418-426
  • [2] Auger H(2016)Objective review of de novo stand-alone error correction methods for NGS data WIREs Comput Mol Sci 6 111-146
  • [3] Jaszczyszyn Y(2017)Evaluation of the impact of Illumina error correction tools on de novo genome assembly BMC Bioinform 18 374-30
  • [4] Thermes C(2008)Google’s MapReduce programming model-Revisited Sci Comput Program 70 1-65
  • [5] Alic AS(2016)Apache spark: a unified engine for big data processing Commun ACM 59 56-66
  • [6] Ruzafa D(2013)A survey of error-correction methods for next-generation sequencing Brief Bioinform 14 56-373
  • [7] Dopazo J(2006)Multiple sequence alignment Curr Opin Struct Biol 16 368-4005
  • [8] Blanquer I(2015)BigBWA: approaching the burrows-wheeler aligner to big data technologies Bioinformatics 31 4003-21
  • [9] Heydari M(2016)SparkBWA: speeding up the alignment of high-throughput DNA sequencing data PLoS ONE 11 1-2764
  • [10] Miclotte G(2017)MarDRe: efficient MapReduce-based removal of duplicate DNA reads in the cloud Bioinformatics 33 2762-25