Next-generation sequencing: big data meets high performance computing

被引:2
作者
Schmidt, Bertil [1 ]
Hildebrandt, Andreas [1 ]
机构
[1] Johannes Gutenberg Univ Mainz, Inst Informat, Mainz, Germany
关键词
FPGA-BASED ACCELERATION; MEMORY-EFFICIENT; ERROR-CORRECTION; LARGE GENOMES; PARALLEL; ACCURATE; ALIGNMENT; MAPREDUCE; ALGORITHM; STRATEGY;
D O I
暂无
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
The progress of next-generation sequencing has a major impact on medical and genomic research. This high-throughput technology can now produce billions of short DNA or RNA fragments in excess of a few terabytes of data in a single run. This leads to massive datasets used by a wide range of applications including personalized cancer treatment and precision medicine. In addition to the hugely increased throughput, the cost of using high-throughput technologies has been dramatically decreasing. A low sequencing cost of around US$1000 per genome has now rendered large population-scale projects feasible. However, to make effective use of the produced data, the design of big data algorithms and their efficient implementation on modern high performance computing systems is required.
引用
收藏
页码:712 / 717
页数:6
相关论文
共 80 条
[1]  
Abouelhoda M. I., 2004, Journal of Discrete Algorithms, V2, P53, DOI 10.1016/S1570-8667(03)00065-0
[2]   SparkBWA: Speeding Up the Alignment of High-Throughput DNA Sequencing Data [J].
Abuin, Jose M. ;
Pichel, Juan C. ;
Pena, Tomas F. ;
Amigo, Jorge .
PLOS ONE, 2016, 11 (05)
[3]   BigBWA: approaching the Burrows-Wheeler aligner to Big Data technologies [J].
Abuin, Jose M. ;
Pichel, Juan C. ;
Pena, Tomas F. ;
Amigo, Jorge .
BIOINFORMATICS, 2015, 31 (24) :4003-4005
[4]   Sigma: Strain-level inference of genomes from metagenomic analysis for biosurveillance [J].
Ahn, Tae-Hyuk ;
Chai, Juanjuan ;
Pan, Chongle .
BIOINFORMATICS, 2015, 31 (02) :170-177
[5]   Karect: accurate correction of substitution, insertion and deletion errors for next-generation sequencing data [J].
Allam, Amin ;
Kalnis, Panos ;
Solovyev, Victor .
BIOINFORMATICS, 2015, 31 (21) :3421-3428
[6]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[7]  
[Anonymous], 2012, 22nd International Conference on Field Programmable Logic and Applications (FPL), DOI DOI 10.1109/FPL.2012.6339272
[8]   Assembling large genomes with single-molecule sequencing and locality-sensitive hashing [J].
Berlin, Konstantin ;
Koren, Sergey ;
Chin, Chen-Shan ;
Drake, James P. ;
Landolin, Jane M. ;
Phillippy, Adam M. .
NATURE BIOTECHNOLOGY, 2015, 33 (06) :623-+
[9]   SPACE/TIME TRADE/OFFS IN HASH CODING WITH ALLOWABLE ERRORS [J].
BLOOM, BH .
COMMUNICATIONS OF THE ACM, 1970, 13 (07) :422-&
[10]   Near-optimal probabilistic RNA-seq quantification (vol 34, pg 525, 2016) [J].
Bray, Nicolas L. ;
Pimentel, Harold ;
Melsted, Pall ;
Pachter, Lior .
NATURE BIOTECHNOLOGY, 2016, 34 (08) :888-888