High-performance genome sorting program

被引:1
作者
Kasilov, Vasily [1 ]
Drobintsev, Pavel [1 ]
Voinov, Nikita [1 ]
机构
[1] Peter Great St Petersburg Polytech Univ, Polytech Skaya 29, St Petersburg 195251, Russia
来源
10TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE (YSC2021) | 2021年 / 193卷
关键词
genome; sorting algorithm; alignment; BAM and SAM files; OpenMP; HPC;
D O I
10.1016/j.procs.2021.10.048
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper is devoted to the practical application of parallel sorting algorithms and parallel input-output methods for the problem of genome alignment. The paper considers different approaches to the implementation of such algorithms, taking into account the capabilities of high-performance systems. Main purpose of the work is to develop a genome sorting program, the efficiency of which significantly exceeds the efficiency of free software analogues. The genome sorting program is implemented for a supercomputer using the C++ language and the OpenMP and OpenMPI. The developed program demonstrates a significant increase in the speed of operation (up to 10 times) compared to free software analogues due to massive parallel data input and output. Different approaches for data input/output parallelization and data processing considered in the paper can be applied in other subject areas. (C) 2021 The Authors. Published by Elsevier B.V.
引用
收藏
页码:464 / 473
页数:10
相关论文
共 11 条
[1]  
[Anonymous], 2021, SEQUENCE ALIGNMENTMA
[2]  
Batcher KE., 1968, P APR 30 MAY 2 1968, P307, DOI DOI 10.1145/1468075.1468121
[3]   The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants [J].
Cock, Peter J. A. ;
Fields, Christopher J. ;
Goto, Naohisa ;
Heuer, Michael L. ;
Rice, Peter M. .
NUCLEIC ACIDS RESEARCH, 2010, 38 (06) :1767-1771
[4]   Computational Biology Methods and Their Application to the Comparative Genomics of Endocellular Symbiotic Bacteria of Insects [J].
Commins, Jennifer ;
Toft, Christina ;
Fares, Mario A. .
BIOLOGICAL PROCEDURES ONLINE, 2009, 11 (01) :52-78
[5]  
[Дедов Иван Иванович Dedov Ivan I.], 2019, [Вестник Российской академии медицинских наук, Annals of the Russian Academy of Medical Sciences, Vestnik Rossiiskoi akademii meditsinskikh nauk], V74, P61, DOI 10.15690/vramn1108
[6]  
Dershowitz Nachum, 1989, FDN ORG ALGORITHMS
[7]   Drug development in the era of precision medicine [J].
Dugger, Sarah A. ;
Platt, Adam ;
Goldstein, David B. .
NATURE REVIEWS DRUG DISCOVERY, 2018, 17 (03) :183-196
[8]  
Iakobovskii Mikhail V, 2004, LYUDMILA UVAROVA MAT, P153
[9]  
Sedgewick R., 1998, Algorithms in C-Parts 1-4: Fundamentals, Data Structures, Sorting, Searching, V3rd
[10]  
Supercomputer Center, 2017, POL TECHN BAS ADV TR