Blast-Parallel: The Parallelizing Implementation Of Sequence Alignment Algorithms Based On Hadoop Platform

被引:0
|
作者
Meng, Ming [1 ]
Gao, Jing [1 ]
Chen, Jun-jie [1 ]
机构
[1] Inner Mongolia Agr Univ, Coll Comp & Informat Engn, Hohhot, Peoples R China
来源
PROCEEDINGS OF THE 2013 6TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS (BMEI 2013), VOLS 1 AND 2 | 2013年
关键词
Sequence alignment; Blast; Hadoop; parallelization;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
The sequence alignment is a basic method for processing the information in Bioinformatics, it has a great significance for finding the function and the structure of nucleic acids and protein sequences and the information of evolution. This paper briefly describes the relevant issues of sequence alignment and the most common local sequence alignment algorithms, Blast algorithm. At present, the Blast algorithm which provided by NCBI or stand-alone can not meet the actual demand for the flood of biological data, this paper achieves the Blast-Parallel algorithm by further improvement based on the Hadoop-Blast algorithm. Through serial experiments of the stand-alone Blast algorithm and parallelizing experiments of the Hadoop-Blast algorithm and the Blast-Parallel algorithm based on Hadoop platform, results show that the Blast algorithm has significantly higher execution efficiency after the parallelization, and the matching speed of the Blast-Parallel algorithm which has been improved can achieve 1 similar to 1.5 times of the Hadoop-Blast algorithm.
引用
收藏
页码:465 / 470
页数:6
相关论文
共 30 条
  • [21] Multithreaded Parallel Sequence Alignment Based on Needleman-Wunsch Algorithm
    Gancheva, Veska
    Georgiev, Ivaylo
    2019 IEEE 19TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2019, : 165 - 169
  • [22] Energy Analysis and Application of Data Mining Algorithms for Internet of Things Based on Hadoop Cloud Platform
    Zheng, Yuanpan
    Chen, Guangyu
    IEEE ACCESS, 2019, 7 : 183195 - 183206
  • [23] Implementation of Time Series Data Clustering Based on SVD for Stock Data Analysis on Hadoop Platform
    Xie, Yonghong
    Wulamu, Aziguli
    Wang, Yantao
    Liu, Zheng
    PROCEEDINGS OF THE 2014 9TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2014, : 2007 - 2010
  • [24] A Distributed Inverse Distance Weighted Interpolation Algorithm Based on the Cloud Computing Platform of Hadoop and Its Implementation
    Xu, Zhong
    Guan, Jihong
    Zhou, Jiaogen
    2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 2412 - 2416
  • [25] A Performance Comparison of Big Data Processing Platform Based on Parallel Clustering Algorithms
    Hai, Mo
    Zhang, Yuejing
    Li, Haifeng
    6TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, 2018, 139 : 127 - 135
  • [26] A Review on Sequence Alignment Algorithms for Short Reads Based on Next-Generation Sequencing
    Kim, Jeongkyu
    Ji, Mingeun
    Yi, Gangman
    IEEE ACCESS, 2020, 8 : 189811 - 189822
  • [27] Hypergraph Partitioning Implementation for Parallelizing Matrix-Vector Multiplication Using CUDA GPU-Based Parallel Computing
    Murni
    Bustamam, A.
    Ernastuti
    Handhika, T.
    Kerami, D.
    INTERNATIONAL SYMPOSIUM ON CURRENT PROGRESS IN MATHEMATICS AND SCIENCES 2016 (ISCPMS 2016), 2017, 1862
  • [28] Research on Genetic and Simulated Annealing Algorithm for Multiple Sequence Alignment Based on Hybrid Parallel Computation
    Li, Longsheng
    Liu, Yu
    PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL AND ELECTRICAL ENGINEERING (AMEE 2017), 2017, 87 : 205 - 208
  • [29] Cloud-Coffee: implementation of a parallel consistency-based multiple alignment algorithm in the T-Coffee package and its benchmarking on the Amazon Elastic-Cloud
    Di Tommaso, Paolo
    Orobitg, Miquel
    Guirado, Fernando
    Cores, Fernado
    Espinosa, Toni
    Notredame, Cedric
    BIOINFORMATICS, 2010, 26 (15) : 1903 - 1904
  • [30] Performance Evaluation of AI Based Load Balancing Algorithm (Reinforcement Learning) with other load balancing algorithms in a JPPF Grid: E.coli Genome Sequence Alignment Problem
    Sinha, Subrata
    Hazarika, Abinash
    Johari, Surabhi
    2018 INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND SYSTEMS BIOLOGY (BSB), 2018, : 64 - 66