Parallel biological sequence comparison using prefix computations

被引:41
作者
Aluru, S
Futamura, N
Mehrotra, K
机构
[1] Iowa State Univ, Dept Elect & Comp Engn, Ames, IA 50011 USA
[2] Wright State Univ, Dept Comp Sci, Dayton, OH 45435 USA
[3] Syracuse Univ, Sch EECS, Syracuse, NY 13244 USA
基金
美国国家科学基金会;
关键词
computational biology; sequence alignments; parallel prefix; space-efficient; parallel algorithms;
D O I
10.1016/S0743-7315(03)00010-8
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present practical parallel algorithms using prefix computations for various problems that arise in pairwise comparison of biological sequences. We consider both constant and affine gap penalty functions, full-sequence and subsequence matching, and space-saving algorithms. Commonly used sequential algorithms solve the sequence comparison problems in O(mn) time and O(m + n) space, where m and n are the lengths of the sequences being compared. All the algorithms presented in this paper are time optimal with respect to the sequential algorithms and can use O((log n)-(n)) processors where n is the length of the larger sequence. While optimal parallel algorithms for many of these problems are known, we use a simple framework and demonstrate how these problems can be solved systematically using repeated parallel prefix operations. We also present a space-saving algorithm that uses O(m + (p)-(n)) space and runs in optimal time where p is the number of the processors used. We implemented the parallel space-saving algorithm and provide experimental results on an IBM SP-2 and a Pentium cluster. (C) 2003 Elsevier Science (USA). All rights reserved.
引用
收藏
页码:264 / 272
页数:9
相关论文
共 17 条
[1]  
[Anonymous], 1978, Atlas of protein sequence and structure
[2]   EFFICIENT PARALLEL ALGORITHMS FOR STRING EDITING AND RELATED PROBLEMS [J].
APOSTOLICO, A ;
ATALLAH, MJ ;
LARMORE, LL ;
MCFADDIN, S .
SIAM JOURNAL ON COMPUTING, 1990, 19 (05) :968-988
[3]  
Edmiston E., 1987, Proceedings of the 1987 International Conference on Parallel Processing, P78
[4]   PARALLEL PROCESSING OF BIOLOGICAL SEQUENCE COMPARISON ALGORITHMS [J].
EDMISTON, EW ;
CORE, NG ;
SALTZ, JH ;
SMITH, RM .
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 1988, 17 (03) :259-275
[5]   AN IMPROVED ALGORITHM FOR MATCHING BIOLOGICAL SEQUENCES [J].
GOTOH, O .
JOURNAL OF MOLECULAR BIOLOGY, 1982, 162 (03) :705-708
[6]   AMINO-ACID SUBSTITUTION MATRICES FROM PROTEIN BLOCKS [J].
HENIKOFF, S ;
HENIKOFF, JG .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (22) :10915-10919
[7]   LINEAR SPACE ALGORITHM FOR COMPUTING MAXIMAL COMMON SUBSEQUENCES [J].
HIRSCHBERG, DS .
COMMUNICATIONS OF THE ACM, 1975, 18 (06) :341-343
[8]   A SPACE-EFFICIENT PARALLEL SEQUENCE COMPARISON ALGORITHM FOR A MESSAGE-PASSING MULTIPROCESSOR [J].
HUANG, XQ .
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 1989, 18 (03) :223-239
[9]  
HUAUG X, 1990, COMPUT APPL BIOSCI, V6, P373
[10]  
Kumar V., 1994, INTRO PARALLEL COMPU, V400