Multiobjective artificial fish swarm algorithm for multiple sequence alignment

被引:16
作者
Dabba, Ali [1 ]
Tari, Abdelkamel [1 ]
Zouache, Djaafar [2 ]
机构
[1] Abderrahmane Mira Univ, Fac Sci, Comp Sci Dept, Bejaia, Algeria
[2] Mohamed Elbachir Elibrahimi Univ, Comp Sci Dept, Bordj Bou Arreridj, Algeria
关键词
Multiple sequence alignment; artificial fish swarm algorithm; bioinspired algorithms; bioinformatics; optimization algorithms; evolutionary algorithm; molecular biology; swarm intelligence; HIDDEN MARKOV-MODELS; GENETIC ALGORITHM; PROTEIN; COFFEE;
D O I
10.1080/03155986.2019.1629782
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multiple sequence alignment (MSA) represents a basic task for many bioinformatics applications. MSA allows finding common conserved regions among various sequences of proteins or DNA. However, to find the optimal multiple sequence alignment, it is necessary to design an efficient exploration approach that could explore a huge number of possible multiple sequence alignments. As well as, it is required to use a powerful evaluation method to assess the biological relevance of these multiple sequence alignment. To address these main problems, this article presents a multiobjective artificial fish swarm algorithm (MOAFS) to solve multiple sequence alignment. MOAFS uses the behaviors of artificial fish swarm algorithm such as the cooperation, decentralization and parallelism to ensure a good trade-off between the exploration and the exploitation of the search space of MSA problem. To preserve the quality and consistency of alignment, two fitness functions have been simultaneously used by the MOAFS algorithm: (i) Weighted Sum of Pairs to determine similar regions horizontally and (ii) Similarity function to determine vertically similar regions between the sequences of an alignment. Following the exploration of space search, the Pareto-optimal set is obtained by MOAFS which performs the optimal multiple sequence alignments for both fitness functions. The performance of MOAFS algorithm has been proved by comparing our algorithm with different progressive alignment methods, and other alignment methods based on evolutionary algorithms with singleobjective and many-objective. The experiment results conducted on BAliBASE 2.0 and BAliBASE 3.0 benchmark confirm that the MOAFS algorithm provides a greater accuracy statistical significance in terms of SP or CS scores.
引用
收藏
页码:38 / 59
页数:22
相关论文
共 50 条
[1]   Heuristics for multiobjective multiple sequence alignment [J].
Abbasi, Maryam ;
Paquete, Luis ;
Pereira, Francisco B. .
BIOMEDICAL ENGINEERING ONLINE, 2016, 15
[2]  
[Anonymous], 2009, ADV MULTIOBJECTIVE N
[3]  
[Anonymous], 1999, P 1 C EV COMP 6 9 JU
[4]   MULTIPLE SEQUENCE ALIGNMENT [J].
BACON, DJ ;
ANDERSON, WF .
JOURNAL OF MOLECULAR BIOLOGY, 1986, 191 (02) :153-161
[5]   BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations [J].
Bahr, A ;
Thompson, JD ;
Thierry, JC ;
Poch, O .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :323-326
[6]   Fast Statistical Alignment [J].
Bradley, Robert K. ;
Roberts, Adam ;
Smoot, Michael ;
Juvekar, Sudeep ;
Do, Jaeyoung ;
Dewey, Colin ;
Holmes, Ian ;
Pachter, Lior .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (05)
[7]  
Chao K-M, 2008, SEQUENCE COMP THEORY, V7
[8]   Protein multiple sequence alignment by hybrid bio-inspired algorithms [J].
Cutello, Vincenzo ;
Nicosia, Giuseppe ;
Pavone, Mario ;
Prizzi, Igor .
NUCLEIC ACIDS RESEARCH, 2011, 39 (06) :1980-1992
[9]  
Dayhoff M. O., 1979, ATLAS PROTEIN SEQ S3, V5, P345
[10]  
Derrien V, 2008, THESIS