Multiobjective artificial fish swarm algorithm for multiple sequence alignment

被引:16
作者
Dabba, Ali [1 ]
Tari, Abdelkamel [1 ]
Zouache, Djaafar [2 ]
机构
[1] Abderrahmane Mira Univ, Fac Sci, Comp Sci Dept, Bejaia, Algeria
[2] Mohamed Elbachir Elibrahimi Univ, Comp Sci Dept, Bordj Bou Arreridj, Algeria
关键词
Multiple sequence alignment; artificial fish swarm algorithm; bioinspired algorithms; bioinformatics; optimization algorithms; evolutionary algorithm; molecular biology; swarm intelligence; HIDDEN MARKOV-MODELS; GENETIC ALGORITHM; PROTEIN; COFFEE;
D O I
10.1080/03155986.2019.1629782
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multiple sequence alignment (MSA) represents a basic task for many bioinformatics applications. MSA allows finding common conserved regions among various sequences of proteins or DNA. However, to find the optimal multiple sequence alignment, it is necessary to design an efficient exploration approach that could explore a huge number of possible multiple sequence alignments. As well as, it is required to use a powerful evaluation method to assess the biological relevance of these multiple sequence alignment. To address these main problems, this article presents a multiobjective artificial fish swarm algorithm (MOAFS) to solve multiple sequence alignment. MOAFS uses the behaviors of artificial fish swarm algorithm such as the cooperation, decentralization and parallelism to ensure a good trade-off between the exploration and the exploitation of the search space of MSA problem. To preserve the quality and consistency of alignment, two fitness functions have been simultaneously used by the MOAFS algorithm: (i) Weighted Sum of Pairs to determine similar regions horizontally and (ii) Similarity function to determine vertically similar regions between the sequences of an alignment. Following the exploration of space search, the Pareto-optimal set is obtained by MOAFS which performs the optimal multiple sequence alignments for both fitness functions. The performance of MOAFS algorithm has been proved by comparing our algorithm with different progressive alignment methods, and other alignment methods based on evolutionary algorithms with singleobjective and many-objective. The experiment results conducted on BAliBASE 2.0 and BAliBASE 3.0 benchmark confirm that the MOAFS algorithm provides a greater accuracy statistical significance in terms of SP or CS scores.
引用
收藏
页码:38 / 59
页数:22
相关论文
共 50 条
[31]   T-Coffee: A novel method for fast and accurate multiple sequence alignment [J].
Notredame, C ;
Higgins, DG ;
Heringa, J .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 302 (01) :205-217
[32]   Optimizing multiple sequence alignments using a genetic algorithm based on three objectives: structural information, non-gaps percentage and totally conserved columns [J].
Ortuno, Francisco M. ;
Valenzuela, Olga ;
Rojas, Fernando ;
Pomares, Hector ;
Florido, Javier P. ;
Urquiza, Jose M. ;
Rojas, Ignacio .
BIOINFORMATICS, 2013, 29 (17) :2112-2121
[33]   MUMMALS: multiple sequence alignment improved by using hidden Markov models with local structural information [J].
Pei, Jimin ;
Grishin, Nick V. .
NUCLEIC ACIDS RESEARCH, 2006, 34 (16) :4364-4374
[34]  
Rani R.R., 2018, SOFT COMPUTING BIOL, P39
[35]   Probalign: multiple sequence alignment using partition function posterior probabilities [J].
Roshan, Usman ;
Livesay, Dennis R. .
BIOINFORMATICS, 2006, 22 (22) :2715-2721
[36]   Hybrid multiobjective artificial bee colony for multiple sequence alignment [J].
Rubio-Largo, Alvaro ;
Vega-Rodriguez, Miguel A. ;
Gonzalez-Alvarez, David L. .
APPLIED SOFT COMPUTING, 2016, 41 :157-168
[37]   Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega [J].
Sievers, Fabian ;
Wilm, Andreas ;
Dineen, David ;
Gibson, Toby J. ;
Karplus, Kevin ;
Li, Weizhong ;
Lopez, Rodrigo ;
McWilliam, Hamish ;
Remmert, Michael ;
Soeding, Johannes ;
Thompson, Julie D. ;
Higgins, Desmond G. .
MOLECULAR SYSTEMS BIOLOGY, 2011, 7
[38]  
Silva F, 2010, INSTR MEAS TECHN C, P1
[39]   DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment [J].
Subramanian, Amarendran R. ;
Kaufmann, Michael ;
Morgenstern, Burkhard .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2008, 3 (1)
[40]   RBT-GA: a novel metaheuristic for solving the multiple sequence alignment problem [J].
Taheri, Javid ;
Zomaya, Albert Y. .
BMC GENOMICS, 2009, 10