Developing new genetic algorithm based on integer programming for multiple sequence alignment

被引:0
作者
S. Ali Lajevardy
Mehrdad Kargari
机构
[1] Tarbiat Modares University,Faculty of Industrial Engineering & Systems
来源
Soft Computing | 2022年 / 26卷
关键词
Multiple sequence alignment; Mathematical modeling; Bioinformatics; Integer programming; Genetic algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
Molecular biology advances in the past few decades have contributed to the rapid increase in genome sequencing of various organisms; sequence alignment is usually considered the first step in understanding a sequence's molecular function. Understanding such a function is made possible by aligning an unknown sequence with known sequences based on evolution. An optimal alignment adjusts two or more sequences in a way that it could compare the maximum number of identical or similar residues. The two sequence alignments types are Pairwise Sequence Alignment (PSA) and Multiple Sequence Alignment (MSA). The MSA enjoys a higher advantage than PSA since it predicts the similarity in a family of similar sequences, providing more biological data. Moreover, while the dynamic programming (DP) technique is used in PSA to provide the optimal method, it will lead to more complexity if used in MSA. So, the MSA mainly uses heuristic and approximation methods. Such methods include progressive alignment, iterative alignment, Hidden Markov Model (HMM), and Metaheuristic algorithms. This paper presents a genetic algorithm and chromosome design to solve a mathematical model for MSA. This model uses a basis for an optimal solution in different ways, and an X-mediated matrix with binary elements is used to model the sequence. Then, the model is implemented using the Genetic Algorithm method on the web, and the final results indicate the success of GA for the MSA approach.
引用
收藏
页码:3863 / 3870
页数:7
相关论文
共 108 条
[1]  
Altschul SF(1989)Trees, stars and multiple sequence alignment SIAM J Appl Math 49 197-209
[2]  
Lipman DJ(2007)DNA reference alignment benchmarks based on teritary structure of encoded proteins Bioinformatics 23 2648-2649
[3]  
Carroll H(2017)A review on multiple sequence alignment from the perspective of genetic algorithm Genomics 109 419-431
[4]  
Beckstead W(2008)Protein multiple sequence alignment Methods Mol Biol 484 379-413
[5]  
O'Connor T(2004)MUSCLE: multiple sequence alignment with high accuracy and high throughput Nucleic Acids Res 32 1792-1797
[6]  
Ebbert M(1987)Progressive sequence alignment as a prerequisite to correct phylogenetic trees J Mol Evol 25 351-360
[7]  
Clement M(2004)Combining partial order alignment and progressive multiple sequence alignment increases alignment speed and scalability to very large alignment problems Bioinformatics 20 1546-1556
[8]  
Snell Q(1992)Amino acid substitution matrices from protein blocks Proc Natl Acad Sci 89 10915-10919
[9]  
McClellan D(1988)CLUSTAL: a package for performing multiple sequence alignment on a microcomputer Gene 73 237-244
[10]  
Chowdhury B(1995)Comprehensive study on iterative algorithms of multiple sequence alignment Comput Appl Biosci 11 13-18