Two Hybrid Algorithms for Multiple Sequence Alignment

被引:0
作者
Naznin, Farhana [1 ]
Sarker, Ruhul [1 ]
Essam, Daryl [1 ]
机构
[1] Univ New S Wales, Australian Def Force Acad, Canberra, ACT 2600, Australia
来源
2009 INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL MODELS FOR LIFE SCIENCES (CMLS '09) | 2010年 / 1210卷
关键词
Progressive Alignment; Multiple Sequence Alignment (MSA); Dynamic Programming (DP); Guide-tree; Genetic Algorithm (GA); PHYLOGENETIC TREES; ACCURACY;
D O I
10.1063/1.3314271
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In order to design life saving drugs, such as cancer drugs, the design of Protein or DNA structures has to be accurate. These structures depend on Multiple Sequence Alignment (MSA). MSA is used to find the accurate structure of Protein and DNA sequences from existing approximately correct sequences. To overcome the overly greedy nature of the well known global progressive alignment method for multiple sequence alignment, we have proposed two different algorithms in this paper; one is using an iterative approach with a progressive alignment method (PAMIM) and the second one is using a genetic algorithm with a progressive alignment method (PAMGA). Both of our methods started with a "kmer" distance table to generate single guide-tree. In the iterative approach, we have introduced two new techniques: the first technique is to generate Guide-trees with randomly selected sequences and the second is of shuffling the sequences inside that tree. The output of the tree is a multiple sequence alignment which has been evaluated by the Sum of Pairs Method (SPM) considering the real value data from PAM250. In our second GA approach, these two techniques are used to generate an initial population and also two different approaches of genetic operators are implemented in crossovers and mutation. To test the performance of our two algorithms, we have compared these with the existing well known methods: T-Coffee, MUSCEL, MAFFT and Probcon, using BAliBase benchmarks. The experimental results show that the first algorithm works well for some situations, where other existing methods face difficulties in obtaining better solutions. The proposed second method works well compared to the existing methods for all situations and it shows better performance over the first one.
引用
收藏
页码:69 / 83
页数:15
相关论文
共 14 条