Two Hybrid Algorithms for Multiple Sequence Alignment

被引:0
|
作者
Naznin, Farhana [1 ]
Sarker, Ruhul [1 ]
Essam, Daryl [1 ]
机构
[1] Univ New S Wales, Australian Def Force Acad, Canberra, ACT 2600, Australia
关键词
Progressive Alignment; Multiple Sequence Alignment (MSA); Dynamic Programming (DP); Guide-tree; Genetic Algorithm (GA); PHYLOGENETIC TREES; ACCURACY;
D O I
10.1063/1.3314271
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In order to design life saving drugs, such as cancer drugs, the design of Protein or DNA structures has to be accurate. These structures depend on Multiple Sequence Alignment (MSA). MSA is used to find the accurate structure of Protein and DNA sequences from existing approximately correct sequences. To overcome the overly greedy nature of the well known global progressive alignment method for multiple sequence alignment, we have proposed two different algorithms in this paper; one is using an iterative approach with a progressive alignment method (PAMIM) and the second one is using a genetic algorithm with a progressive alignment method (PAMGA). Both of our methods started with a "kmer" distance table to generate single guide-tree. In the iterative approach, we have introduced two new techniques: the first technique is to generate Guide-trees with randomly selected sequences and the second is of shuffling the sequences inside that tree. The output of the tree is a multiple sequence alignment which has been evaluated by the Sum of Pairs Method (SPM) considering the real value data from PAM250. In our second GA approach, these two techniques are used to generate an initial population and also two different approaches of genetic operators are implemented in crossovers and mutation. To test the performance of our two algorithms, we have compared these with the existing well known methods: T-Coffee, MUSCEL, MAFFT and Probcon, using BAliBase benchmarks. The experimental results show that the first algorithm works well for some situations, where other existing methods face difficulties in obtaining better solutions. The proposed second method works well compared to the existing methods for all situations and it shows better performance over the first one.
引用
收藏
页码:69 / 83
页数:15
相关论文
共 50 条
  • [1] Protein multiple sequence alignment by hybrid bio-inspired algorithms
    Cutello, Vincenzo
    Nicosia, Giuseppe
    Pavone, Mario
    Prizzi, Igor
    NUCLEIC ACIDS RESEARCH, 2011, 39 (06) : 1980 - 1992
  • [2] Multiple Sequence Alignment with Genetic Algorithms
    Botta, Marco
    Negro, Guido
    COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, 2010, 6160 : 206 - 214
  • [3] Multiple sequence alignment: Algorithms and applications
    Gotoh, O
    ADVANCES IN BIOPHYSICS, VOL 36, 1999, 1999, 36 : 159 - 206
  • [4] Approximation algorithms for multiple sequence alignment
    Bafna, V
    Lawler, EL
    Pevzner, PA
    THEORETICAL COMPUTER SCIENCE, 1997, 182 (1-2) : 233 - 244
  • [5] Recent evolutions of multiple sequence alignment algorithms
    Notredame, Cedric
    PLOS COMPUTATIONAL BIOLOGY, 2007, 3 (08) : 1405 - 1408
  • [6] Algorithms for loosely constrained multiple sequence alignment
    Bin, S
    Zhou, FF
    Chen, GL
    COMPUTATIONAL AND INFORMATION SCIENCE, PROCEEDINGS, 2004, 3314 : 213 - 218
  • [7] Cactus: Algorithms for genome multiple sequence alignment
    Paten, Benedict
    Earl, Dent
    Ngan Nguyen
    Diekhans, Mark
    Zerbino, Daniel
    Haussler, David
    GENOME RESEARCH, 2011, 21 (09) : 1512 - 1528
  • [8] Instability in progressive multiple sequence alignment algorithms
    Boyce, Kieran
    Sievers, Fabian
    Higgins, Desmond G.
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2015, 10
  • [9] Instability in progressive multiple sequence alignment algorithms
    Kieran Boyce
    Fabian Sievers
    Desmond G. Higgins
    Algorithms for Molecular Biology, 10
  • [10] New algorithms for multiple DNA sequence alignment
    Brown, DG
    Hudek, AK
    ALGORITHMS IN BIOINFORMATICS, PROCEEDINGS, 2004, 3240 : 314 - +