A novel multi-objective genetic algorithm for multiple sequence alignment

被引:4
作者
Kaya, Mehmet [1 ]
Kaya, Buket [2 ]
Alhajj, Reda [3 ]
机构
[1] Firat Univ, Dept Comp Engn, TR-23169 Elazig, Turkey
[2] Firat Univ, Dept Elect & Elect Engn, TR-23169 Elazig, Turkey
[3] Univ Calgary, Dept Comp Sci, Calgary, AB T2N 1N4, Canada
关键词
multiple sequence alignment; multi-objective genetic algorithm; bioinformatics; ACCURACY; STRATEGY;
D O I
10.1504/IJDMB.2016.074684
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
There is no well-accepted theoretical model for multiple sequence alignment. An algorithm is accepted as a good method for multiple sequence alignment if it produces better fitness scores with respect to the benchmark datasets. For this purpose, we propose an efficient method using multi-objective genetic algorithm to discover optimal alignments in multiple sequence data. The main advantage of our approach is that a large number of trade-off alignments can be obtained by a single run with respect to conflicting objectives: alignment length minimisation and similarity and support maximisation. We compare our method with the four well-known multiple sequence alignment methods, MUSCLE, ClustalW, SAGA and MSA-GA. The first two of them are progressive methods, and the other two are based on evolutionary algorithms. Experimental results on the BAliBASE 2.0 database demonstrate that our method found better solutions than the others for most of the cases in terms of accuracy.
引用
收藏
页码:139 / 158
页数:20
相关论文
共 28 条
  • [1] BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations
    Bahr, A
    Thompson, JD
    Thierry, JC
    Poch, O
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (01) : 323 - 326
  • [2] A STRATEGY FOR THE RAPID MULTIPLE ALIGNMENT OF PROTEIN SEQUENCES - CONFIDENCE LEVELS FROM TERTIARY STRUCTURE COMPARISONS
    BARTON, GJ
    STERNBERG, MJE
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1987, 198 (02) : 327 - 337
  • [3] Chellapilla K., 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406), P445, DOI 10.1109/CEC.1999.781958
  • [4] A fast and elitist multiobjective genetic algorithm: NSGA-II
    Deb, K
    Pratap, A
    Agarwal, S
    Meyarivan, T
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2002, 6 (02) : 182 - 197
  • [5] ProbCons: Probabilistic consistency-based multiple sequence alignment
    Do, CB
    Mahabhashyam, MSP
    Brudno, M
    Batzoglou, S
    [J]. GENOME RESEARCH, 2005, 15 (02) : 330 - 340
  • [6] MUSCLE: multiple sequence alignment with high accuracy and high throughput
    Edgar, RC
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 (05) : 1792 - 1797
  • [7] Gondro C, 2007, GENET MOL RES, V6, P964
  • [8] Gupta S K, 1995, J Comput Biol, V2, P459, DOI 10.1089/cmb.1995.2.459
  • [9] THE ALIGNMENT OF SETS OF SEQUENCES AND THE CONSTRUCTION OF PHYLETIC TREES - AN INTEGRATED METHOD
    HOGEWEG, P
    HESPER, B
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 1984, 20 (02) : 175 - 186
  • [10] Holland JH., 1992, ADAPTATION NATURAL A, DOI [10.7551/mitpress/1090.001.0001, DOI 10.7551/MITPRESS/1090.001.0001]