Cactus: Algorithms for genome multiple sequence alignment

Cited by: 158
Authors
Paten, Benedict [1]
Earl, Dent [1]
Nguyen, Ngan [1]
Diekhans, Mark [1]
Zerbino, Daniel [1]
Haussler, David [1]
Affiliations
[1] Univ Calif Santa Cruz, Ctr Biomol Sci & Engn, Santa Cruz, CA 95064 USA
Keywords
REARRANGEMENTS; VERTEBRATE; ELEMENTS; BROWSER; GRAPHS; DNA;
DOI
10.1101/gr.123356.111
Chinese Library Classification (CLC) codes
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010; 081704;
Abstract
Much attention has been given to the problem of creating reliable multiple sequence alignments in a model incorporating substitutions, insertions, and deletions. Far less attention has been paid to the problem of optimizing alignments in the presence of more general rearrangement and copy number variation. Using Cactus graphs, recently introduced for representing sequence alignments, we describe two complementary algorithms for creating genomic alignments. We have implemented these algorithms in the new "Cactus" alignment program. We test Cactus using the Evolver genome evolution simulator, a comprehensive new tool for simulation, and show using these and existing simulations that Cactus significantly outperforms all of its peers. Finally, we make an empirical assessment of Cactus's ability to properly align genes and find interesting cases of intra-gene duplication within the primates.
Pages: 1512-1528
Page count: 17