ALF-A Simulation Framework for Genome Evolution

被引:88
作者
Dalquen, Daniel A. [1 ,2 ]
Anisimova, Maria [1 ,2 ]
Gonnet, Gaston H. [1 ,2 ]
Dessimoz, Christophe [1 ,2 ]
机构
[1] ETH, Dept Comp Sci, Computat Biochem Res Grp, Zurich, Switzerland
[2] Swiss Inst Bioinformat, Zurich, Switzerland
基金
瑞士国家科学基金会;
关键词
simulation; genome evolution; codon models; indel; lateral gene transfer; GC-content amelioration; MONTE-CARLO-SIMULATION; PROTEIN-SEQUENCE EVOLUTION; MAXIMUM-LIKELIHOOD; PHYLOGENETIC TREES; MITOCHONDRIAL-DNA; PSEQ-GEN; SEQ-GEN; MODEL; SITES; ALGORITHM;
D O I
10.1093/molbev/msr268
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In computational evolutionary biology, verification and benchmarking is a challenging task because the evolutionary history of studied biological entities is usually not known. Computer programs for simulating sequence evolution in silico have shown to be viable test beds for the verification of newly developed methods and to compare different algorithms. However, current simulation packages tend to focus either on gene-level aspects of genome evolution such as character substitutions and insertions and deletions (indels) or on genome-level aspects such as genome rearrangement and speciation events. Here, we introduce Artificial Life Framework (ALF), which aims at simulating the entire range of evolutionary forces that act on genomes: nucleotide, codon, or amino acid substitution (under simple or mixture models), indels, GC-content amelioration, gene duplication, gene loss, gene fusion, gene fission, genome rearrangement, lateral gene transfer (LGT), or speciation. The other distinctive feature of ALF is its user-friendly yet powerful web interface. We illustrate the utility of ALF with two possible applications: 1) we reanalyze data from a study of selection after globin gene duplication and test the statistical significance of the original conclusions and 2) we demonstrate that LGT can dramatically decrease the accuracy of two well-established orthology inference methods. ALF is available as a stand-alone application or via a web interface at http://www.cbrg.ethz.ch/alf.
引用
收藏
页码:1115 / 1123
页数:9
相关论文
共 62 条
  • [1] Phylogenetic and Functional Assessment of Orthologs Inference Projects and Methods
    Altenhoff, Adrian M.
    Dessimoz, Christophe
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (01)
  • [2] Investigating Protein-Coding Sequence Evolution with Probabilistic Codon Substitution Models
    Anisimova, Maria
    Kosiol, Carolin
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2009, 26 (02) : 255 - 271
  • [3] Evolution of protein domain promiscuity in eukaryotes
    Basu, Malay Kumar
    Carmel, Liran
    Rogozin, Igor B.
    Koonin, Eugene V.
    [J]. GENOME RESEARCH, 2008, 18 (03) : 449 - 461
  • [4] A simulation test bed for hypotheses of genome evolution
    Beiko, Robert G.
    Charlebois, Robert L.
    [J]. BIOINFORMATICS, 2007, 23 (07) : 825 - 831
  • [5] EMPIRICAL AND STRUCTURAL MODELS FOR INSERTIONS AND DELETIONS IN THE DIVERGENT EVOLUTION OF PROTEINS
    BENNER, SA
    COHEN, MA
    GONNET, GH
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1993, 229 (04) : 1065 - 1082
  • [6] A maximum likelihood method for detecting functional divergence at individual codon sites, with application to gene family evolution
    Bielawski, JP
    Yang, ZH
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 2004, 59 (01) : 121 - 132
  • [7] Conceptual framework and pilot study to benchmark phylogenomic databases based on reference gene trees
    Boeckmann, Brigitte
    Robinson-Rechavi, Marc
    Xenarios, Ioannis
    Dessimoz, Christophe
    [J]. BRIEFINGS IN BIOINFORMATICS, 2011, 12 (05) : 423 - 435
  • [8] DNA assembly with gaps (Dawg): simulating sequence evolution
    Cartwright, RA
    [J]. BIOINFORMATICS, 2005, 21 : 31 - 38
  • [9] Fregene: Simulation of realistic sequence-level data in populations and ascertained samples
    Chadeau-Hyam, Marc
    Hoggart, Clive J.
    O'Reilly, Paul F.
    Whittaker, John C.
    De Iorio, Maria
    Balding, David J.
    [J]. BMC BIOINFORMATICS, 2008, 9 (1)
  • [10] Empirical analysis of protein insertions and deletions determining parameters for the correct placement of gaps in protein sequence alignments
    Chang, MSS
    Benner, SA
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2004, 341 (02) : 617 - 631