MetaPIGA v2.0: maximum likelihood large phylogeny estimation using the metapopulation genetic algorithm and other stochastic heuristics

被引:92
作者
Helaers, RaphaeL [2 ]
Milinkovitch, Michel C. [1 ]
机构
[1] Dept Genet & Evolut Sci III, LANE, CH-1211 Geneva 4, Switzerland
[2] Fac Univ Notre Dame Paix, Dept Biol, B-5000 Namur, Belgium
基金
瑞士国家科学基金会;
关键词
ANCESTRAL PROTEIN RECONSTRUCTION; DNA-SEQUENCES; INFERENCE;
D O I
10.1186/1471-2105-11-379
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The development, in the last decade, of stochastic heuristics implemented in robust application softwares has made large phylogeny inference a key step in most comparative studies involving molecular sequences. Still, the choice of a phylogeny inference software is often dictated by a combination of parameters not related to the raw performance of the implemented algorithm(s) but rather by practical issues such as ergonomics and/or the availability of specific functionalities. Results: Here, we present MetaPIGA v2.0, a robust implementation of several stochastic heuristics for large phylogeny inference (under maximum likelihood), including a Simulated Annealing algorithm, a classical Genetic Algorithm, and the Metapopulation Genetic Algorithm (metaGA) together with complex substitution models, discrete Gamma rate heterogeneity, and the possibility to partition data. MetaPIGA v2.0 also implements the Likelihood Ratio Test, the Akaike Information Criterion, and the Bayesian Information Criterion for automated selection of substitution models that best fit the data. Heuristics and substitution models are highly customizable through manual batch files and command line processing. However, MetaPIGA v2.0 also offers an extensive graphical user interface for parameters setting, generating and running batch files, following run progress, and manipulating result trees. MetaPIGA v2.0 uses standard formats for data sets and trees, is platform independent, runs in 32 and 64-bits systems, and takes advantage of multiprocessor and multicore computers. Conclusions: The metaGA resolves the major problem inherent to classical Genetic Algorithms by maintaining high inter-population variation even under strong intra-population selection. Implementation of the metaGA together with additional stochastic heuristics into a single software will allow rigorous optimization of each heuristic as well as a meaningful comparison of performances among these algorithms. MetaPIGA v2.0 gives access both to high customization for the phylogeneticist, as well as to an ergonomic interface and functionalities assisting the non-specialist for sound inference of large phylogenetic trees using nucleotide sequences. MetaPIGA v2.0 and its extensive user-manual are freely available to academics at http://www.metapiga.org.
引用
收藏
页数:11
相关论文
共 42 条
[1]   Reconstructing large regions of an ancestral mammalian genome in silico [J].
Blanchette, M ;
Green, ED ;
Miller, W ;
Haussler, D .
GENOME RESEARCH, 2004, 14 (12) :2412-2423
[2]   Phylogeny and biogeography of a cosmopolitan frog radiation: Late cretaceous diversification resulted in continent-scale endemism in the family ranidae [J].
Bossuyt, Franky ;
Brown, Rafe M. ;
Hillis, David M. ;
Cannatella, David C. ;
Milinkovitch, Michel C. .
SYSTEMATIC BIOLOGY, 2006, 55 (04) :579-594
[3]   Independent adaptation to riverine habitats allowed survival of ancient cetacean lineages [J].
Cassens, I ;
Vicario, S ;
Waddell, VG ;
Balchowsky, H ;
Van Belle, D ;
Ding, W ;
Fan, C ;
Mohan, RSL ;
Simoes-Lopes, PC ;
Bastida, R ;
Meyer, A ;
Stanhope, MJ ;
Milinkovitch, MC .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (21) :11343-11347
[4]   Applications of ancestral protein reconstruction in understanding protein function: GFP-like proteins [J].
Chang, BSW ;
Ugalde, JA ;
Matz, MV .
MOLECULAR EVOLUTION: PRODUCING THE BIOCHEMICAL DATA, PART B, 2005, 395 :652-670
[5]   Recreating a functional ancestral archosaur visual pigment [J].
Chang, BSW ;
Jönsson, K ;
Kazmi, MA ;
Donoghue, MJ ;
Sakmar, TP .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (09) :1483-1489
[6]   Maximum likelihood of evolutionary trees: hardness and approximation [J].
Chor, B ;
Tuller, T .
BIOINFORMATICS, 2005, 21 :I97-I106
[7]   The effect of haplotype-block definitions on inference of haplotype-block structure and htSNPs selection [J].
Ding, KY ;
Zhou, K ;
Zhang, J ;
Knight, J ;
Zhang, XG ;
Shen, Y .
MOLECULAR BIOLOGY AND EVOLUTION, 2005, 22 (01) :148-159
[8]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[9]  
Felsenstein J., 2002, INFERRING PHYLOGENIE
[10]   Large-scale assignment of orthology: back to phylogenetics? [J].
Gabaldon, Toni .
GENOME BIOLOGY, 2008, 9 (10) :235