A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood

Cited by: 14674
Authors
Guindon, S [1 ]
Gascuel, O [1 ]
Affiliations
[1] CNRS, LIRMM, F-34392 Montpellier 5, France
Keywords
algorithm; computer simulations; maximum likelihood; phylogeny; rbcL; RDPII project;
DOI
10.1080/10635150390235520
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Subject classification codes
07; 0710; 09;
Abstract
The increase in the number of large data sets and the complexity of current probabilistic sequence evolution models necessitates fast and reliable phylogeny reconstruction methods. We describe a new approach, based on the maximum-likelihood principle, which clearly satisfies these requirements. The core of this method is a simple hill-climbing algorithm that adjusts tree topology and branch lengths simultaneously. This algorithm starts from an initial tree built by a fast distance-based method and modifies this tree to improve its likelihood at each iteration. Due to this simultaneous adjustment of the topology and branch lengths, only a few iterations are sufficient to reach an optimum. We used extensive and realistic computer simulations to show that the topological accuracy of this new method is at least as high as that of the existing maximum-likelihood programs and much higher than the performance of distance-based and parsimony approaches. The reduction of computing time is dramatic in comparison with other maximum-likelihood packages, while the likelihood maximization ability tends to be higher. For example, only 12 min were required on a standard personal computer to analyze a data set consisting of 500 rbcL sequences with 1,428 base pairs from plant plastids, thus reaching a speed of the same order as some popular distance-based and parsimony algorithms. This new method is implemented in the PHYML program, which is freely available on our web page: http://www.lirmm.fr/w3ifa/MAAS/.
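The abstract describes the core of the method as a hill-climbing loop: start from an initial solution and modify it at each iteration to improve its likelihood. The following is a generic sketch of that loop only, not PHYML's implementation. The names `hill_climb`, `score`, and `neighbors` are hypothetical, and the toy integer objective merely stands in for a tree likelihood, which would be computed over topology and branch lengths.

```python
from typing import Callable, Iterable, Tuple, TypeVar

S = TypeVar("S")


def hill_climb(
    start: S,
    score: Callable[[S], float],
    neighbors: Callable[[S], Iterable[S]],
    max_iter: int = 1000,
) -> Tuple[S, float, int]:
    """Generic best-improvement hill climbing (illustrative, not PHYML).

    Returns the final state, its score, and the number of accepted moves.
    """
    current, current_score = start, score(start)
    accepted = 0
    for _ in range(max_iter):
        # Evaluate every candidate modification of the current state.
        best, best_score = current, current_score
        for candidate in neighbors(current):
            s = score(candidate)
            if s > best_score:
                best, best_score = candidate, s
        # Stop at a local optimum: no candidate improves the score.
        if best_score <= current_score:
            break
        current, current_score = best, best_score
        accepted += 1
    return current, current_score, accepted


# Toy stand-in for a likelihood surface (assumption): maximum at x = 3.
def toy_score(x: int) -> float:
    return -(x - 3) ** 2


# Toy stand-in for tree rearrangements (assumption): step left or right.
def toy_neighbors(x: int) -> Iterable[int]:
    return [x - 1, x + 1]


result = hill_climb(0, toy_score, toy_neighbors)
```

In a phylogenetic setting the state would be a tree, `neighbors` would propose local rearrangements and branch-length changes, and `score` would be the log-likelihood; those pieces are deliberately left out of this sketch.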
Pages: 696 - 704
Page count: 9
Related papers
50 in total
  • [1] IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies
    Lam-Tung Nguyen
    Schmidt, Heiko A.
    von Haeseler, Arndt
    Bui Quang Minh
    MOLECULAR BIOLOGY AND EVOLUTION, 2015, 32 (01) : 268 - 274
  • [2] FastMG: a simple, fast, and accurate maximum likelihood procedure to estimate amino acid replacement rate matrices from large data sets
    Cuong Cao Dang
    Vinh Sy Le
    Gascuel, Olivier
    Hazes, Bart
    Quang Si Le
    BMC BIOINFORMATICS, 2014, 15 : 341
  • [3] New Algorithms and Methods to Estimate Maximum-Likelihood Phylogenies: Assessing the Performance of PhyML 3.0
    Guindon, Stephane
    Dufayard, Jean-Francois
    Lefort, Vincent
    Anisimova, Maria
    Hordijk, Wim
    Gascuel, Olivier
    SYSTEMATIC BIOLOGY, 2010, 59 (03) : 307 - 321
  • [4] A fast likelihood approach for estimation of large phylogenies from continuous trait data
    Peng, Jing
    Rajeevan, Haseena
    Kubatko, Laura
    RoyChoudhury, Arindam
    MOLECULAR PHYLOGENETICS AND EVOLUTION, 2021, 161
  • [5] Maximum likelihood estimation on large phylogenies and analysis of adaptive evolution in human influenza virus A
    Yang, ZH
    JOURNAL OF MOLECULAR EVOLUTION, 2000, 51 (05) : 423 - 432
  • [6] Fast and accurate estimation of the covariance between pairwise maximum likelihood distances
    Gil, Manuel
    PEERJ, 2014, 2
  • [7] On fast computation of the non-parametric maximum likelihood estimate of a mixing distribution
    Wang, Yong
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2007, 69 : 185 - 198
  • [8] PHYLOGENIES FROM RESTRICTION SITES - A MAXIMUM-LIKELIHOOD APPROACH
    FELSENSTEIN, J
    EVOLUTION, 1992, 46 (01) : 159 - 173
  • [9] MLMD: Maximum Likelihood Mixture Decoupling for Fast and Accurate Point Cloud Registration
    Eckart, Ben
    Kim, Kihwan
    Troccoli, Alejandro
    Kelly, Alonzo
    Kautz, Jan
    2015 INTERNATIONAL CONFERENCE ON 3D VISION, 2015, : 241 - 249
  • [10] A fast image reconstruction algorithm based on penalized-likelihood estimate
    Sheng, JH
    Ying, L
    MEDICAL ENGINEERING & PHYSICS, 2005, 27 (08) : 679 - 686