Efficiencies of maximum likelihood methods of phylogenetic inferences when different substitution models are used

被引:17
作者
Piontkivska, H
机构
[1] Penn State Univ, Inst Mol Evolut Genet, University Pk, PA 16802 USA
[2] Penn State Univ, Dept Biol, University Pk, PA 16802 USA
关键词
maximum likelihood; nucleotide substitution model; phylogenetic tree; topological distance;
D O I
10.1016/j.ympev.2003.10.011
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Choice of a substitution model is a crucial step in the maximum likelihood (ML) method of phylogenetic inference, and investigators tend to prefer complex mathematical models to simple ones. However, when complex models with many parameters are used, the extent of noise in statistical inferences increases, and thus complex models may not produce the true topology with a higher probability than simple ones. This problem was studied using computer simulation. When the number of nucleotides used was relatively large (1000 bp), the HKY + Gamma model showed smaller d(T) (topological distance between the inferred and the true trees) than the JC and Kimura models. In the cases of shorter sequences (300 bp) simpler model and search algorithm such as JC model and SA + NNI search were found to be as efficient as more complicated searches and models in terms of topological distances, although the topologies obtained under HKY + F model had the highest likelihood values. The performance of relatively simple search algorithm SA + NNI was found to be essentially the same as that of more extensive SA + TBR search under all models studied. Similarly to the conclusions reached by Takahashi and Nei [Mol. Biol. Evol. 17 (2000) 125 1], our results indicate that simple models can be as efficient as complex models, and that use of complex models does not necessarily give more reliable trees compared with simple models. (C) 2003 Elsevier Inc. All rights reserved.
引用
收藏
页码:865 / 873
页数:9
相关论文
共 35 条
[1]  
ANDERSON CL, 1993, MOL BIOL EVOL, V10, P605
[2]   Topological bias and inconsistency of maximum likelihood using wrong models [J].
Bruno, WJ ;
Halpern, AL .
MOLECULAR BIOLOGY AND EVOLUTION, 1999, 16 (04) :564-566
[3]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[4]   DATING OF THE HUMAN APE SPLITTING BY A MOLECULAR CLOCK OF MITOCHONDRIAL-DNA [J].
HASEGAWA, M ;
KISHINO, H ;
YANO, TA .
JOURNAL OF MOLECULAR EVOLUTION, 1985, 22 (02) :160-174
[5]   A combined molecular approach to phylogeny of the lumping spider subfamily Dendryphantinae (Araneae: Salticidae) [J].
Hedin, MC ;
Maddison, WP .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 2001, 18 (03) :386-403
[6]   Phylogeny estimation and hypothesis testing using maximum likelihood [J].
Huelsenbeck, JP ;
Crandall, KA .
ANNUAL REVIEW OF ECOLOGY AND SYSTEMATICS, 1997, 28 :437-466
[7]  
JIN L, 1990, MOL BIOL EVOL, V7, P82
[8]  
Jukes TH, 1969, MAMMALIAN PROTEIN ME, P21, DOI [DOI 10.1016/B978-1-4832-3211-9.50009-7, DOI 10.1093/BIOINFORMATICS/BTM404]
[10]   The metapopulation genetic algorithm: An efficient solution for the problem of phylogeny estimation [J].
Lemmon, AR ;
Milinkovitch, MC .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (16) :10516-10521