Penalized Likelihood Phylogenetic Inference: Bridging the Parsimony-Likelihood Gap

被引:35
作者
Kim, Junhyong [1 ,2 ]
Sanderson, Michael J. [3 ]
机构
[1] Univ Penn, Dept Biol, Philadelphia, PA 19104 USA
[2] Univ Penn, Penn Genome Frontiers Inst, Philadelphia, PA 19104 USA
[3] Univ Arizona, Dept Ecol & Evolutionary Biol, Tucson, AZ 85721 USA
关键词
Model selection; penalized likelihood; phylogeny estimation; semi-parametric;
D O I
10.1080/10635150802422274
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The increasing diversity and heterogeneity of molecular data for phylogeny estimation has led to development of complex models and model-based estimators. Here, we propose a penalized likelihood (PL) framework in which the levels of complexity in the underlying model can be smoothly controlled. We demonstrate the PL framework for a four-taxon tree case and investigate its properties. The PL framework yields an estimator in which the majority of currently employed estimators such as the maximum-parsimony estimator, homogeneous likelihood estimator, gamma mixture likelihood estimator, etc., become special cases of a single family of PL estimators. Furthermore, using the appropriate penalty function, the complexity of the underlying models can be partitioned into separately controlled classes allowing flexible control of model complexity.
引用
收藏
页码:665 / 674
页数:10
相关论文
共 48 条
[1]   Covarion structure in plastid genome evolution:: A new statistical test [J].
Ané, C ;
Burleigh, JG ;
McMahon, MM ;
Sanderson, MJ .
MOLECULAR BIOLOGY AND EVOLUTION, 2005, 22 (04) :914-924
[2]  
[Anonymous], 2006, GARLI GENETIC ALGORI
[3]   Substantial regional variation in substitution rates in the human genome: Importance of GC content, gene density, and telomere-specific effects [J].
Arndt, PF ;
Hwa, T ;
Petrov, DA .
JOURNAL OF MOLECULAR EVOLUTION, 2005, 60 (06) :748-U28
[4]   Assessing among-locus variation in the inference of seed plant phylogeny [J].
Burleigh, J. Gordon ;
Mathews, Sarah .
INTERNATIONAL JOURNAL OF PLANT SCIENCES, 2007, 168 (02) :111-124
[5]  
Burnham K.P., 2002, Model selection and multimodel inference: a practical information-theoretic approach, DOI 10.1007/978-1-4757-2917-7_3
[6]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[7]  
Friedman J, 2001, The elements of statistical learning, V1, DOI DOI 10.1007/978-0-387-21606-5
[8]   Genotyping of single nucleotide polymorphism using model-based clustering [J].
Fujisawa, H ;
Eguchi, S ;
Ushijima, M ;
Miyata, S ;
Miki, Y ;
Muto, T ;
Matsuura, M .
BIOINFORMATICS, 2004, 20 (05) :718-U489
[9]   Maximum likelihood outperforms maximum parsimony even when evolutionary rates are heterotachous [J].
Gadagkar, SR ;
Kumar, S .
MOLECULAR BIOLOGY AND EVOLUTION, 2005, 22 (11) :2139-2141
[10]   A call for likelihood phylogenetics even when the process of sequence evolution is heterogeneous [J].
Gaucher, EA ;
Miyamoto, MM .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 2005, 37 (03) :928-931