COALESCENT-BASED SPECIES TREE INFERENCE FROM GENE TREE TOPOLOGIES UNDER INCOMPLETE LINEAGE SORTING BY MAXIMUM LIKELIHOOD

被引:109
作者
Wu, Yufeng [1 ]
机构
[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA
基金
美国国家科学基金会;
关键词
Coalescent models; gene trees and species trees; incomplete lineage sorting; maximum likelihood; species tree estimation; PHYLOGENY; DNA; DISTRIBUTIONS; SEQUENCES; DESCENT;
D O I
10.1111/j.1558-5646.2011.01476.x
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Incomplete lineage sorting can cause incongruence between the phylogenetic history of genes (the gene tree) and that of the species (the species tree), which can complicate the inference of phylogenies. In this article, I present a new coalescent-based algorithm for species tree inference with maximum likelihood. I first describe an improved method for computing the probability of a gene tree topology given a species tree, which is much faster than an existing algorithm by Degnan and Salter (2005). Based on this method, I develop a practical algorithm that takes a set of gene tree topologies and infers species trees with maximum likelihood. This algorithm searches for the best species tree by starting from initial species trees and performing heuristic search to obtain better trees with higher likelihood. This algorithm, called STELLS (which stands for Species Tree InfErence with Likelihood for Lineage Sorting), has been implemented in a program that is downloadable from the authors web page. The simulation results show that the STELLS algorithm is more accurate than an existing maximum likelihood method for many datasets, especially when there is noise in gene trees. I also show that the STELLS algorithm is efficient and can be applied to real biological datasets.
引用
收藏
页码:763 / 775
页数:13
相关论文
共 31 条
[1]  
[Anonymous], 2004, Inferring phylogenies
[2]  
[Anonymous], 2002, Algorithms for Minimization Without Derivatives
[3]  
[Anonymous], 2005, PHYLIP (phylogeny inference package) version 3.6
[4]   Estimating species phylogeny from gene-tree probabilities despite incomplete lineage sorting:: An example from Melanoplus grasshoppers [J].
Carstens, Bryan C. ;
Knowles, L. Lacey .
SYSTEMATIC BIOLOGY, 2007, 56 (03) :400-411
[5]   Discordance of species trees with their most likely gene trees [J].
Degnan, James H. ;
Rosenberg, Noah A. .
PLOS GENETICS, 2006, 2 (05) :762-768
[6]  
Degnan JH, 2005, EVOLUTION, V59, P24
[7]   IS A NEW AND GENERAL THEORY OF MOLECULAR SYSTEMATICS EMERGING? [J].
Edwards, Scott V. .
EVOLUTION, 2009, 63 (01) :1-19
[8]   Estimating species trees using approximate Bayesian computation [J].
Fan, Helen Hang ;
Kubatko, Laura S. .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 2011, 59 (02) :354-363
[9]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[10]   DATING OF THE HUMAN APE SPLITTING BY A MOLECULAR CLOCK OF MITOCHONDRIAL-DNA [J].
HASEGAWA, M ;
KISHINO, H ;
YANO, TA .
JOURNAL OF MOLECULAR EVOLUTION, 1985, 22 (02) :160-174