GHOST: Recovering Historical Signal from Heterotachously Evolved Sequence Alignments

被引:131
作者
Crotty, Stephen M. [1 ,2 ,3 ,4 ]
Bui Quang Minh [1 ,2 ,5 ]
Bean, Nigel G. [3 ,4 ]
Holland, Barbara R. [6 ]
Tuke, Jonathan [3 ,4 ]
Jermiin, Lars S. [5 ,7 ,8 ,9 ]
von Haeseler, Arndt [1 ,2 ,10 ]
机构
[1] Univ Vienna, Ctr Integrat Bioinformat Vienna, Max F Perutz Labs, Vienna, Austria
[2] Med Univ Vienna, Vienna, Austria
[3] Univ Adelaide, Sch Math Sci, Adelaide, SA 5005, Australia
[4] Univ Adelaide, ARC Ctr Excellence Math & Stat Frontiers, Adelaide, SA, Australia
[5] Australian Natl Univ, Res Sch Biol, Canberra, ACT 2601, Australia
[6] Univ Tasmania, Sch Nat Sci, Hobart, Tas 7001, Australia
[7] CSIRO Land & Water, Black Mt Labs, Canberra, ACT 2601, Australia
[8] Univ Coll Dublin, Sch Biol & Environm Sci, Dublin 4, Ireland
[9] Univ Coll Dublin, Earth Inst, Dublin 4, Ireland
[10] Univ Vienna, Fac Comp Sci, Bioinformat & Computat Biol, Vienna, Austria
基金
奥地利科学基金会;
关键词
Convergent evolution; heterotachy; maximum likelihood; mixture model; phylogenetics; MAXIMUM-LIKELIHOOD; MIXTURE MODEL; PHYLOGENETIC MODELS; EVOLUTIONARY TREES; DNA-SEQUENCES; COVARION; IDENTIFIABILITY; PARSIMONY; SELECTION; HETEROGENEITY;
D O I
10.1093/sysbio/syz051
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Molecular sequence data that have evolved under the influence of heterotachous evolutionary processes are known to mislead phylogenetic inference. We introduce the General Heterogeneous evolution On a Single Topology (GHOST) model of sequence evolution, implemented under a maximum-likelihood framework in the phylogenetic program IQ-TREE (hap://www.iqtree.org). Simulations show that using the GHOST model, IQ-TREE can accurately recover the tree topology, branch lengths, and substitution model parameters from heterotachously evolved sequences. We investigate the performance of the GHOST model on empirical data by sampling phylogenomic alignments of varying lengths from a plastome alignment. We then carry out inference under the GHOST model on a phylogenomic data set composed of 248 genes from 16 taxa, where we find the GHOST model concurs with the currently accepted view, placing turtles as a sister lineage of archosaurs, in contrast to results obtained using traditional variable rates-across-sites models. Finally, we apply the model to a data set composed of a sodium channel gene of 11 fish taxa, finding that the GHOST model is able to elucidate a subtle component of the historical signal, linked to the previously established convergent evolution of the electric organ in two geographically distinct lineages of electric fish. We compare inference under the GHOST model to partitioning by codon position and show that, owing to the minimization of model constraints, the GHOST model offers unique biological insights when applied to empirical data.
引用
收藏
页码:249 / 264
页数:16
相关论文
共 60 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]   Identifiability of a Markovian model of molecular evolution with gamma-distributed rates [J].
Allman, Elizabeth S. ;
Ane, Cecile ;
Rhodes, John A. .
ADVANCES IN APPLIED PROBABILITY, 2008, 40 (01) :229-249
[3]   Identifying evolutionary trees and substitution parameters for the general Markov model with invariable sites [J].
Allman, Elizabeth S. ;
Rhodes, John A. .
MATHEMATICAL BIOSCIENCES, 2008, 211 (01) :18-33
[4]   The identifiability of tree topology for phylogenetic models, including covarion and mixture models [J].
Allman, Elizabeth S. ;
Rhodes, John A. .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2006, 13 (05) :1101-1113
[5]   Identifiability of Two-Tree Mixtures for Group-Based Models [J].
Allman, Elizabeth S. ;
Petrovic, Sonja ;
Rhodes, John A. ;
Sullivant, Seth .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (03) :710-722
[6]   An improved statistical method for detecting heterotachy in nucleotide sequences [J].
Baele, Guy ;
Raes, Jeroen ;
Van de Peer, Yves ;
Vansteelandt, Stijn .
MOLECULAR BIOLOGY AND EVOLUTION, 2006, 23 (07) :1397-1405
[8]  
Burnham K. P., 2002, Model selection and multimodel inference: A practical information-theoretic approach, V2nd
[9]   Phylogenomic analyses support the position of turtles as the sister group of birds and crocodiles (Archosauria) [J].
Chiari, Ylenia ;
Cahais, Vincent ;
Galtier, Nicolas ;
Delsuc, Frederic .
BMC BIOLOGY, 2012, 10
[10]  
Crotty S.M., 2018, CHARACTERISING GENET, DOI [10.1101/455303, DOI 10.1101/455303]