nQMaker: Estimating Time Nonreversible Amino Acid Substitution Models

被引:16
作者
Cuong Cao Dang [1 ]
Bui Quang Minh [2 ]
McShea, Hanon [3 ]
Masel, Joanna [4 ]
James, Jennifer Eleanor [5 ]
Le Sy Vinh [1 ]
Lanfear, Robert [6 ]
机构
[1] Vietnam Natl Univ, Univ Engn & Technol, Fac Informat Technol, 144 Xuan Thuy, Hanoi 10000, Vietnam
[2] Australian Natl Univ, Sch Comp, Computat Phylogen Lab, Canberra, ACT 2601, Australia
[3] Stanford Univ, Sch Earth Energy & Environm Sci, Dept Earth Syst Sci, Palo Alto, CA 94305 USA
[4] Univ Arizona, Dept Ecol & Evolutionary Biol, Tucson, AZ 85721 USA
[5] Uppsala Univ, Evolutionary Biol Ctr, Dept Ecol & Genet, Plant Ecol & Evolut, SE-75236 Uppsala, Sweden
[6] Australian Natl Univ, Res Sch Biol, Dept Ecol & Evolut, Canberra, ACT 2601, Australia
基金
澳大利亚研究理事会;
关键词
amino acid sequence analyses; amino acid substitution models; maximum likelihood model estimation; nonreversible models; phylogenetic inference; reversible models; MAXIMUM-LIKELIHOOD-ESTIMATION; PHYLOGENETIC TREES; PROTEIN EVOLUTION; DNA-SEQUENCES; PHYLOGENOMICS; MATRICES; LIFE;
D O I
10.1093/sysbio/syac007
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Amino acid substitution models are a key component in phylogenetic analyses of protein sequences. All commonly used amino acid models available to date are time-reversible, an assumption designed for computational convenience but not for biological reality. Another significant downside to time-reversible models is that they do not allow inference of rooted trees without outgroups. In this article, we introduce a maximum likelihood approach nQMaker, an extension of the recently published QMaker method, that allows the estimation of time nonreversible amino acid substitution models and rooted phylogenetic trees from a set of protein sequence alignments. We show that the nonreversible models estimated with nQMaker are a much better fit to empirical alignments than pre-existing reversible models, across a wide range of data sets including mammals, birds, plants, fungi, and other taxa, and that the improvements in model fit scale with the size of the data set. Notably, for the recently published plant and bird trees, these nonreversible models correctly recovered the commonly estimated root placements with very high-statistical support without the need to use an outgroup. We provide nQMaker as an easy-to-use feature in the IQ-TREE software (http://www.iqtree.org), allowing users to estimate nonreversible models and rooted phylogenies from their own protein data sets.
引用
收藏
页码:1110 / 1123
页数:14
相关论文
共 46 条
[31]   When Outgroups Fail; Phylogenomics of Rooting the Emerging Pathogen, Coxiella burnetii [J].
Pearson, Talima ;
Hornstra, Heidie M. ;
Sahl, Jason W. ;
Schaack, Sarah ;
Schupp, James M. ;
Beckstrom-Sternberg, Stephen M. ;
O'Neill, Matthew W. ;
Priestley, Rachael A. ;
Champion, Mia D. ;
Beckstrom-Sternberg, James S. ;
Kersh, Gilbert J. ;
Samuel, James E. ;
Massung, Robert F. ;
Keim, Paul .
SYSTEMATIC BIOLOGY, 2013, 62 (05) :752-762
[32]   Phylogenomics resolves the deep phylogeny of seed plants and indicates partial convergent or homoplastic evolution between Gnetales and angiosperms [J].
Ran, Jin-Hua ;
Shen, Ting-Ting ;
Wang, Ming-Ming ;
Wang, Xiao-Quan .
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2018, 285 (1881)
[33]   COMPARISON OF PHYLOGENETIC TREES [J].
ROBINSON, DF ;
FOULDS, LR .
MATHEMATICAL BIOSCIENCES, 1981, 53 (1-2) :131-147
[34]   Next-generation genome annotation: we still struggle to get it right [J].
Salzberg, Steven L. .
GENOME BIOLOGY, 2019, 20 (1)
[35]   ESTIMATING DIMENSION OF A MODEL [J].
SCHWARZ, G .
ANNALS OF STATISTICS, 1978, 6 (02) :461-464
[36]   Tempo and Mode of Genome Evolution in the Budding Yeast Subphylum [J].
Shen, Xing-Xing ;
Opulente, Dana A. ;
Kominek, Jacek ;
Zhou, Xiaofan ;
Steenwyk, Jacob L. ;
Buh, Kelly V. ;
Haase, Max A. B. ;
Wisecaver, Jennifer H. ;
Wang, Mingshuang ;
Doering, Drew T. ;
Boudouris, James T. ;
Schneider, Rachel M. ;
Langdon, Quinn K. ;
Ohkuma, Moriya ;
Endoh, Rikiya ;
Takashima, Masako ;
Manabe, Ri-ichiroh ;
Cadez, Neza ;
Libkind, Diego ;
Rosa, Carlos A. ;
DeVirgilio, Jeremy ;
Hulfachor, Amanda Beth ;
Groenewald, Marizeth ;
Kurtzman, Cletus P. ;
Hittinger, Chris Todd ;
Rokas, Antonis .
CELL, 2018, 175 (06) :1533-+
[37]   An approximately unbiased test of phylogenetic tree selection [J].
Shimodaira, H .
SYSTEMATIC BIOLOGY, 2002, 51 (03) :492-508
[38]   Quantifying the Stationarity and Time Reversibility of the Nucleotide Substitution Process [J].
Squartini, Federico ;
Arndt, Peter F. .
MOLECULAR BIOLOGY AND EVOLUTION, 2008, 25 (12) :2525-2535
[39]   Current Methods for Automated Filtering of Multiple Sequence Alignments Frequently Worsen Single-Gene Phylogenetic Inference [J].
Tan, Ge ;
Muffato, Matthieu ;
Ledergerber, Christian ;
Herrero, Javier ;
Goldman, Nick ;
Gil, Manuel ;
Dessimoz, Christophe .
SYSTEMATIC BIOLOGY, 2015, 64 (05) :778-791
[40]   Phylogenetic rooting using minimal ancestor deviation [J].
Tria, Fernando Domingues Kuemmel ;
Landan, Giddy ;
Dagan, Tal .
NATURE ECOLOGY & EVOLUTION, 2017, 1 (07)