Estimating amino acid substitution models for metazoan evolutionary studies

Cited by: 2
Authors
Dang, Cuong Cao [1]
Vinh, Le Sy [1]
Affiliations
[1] Vietnam National University, University of Engineering and Technology, Hanoi, Vietnam
Keywords
amino acid substitution models; metazoan protein sequences; time non-reversible models; time-reversible models; phylogeny; sister
DOI
10.1111/jeb.14147
Chinese Library Classification
Q14 [Ecology (bioecology)]
Discipline classification codes
071012; 0713
Abstract
Amino acid substitution models represent the substitution rates among amino acids during the evolution of protein sequences. These models are a prerequisite for maximum likelihood and Bayesian methods that analyse the phylogenetic relationships among species from their protein sequences. Estimating amino acid substitution models requires large protein datasets and intensive computation. In this paper, we present the estimation of both a time-reversible model (Q.met) and a time non-reversible model (NQ.met) for multicellular animals (Metazoa). Analyses showed that the Q.met and NQ.met models were significantly better than existing models in analysing metazoan protein sequences. Moreover, the time non-reversible model NQ.met enables us to reconstruct the rooted phylogenetic tree for Metazoa. We recommend that researchers employ the Q.met and NQ.met models when analysing metazoan protein sequences.
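The difference between the two model classes named in the abstract can be shown with a toy example. The sketch below is not the authors' estimation procedure and contains no Q.met or NQ.met values. It assumes three hypothetical states instead of 20 amino acids, and every number and function name in it is invented for illustration. It relies only on the standard definition that a rate matrix Q with stationary frequencies pi is time-reversible when pi_i * q_ij = pi_j * q_ji for every pair of states.

```python
from itertools import combinations


def build_reversible_q(exch, freqs):
    """Build a rate matrix with q_ij = r_ij * pi_j from symmetric
    exchangeabilities r_ij and frequencies pi (hypothetical helper)."""
    n = len(freqs)
    q = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                q[i][j] = exch[i][j] * freqs[j]
        # The diagonal is still 0.0 here, so each row ends up summing to zero.
        q[i][i] = -sum(q[i])
    return q


def is_stationary(q, freqs, tol=1e-12):
    """Check that pi * Q = 0, i.e. pi is a stationary distribution of Q."""
    n = len(freqs)
    return all(
        abs(sum(freqs[i] * q[i][j] for i in range(n))) < tol for j in range(n)
    )


def satisfies_detailed_balance(q, freqs, tol=1e-12):
    """Check the time-reversibility condition pi_i * q_ij == pi_j * q_ji."""
    return all(
        abs(freqs[i] * q[i][j] - freqs[j] * q[j][i]) < tol
        for i, j in combinations(range(len(freqs)), 2)
    )


# Toy time-reversible model: symmetric exchangeabilities, made-up values.
toy_freqs = [0.5, 0.3, 0.2]
toy_exch = [[0, 1, 2], [1, 0, 3], [2, 3, 0]]
q_rev = build_reversible_q(toy_exch, toy_freqs)

# Toy non-reversible model: substitutions only flow 0 -> 1 -> 2 -> 0.
uniform = [1 / 3, 1 / 3, 1 / 3]
q_nonrev = [[-1.0, 1.0, 0.0], [0.0, -1.0, 1.0], [1.0, 0.0, -1.0]]

print(satisfies_detailed_balance(q_rev, toy_freqs))
print(satisfies_detailed_balance(q_nonrev, uniform))
```

Both toy matrices have a stationary distribution, but only the first satisfies the reversibility condition. Under a reversible model the substitution process looks the same forward and backward in time, which is why only the non-reversible class carries information about the direction of evolution and hence about the position of the root.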
Pages: 499-506
Page count: 8