MISFITS: Evaluating the Goodness of Fit between a Phylogenetic Model and an Alignment

Citations: 12
Authors
Nguyen, Minh Anh Thi [1]
Klaere, Steffen [2]
von Haeseler, Arndt [1]
Affiliations
[1] Univ Vet Med Vienna, Med Univ Vienna, Univ Vienna, Max F Perutz Labs,Ctr Integrat Bioinformat Vienna, Vienna, Austria
[2] Univ Auckland, Dept Math, Computat Evolut Grp, Auckland, New Zealand
Funding
Austrian Science Fund
Keywords
goodness of fit; model test; model evaluation; phylogeny inference; maximum likelihood; maximum parsimony; MAXIMUM-LIKELIHOOD-ESTIMATION; SEQUENCE EVOLUTION; STATISTICAL TESTS; INFERENCE; ALGORITHM; SELECTION; PATTERN; RATES;
DOI
10.1093/molbev/msq180
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010 ; 081704 ;
Abstract
As models of sequence evolution become increasingly complex, many criteria for model selection have been proposed, and tools are available to select the best model for an alignment under a particular criterion. However, in many instances the selected model fails to explain the data adequately, as reflected by large deviations between the observed pattern frequencies and the corresponding expectations. We present MISFITS, an approach to evaluate the goodness of fit (http://www.cibiv.at/software/misfits). MISFITS introduces a minimum number of "extra substitutions" on the inferred tree to provide a biologically motivated explanation of why the alignment may deviate from expectation. These extra substitutions, together with the evolutionary model, then fully explain the alignment. We illustrate the method on several examples and then survey the goodness of fit of the selected models to the alignments in the PANDIT database.
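The "deviation between observed pattern frequencies and expectation" mentioned in the abstract can be pictured with a small sketch. This is not the MISFITS algorithm and places no substitutions on a tree; it only counts alignment column patterns and subtracts the counts a model would predict. The function names `site_patterns` and `pattern_deviations`, the toy alignment, and the expected pattern probabilities are all hypothetical and chosen purely for illustration.

```python
from collections import Counter


def site_patterns(alignment):
    """Count column patterns in an alignment given as equal-length strings."""
    if len({len(seq) for seq in alignment}) != 1:
        raise ValueError("all sequences must have the same length")
    # Each column, read top to bottom across the sequences, is one site pattern.
    return Counter("".join(column) for column in zip(*alignment))


def pattern_deviations(alignment, expected_probs):
    """Return observed minus expected count for every pattern.

    expected_probs maps a pattern to its probability under some model;
    here it is supplied by hand (an assumption), not computed from a tree.
    """
    observed = site_patterns(alignment)
    n_sites = len(alignment[0])
    patterns = set(observed) | set(expected_probs)
    return {
        p: observed.get(p, 0) - n_sites * expected_probs.get(p, 0.0)
        for p in patterns
    }


# Toy data: three sequences, four sites (columns AAA, AAA, CCG, GTT).
alignment = ["AACG", "AACT", "AAGT"]
# Made-up model expectations for illustration only.
expected = {"AAA": 0.5, "CCG": 0.25, "GTT": 0.125, "CCC": 0.125}

dev = pattern_deviations(alignment, expected)
for pattern in sorted(dev):
    print(pattern, dev[pattern])
```

A positive value marks a pattern seen more often than the assumed model predicts, and a negative value one seen less often. Judging from the abstract, MISFITS goes further and explains such discrepancies through extra substitutions on the inferred tree, which this sketch does not attempt.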
Pages: 143-152
Number of pages: 10