Measuring Asymmetry in Time-Stamped Phylogenies

被引:17
作者
Dearlove, Bethany L. [1 ]
Frost, Simon D. W. [1 ]
机构
[1] Univ Cambridge, Dept Vet Med, Cambridge, England
基金
英国医学研究理事会;
关键词
FALSE DISCOVERY RATE; REJECTIVE MULTIPLE TEST; TREE SHAPE; MODELS; TESTS; DISTRIBUTIONS; TRANSMISSION; STATISTICS; EVOLUTION; DYNAMICS;
D O I
10.1371/journal.pcbi.1004312
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Previous work has shown that asymmetry in viral phylogenies may be indicative of heterogeneity in transmission, for example due to acute HIV infection or the presence of 'core groups' with higher contact rates. Hence, evidence of asymmetry may provide clues to underlying population structure, even when direct information on, for example, stage of infection or contact rates, are missing. However, current tests of phylogenetic asymmetry (a) suffer from false positives when the tips of the phylogeny are sampled at different times and (b) only test for global asymmetry, and hence suffer from false negatives when asymmetry is localised to part of a phylogeny. We present a simple permutation-based approach for testing for asymmetry in a phylogeny, where we compare the observed phylogeny with random phylogenies with the same sampling and coalescence times, to reduce the false positive rate. We also demonstrate how profiles of measures of asymmetry calculated over a range of evolutionary times in the phylogeny can be used to identify local asymmetry. In combination with different metrics of asymmetry, this combined approach offers detailed insights of how phylogenies reconstructed from real viral datasets may deviate from the simplistic assumptions of commonly used coalescent and birth-death process models.
引用
收藏
页数:16
相关论文
共 56 条
[1]   Power of eight tree shape statistics to detect nonrandom diversification: A comparison by simulation of two models of cladogenesis [J].
Agapow, PM ;
Purvis, A .
SYSTEMATIC BIOLOGY, 2002, 51 (06) :866-872
[2]   Serial SimCoal: A population genetics model for data from multiple populations and points in time [J].
Anderson, CNK ;
Ramakrishnan, U ;
Chan, YL ;
Hadly, EA .
BIOINFORMATICS, 2005, 21 (08) :1733-1734
[3]  
[Anonymous], 2019, R: A language for environment for statistical computing
[4]  
[Anonymous], 1982, Syst. Zool.
[5]  
[Anonymous], 1991, OXFORD SURVEYS EVOL
[6]  
[Anonymous], 2014, CLUSTER CLUSTER ANAL
[7]  
Benjamini Y, 2001, ANN STAT, V29, P1165
[8]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[9]   On statistical tests of phylogenetic tree imbalance:: The Sackin and other indices revisited [J].
Blum, MGB ;
François, O .
MATHEMATICAL BIOSCIENCES, 2005, 195 (02) :141-153
[10]   The mean, variance and limiting distribution of two statistics sensitive to phylogenetic tree balance [J].
Blum, Michael G. B. ;
Francois, Olivier ;
Janson, Svante .
ANNALS OF APPLIED PROBABILITY, 2006, 16 (04) :2195-2214