A Metric on Phylogenetic Tree Shapes

被引:38
作者
Colijn, C. [1 ]
Plazzotta, G. [1 ]
机构
[1] Imperial Coll, Dept Math, 180 Queens Gate, London SW7 2AZ, England
基金
英国工程与自然科学研究理事会;
关键词
tree metric; phylodynamics; tree shapes; A H3N2 VIRUSES; 2; MODELS; GLOBAL CIRCULATION; GENEALOGICAL TREES; INFLUENZA; STATISTICS; PATTERNS; ISOMORPHISM; PHENOGRAMS; IMBALANCE;
D O I
10.1093/sysbio/syx046
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The shapes of evolutionary trees are influenced by the nature of the evolutionary process but comparisons of trees fromdifferent processes are hindered by the challenge of completely describing tree shape. We present a full characterization of the shapes of rooted branching trees in a form that lends itself to natural tree comparisons. We use this characterization to define a metric, in the sense of a true distance function, on tree shapes. The metric distinguishes trees from random models known to produce different tree shapes. It separates trees derived from tropical versus USA influenza A sequences, which reflect the differing epidemiology of tropical and seasonal flu. We describe several metrics based on the same core characterization, and illustrate howto extend themetric to incorporate trees' branch lengths or other features such as overall imbalance. Our approach allows us to construct addition and multiplication on trees, and to create a convex metric on tree shapes which formally allows computation of average tree shapes.
引用
收藏
页码:113 / 126
页数:14
相关论文
共 63 条
[1]   Power of eight tree shape statistics to detect nonrandom diversification: A comparison by simulation of two models of cladogenesis [J].
Agapow, PM ;
Purvis, A .
SYSTEMATIC BIOLOGY, 2002, 51 (06) :866-872
[2]  
Aldous David., 1996, Random Discrete Structures, volume 76 of The IMA Volumes in Mathematics and its Applications, V76
[3]   Stochastic models and descriptive statistics for phylogenetic trees, from Yule to today [J].
Aldous, DJ .
STATISTICAL SCIENCE, 2001, 16 (01) :23-34
[4]  
[Anonymous], 1979, ZAMM-Z ANGEW MATH ME, V59, P141
[5]  
Anopheles gambiae 1000 Genomes, 2016, ANOPHELES GAMBIAE 10
[6]   Global circulation patterns of seasonal influenza viruses vary with antigenic drift [J].
Bedford, Trevor ;
Riley, Steven ;
Barr, Ian G. ;
Broor, Shobha ;
Chadha, Mandeep ;
Cox, Nancy J. ;
Daniels, Rodney S. ;
Gunasekaran, C. Palani ;
Hurt, Aeron C. ;
Kelso, Anne ;
Klimov, Alexander ;
Lewis, Nicola S. ;
Li, Xiyan ;
McCauley, John W. ;
Odagiri, Takato ;
Potdar, Varsha ;
Rambaut, Andrew ;
Shu, Yuelong ;
Skepner, Eugene ;
Smith, Derek J. ;
Suchard, Marc A. ;
Tashiro, Masato ;
Wang, Dayan ;
Xu, Xiyan ;
Lemey, Philippe ;
Russell, Colin A. .
NATURE, 2015, 523 (7559) :217-U206
[7]   Geometry of the space of phylogenetic trees [J].
Billera, LJ ;
Holmes, SP ;
Vogtmann, K .
ADVANCES IN APPLIED MATHEMATICS, 2001, 27 (04) :733-767
[8]   The mean, variance and limiting distribution of two statistics sensitive to phylogenetic tree balance [J].
Blum, Michael G. B. ;
Francois, Olivier ;
Janson, Svante .
ANNALS OF APPLIED PROBABILITY, 2006, 16 (04) :2195-2214
[9]   Which random processes describe the tree of life?: A large-scale study of phylogenetic tree imbalance [J].
Blum, Michael G. B. ;
Francois, Olivier .
SYSTEMATIC BIOLOGY, 2006, 55 (04) :685-691
[10]   apTreeshape:: statistical analysis of phylogenetic tree shape [J].
Bortolussi, N ;
Durand, E ;
Blum, M ;
François, O .
BIOINFORMATICS, 2006, 22 (03) :363-364