Evolutionary trees and the Ising model on the Bethe lattice: a proof of Steel's conjecture

被引:40
作者
Daskalakis, Constantinos [2 ]
Mossel, Elchanan [3 ,4 ]
Roch, Sebastien [1 ]
机构
[1] Univ Calif Los Angeles, Dept Math, Los Angeles, CA 90024 USA
[2] MIT, CSAIL, Cambridge, MA 02139 USA
[3] UC Berkeley, Berkeley, CA USA
[4] Weizmann Inst Sci, IL-76100 Rehovot, Israel
基金
美国国家科学基金会;
关键词
Phylogenetics; CFN model; Ising model; Phase transitions; Reconstruction problem; Jukes-Cantor; GLAUBER DYNAMICS; RECONSTRUCTION; TRANSITION; GRAPHS; STATE;
D O I
10.1007/s00440-009-0246-2
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A major task of evolutionary biology is the reconstruction of phylogenetic trees from molecular data. The evolutionary model is given by a Markov chain on a tree. Given samples from the leaves of the Markov chain, the goal is to reconstruct the leaf-labelled tree. It is well known that in order to reconstruct a tree on n leaves, sample sequences of length Omega (log n) are needed. It was conjectured by Steel that for the CFN/Ising evolutionary model, if the mutation probability on all edges of the tree is less than p* = (root 2 - 1)/2(3/2), then the tree can be recovered from sequences of length O(log n). The value p* is given by the transition point for the extremality of the free Gibbs measure for the Ising model on the binary tree. Steel's conjecture was proven by the second author in the special case where the tree is "balanced." The second author also proved that if all edges have mutation probability larger than p* then the length needed is n(Omega(1)). Here we show that Steel's conjecture holds true for general trees by giving a reconstruction algorithm that recovers the tree from O(log n)-length sequences when the mutation probabilities are discretized and less than p*. Our proof and results demonstrate that extremality of the free Gibbs measure on the infinite binary tree, which has been studied before in probability, statistical physics and computer science, determines how distinguishable are Gibbs measures on finite binary trees.
引用
收藏
页码:149 / 189
页数:41
相关论文
共 36 条
[1]  
[Anonymous], 2004, Inferring phylogenies
[2]   Glauber dynamics on trees and hyperbolic graphs [J].
Berger, N ;
Kenyon, C ;
Mossel, E ;
Peres, Y .
PROBABILITY THEORY AND RELATED FIELDS, 2005, 131 (03) :311-340
[3]   ON THE PURITY OF THE LIMITING GIBBS STATE FOR THE ISING-MODEL ON THE BETHE LATTICE [J].
BLEHER, PM ;
RUIZ, J ;
ZAGREBNOV, VA .
JOURNAL OF STATISTICAL PHYSICS, 1995, 79 (1-2) :473-482
[4]  
Borgs C, 2006, ANN IEEE SYMP FOUND, P518
[5]  
BUNEMAN P, 1971, MATH ARCHAELOGICAL H, P187
[6]  
CAVENDER JA, 1978, MATH BIOSCI, V40
[7]   Full reconstruction of Markov models on evolutionary trees: Identifiability and consistency [J].
Chang, JT .
MATHEMATICAL BIOSCIENCES, 1996, 137 (01) :51-73
[8]  
Daskalakis C., 2006, STOC'06. Proceedings of the 38th Annual ACM Symposium on Theory of Computing, P159, DOI 10.1145/1132516.1132540
[9]  
Erdos PL, 1999, RANDOM STRUCT ALGOR, V14, P153, DOI 10.1002/(SICI)1098-2418(199903)14:2<153::AID-RSA3>3.0.CO
[10]  
2-R