Topological rearrangements and local search method for tandem duplication trees

被引:13
作者
Bertrand, D [1 ]
Gascuel, O [1 ]
机构
[1] Univ Montpellier 2, CNRS,UMR 5506, LIRMM, Projet Methodes & Algorithmes Bioinformat, F-34392 Montpellier 5, France
关键词
tandem duplication trees; phylogeny; topological rearrangements; local search; parsimony; minimum evolution; Zinc finger genes;
D O I
10.1109/TCBB.2005.15
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The problem of reconstructing the duplication history of a set of tandemly repeated sequences was first introduced by Fitch [4]. Many recent studies deal with this problem, showing the validity of the unequal recombination model proposed by Fitch, describing numerous inference algorithms, and exploring the combinatorial properties of these new mathematical objects, which are duplication trees. In this paper, we deal with the topological rearrangement of these trees. Classical rearrangements used in phylogeny (NNI, SPR, TBR,...) cannot be applied directly on duplication trees. We show that restricting the neighborhood defined by the SPR (Subtree Pruning and Regrafting) rearrangement to valid duplication trees, allows exploring the whole duplication tree space. We use these restricted rearrangements in a local search method which improves an initial tree via successive rearrangements. This method is applied to the optimization of parsimony and minimum evolution criteria. We show through simulations that this method improves all existing programs for both reconstructing the topology of the true tree and recovering its duplication events. We apply this approach to tandemly repeated human Zinc finger genes and observe that a much better duplication tree is obtained by our method than using any other program.
引用
收藏
页码:15 / 28
页数:14
相关论文
共 55 条
[1]  
Barthelemy J.P., 1991, TREES PROXIMITY REPR
[2]  
Benson G, 1999, Proc Int Conf Intell Syst Mol Biol, P44
[3]   Inferring evolutionary trees with strong combinatorial evidence [J].
Berry, V ;
Gascuel, O .
THEORETICAL COMPUTER SCIENCE, 2000, 240 (02) :271-298
[4]   The complete genome sequence of Escherichia coli K-12 [J].
Blattner, FR ;
Plunkett, G ;
Bloch, CA ;
Perna, NT ;
Burland, V ;
Riley, M ;
ColladoVides, J ;
Glasner, JD ;
Rode, CK ;
Mayhew, GF ;
Gregor, J ;
Davis, NW ;
Kirkpatrick, HA ;
Goeden, MA ;
Rose, DJ ;
Mau, B ;
Shao, Y .
SCIENCE, 1997, 277 (5331) :1453-+
[6]   Theoretical foundation of the balanced minimum evolution method of phylogenetic inference and its relationship to weighted least-squares tree fitting [J].
Desper, R ;
Gascuel, O .
MOLECULAR BIOLOGY AND EVOLUTION, 2004, 21 (03) :587-598
[7]   Fast and accurate phylogeny reconstruction algorithms based on the minimum-evolution principle [J].
Desper, R ;
Gascuel, O .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (05) :687-705
[8]  
ELBARABI T, 1991, MECH DEVELOP, V33, P155
[9]   An efficient and accurate distance based algorithm to reconstruct tandem duplication trees [J].
Elemento, O ;
Gascuel, O .
BIOINFORMATICS, 2002, 18 :S92-S99
[10]   Reconstructing the duplication history of tandemly repeated genes [J].
Elemento, O ;
Gascuel, O ;
Lefranc, MP .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (03) :278-288