The binary perfect phylogeny with persistent characters

被引:18
作者
Bonizzoni, Paola [1 ]
Braghin, Chiara [2 ]
Dondi, Riccardo [3 ]
Trucco, Gabriella [2 ]
机构
[1] Univ Milano Bicocca, Dipartimento Informat Sistemist & Comunicaz, I-20126 Milan, Italy
[2] Univ Milan, Dipartimento Tecnol Informaz, Crema, Italy
[3] Univ Bergamo, Dipartimento Sci Linguaggi Comunicaz & Culturali, Bergamo, Italy
关键词
LINEAR-TIME ALGORITHM; TREE;
D O I
10.1016/j.tcs.2012.05.035
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The binary perfect phylogeny model is too restrictive to model biological events such as back mutations. In this paper, we consider a natural generalization of the model that allows a special type of back mutation. We investigate the problem of reconstructing a near perfect phylogeny over a binary set of characters where characters are persistent: characters can be gained and lost at most once. Based on this notion, we define the problem of the Persistent Perfect Phylogeny (referred as P-PP). We restate the P-PP problem as a special case of the Incomplete Directed Perfect Phylogeny, called Incomplete Perfect Phylogeny with Persistent Completion, (refereed as IP-PP), where the instance is an incomplete binary matrix M having some missing entries, denoted by symbol ?, that must be determined (or completed) as 0 or 1 so that M admits a binary perfect phylogeny. We show that the IP-PP problem can be reduced to a problem over an edge colored graph since the completion of each column of the input matrix can be represented by a graph operation. Based on this graph formulation, we develop an exact algorithm for solving the P-PP problem that is exponential in the number of characters and polynomial in the number of species. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:51 / 63
页数:13
相关论文
共 19 条
[1]  
[Anonymous], 1997, Introduction to computational molecular biology
[2]   A linear-time algorithm for the perfect phylogeny haplotype problem [J].
Bonizzoni, Paola .
ALGORITHMICA, 2007, 48 (03) :267-285
[3]   A METHOD FOR DEDUCING BRANCHING SEQUENCES IN PHYLOGENY [J].
CAMIN, JH ;
SOKAL, RR .
EVOLUTION, 1965, 19 (03) :311-326
[4]  
Cormen T., 2001, Introduction to Algorithms
[5]   A linear-time algorithm for the Perfect Phylogeny Haplotyping (PPH) problem [J].
Ding, ZH ;
Filkov, V ;
Gusfield, D .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2006, 13 (02) :522-553
[6]  
Eskin Eleazar, 2003, J Bioinform Comput Biol, V1, P1, DOI 10.1142/S0219720003000174
[7]  
Felsenstein J., 2003, Inferring phylogenies
[8]  
Fernandez-Baca D., 2000, PERFECT PHYLOGENY PR
[9]   EFFICIENT ALGORITHMS FOR INFERRING EVOLUTIONARY TREES [J].
GUSFIELD, D .
NETWORKS, 1991, 21 (01) :19-28
[10]  
Gusfield D., 2002, PROC 6 ANN INT C COM, P166, DOI [DOI 10.1145/565196.565218., 10.1145/565196.565218, DOI 10.1145/565196.565218]