Relative timing information and orthology in evolutionary scenarios

被引:0
作者
Schaller, David [1 ,2 ]
Hartmann, Tom [1 ,2 ]
Lafond, Manuel [4 ]
Stadler, Peter F. [1 ,2 ,6 ,7 ,8 ,9 ,10 ,11 ]
Wieseke, Nicolas [5 ]
Hellmuth, Marc [3 ]
机构
[1] Univ Leipzig, Bioinformat Grp, Dept Comp Sci, Hartelstr 16-18, D-04107 Leipzig, Germany
[2] Univ Leipzig, Interdisciplinary Ctr Bioinformat, Hartelstr 16-18, D-04107 Leipzig, Germany
[3] Stockholm Univ, Fac Sci, Dept Math, S-10691 Stockholm, Sweden
[4] Univ Sherbrooke, Dept Comp Sci, 2500 Boul Univ, Sherbrooke, PQ J1K 2R1, Canada
[5] Univ Leipzig, Fac Math & Comp Sci, Swarm Intelligence & Complex Syst Grp, Augustuspl 10, D-04109 Leipzig, Germany
[6] Univ Leipzig, Competence Ctr Scalable Data Serv & Solut Dresden, Interdisciplinary Ctr Bioinformat, German Ctr Integrat Biodivers Res IDiv, Augustuspl 12, D-04107 Leipzig, Germany
[7] Univ Leipzig, Leipzig Res Ctr Civilizat Dis, Augustuspl 12, D-04107 Leipzig, Germany
[8] Max Planck Inst Math Sci, Inselstr 22, D-04109 Leipzig, Germany
[9] Univ Vienna, Dept Theoret Chem, Wahringer Str 17, A-1090 Vienna, Austria
[10] Univ Nacl Colombia, Fac Ciencias, Ciudad Univ, Bogota 111321, DC, Colombia
[11] Santa Fe Inst, 1399 Hyde Pk Rd, Santa Fe, NM 87501 USA
关键词
Gene tree; Species tree; Cograph; Perfect graph; Orthology; Xenology; Horizontal gene transfer; Informative and forbidden triples; Relative timing; NP-hardness; LOWEST COMMON ANCESTORS; GENE; TREES; DUPLICATIONS;
D O I
10.1186/s13015-023-00240-4
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundEvolutionary scenarios describing the evolution of a family of genes within a collection of species comprise the mapping of the vertices of a gene tree T to vertices and edges of a species tree S. The relative timing of the last common ancestors of two extant genes (leaves of T) and the last common ancestors of the two species (leaves of S) in which they reside is indicative of horizontal gene transfers (HGT) and ancient duplications. Orthologous gene pairs, on the other hand, require that their last common ancestors coincides with a corresponding speciation event. The relative timing information of gene and species divergences is captured by three colored graphs that have the extant genes as vertices and the species in which the genes are found as vertex colors: the equal-divergence-time (EDT) graph, the later-divergence-time (LDT) graph and the prior-divergence-time (PDT) graph, which together form an edge partition of the complete graph.ResultsHere we give a complete characterization in terms of informative and forbidden triples that can be read off the three graphs and provide a polynomial time algorithm for constructing an evolutionary scenario that explains the graphs, provided such a scenario exists. While both LDT and PDT graphs are cographs, this is not true for the EDT graph in general. We show that every EDT graph is perfect. While the information about LDT and PDT graphs is necessary to recognize EDT graphs in polynomial-time for general scenarios, this extra information can be dropped in the HGT-free case. However, recognition of EDT graphs without knowledge of putative LDT and PDT graphs is NP-complete for general scenarios. In contrast, PDT graphs can be recognized in polynomial-time. We finally connect the EDT graph to the alternative definitions of orthology that have been proposed for scenarios with horizontal gene transfer. With one exception, the corresponding graphs are shown to be colored cographs.
引用
收藏
页数:43
相关论文
共 57 条
[1]   INFERRING A TREE FROM LOWEST COMMON ANCESTORS WITH AN APPLICATION TO THE OPTIMIZATION OF RELATIONAL EXPRESSIONS [J].
AHO, AV ;
SAGIV, Y ;
SZYMANSKI, TG ;
ULLMAN, JD .
SIAM JOURNAL ON COMPUTING, 1981, 10 (03) :405-421
[2]   Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss [J].
Bansal, Mukul S. ;
Alm, Eric J. ;
Kellis, Manolis .
BIOINFORMATICS, 2012, 28 (12) :I283-I291
[3]   Lowest common ancestors in trees and directed acyclic graphs [J].
Bender, MA ;
Farach-Colton, M ;
Pemmasani, G ;
Skiena, S ;
Sumazin, P .
JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 2005, 57 (02) :75-94
[4]   The Level Ancestor Problem simplified [J].
Bender, MA ;
Farach-Colton, M .
THEORETICAL COMPUTER SCIENCE, 2004, 321 (01) :5-12
[5]   FINDING LEVEL-ANCESTORS IN TREES [J].
BERKMAN, O ;
VISHKIN, U .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1994, 48 (02) :214-230
[6]   The ancestor of modern Holozoa acquired the CCA-adding enzyme from Alphaproteobacteria by horizontal gene transfer [J].
Betat, Heike ;
Mede, Tobias ;
Tretbar, Sandy ;
Steiner, Lydia ;
Stadler, Peter F. ;
Moerl, Mario ;
Prohaska, Sonja J. .
NUCLEIC ACIDS RESEARCH, 2015, 43 (14) :6739-6746
[7]   Pattern matching for permutations [J].
Bose, P ;
Buss, AF ;
Lubiw, A .
INFORMATION PROCESSING LETTERS, 1998, 65 (05) :277-283
[8]   Extension operations on sets of leaf-labelled trees [J].
Bryant, D ;
Steel, M .
ADVANCES IN APPLIED MATHEMATICS, 1995, 16 (04) :425-453
[9]   Recognizing Berge graphs [J].
Chudnovsky, M ;
Cornuéjols, G ;
Liu, XM ;
Seymour, P ;
Vuskovic, K .
COMBINATORICA, 2005, 25 (02) :143-186
[10]   The strong perfect graph theorem [J].
Chudnovsky, Maria ;
Robertson, Neil ;
Seymour, Paul ;
Thomas, Robin .
ANNALS OF MATHEMATICS, 2006, 164 (01) :51-229