Checking Phylogenetics Decisiveness in Theory and in Practice

被引:0
|
作者
Parvini, Ghazaleh [1 ]
Braught, Katherine [2 ]
Fernandez-Baca, David [3 ]
机构
[1] Univ Massachusetts, Coll Informat & Comp Sci, Amherst, MA 01003 USA
[2] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[3] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
关键词
Phylogeny; Partitioning algorithms; Color; Terminology; Steel; Software; Reliability; Phylogenetic tree; taxon coverage; algorithms; no-rainbow hypergraph coloring; integer linear programming; satisfiability; TREE;
D O I
10.1109/TCBB.2021.3128381
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Suppose we aim to build a phylogeny for a set of taxa X using information from a collection of loci, where each locus offers information for only a fraction of the taxa. The question is whether, based solely on the pattern of data availability, called a taxon coverage pattern, one can determine if the data suffices to construct a reliable phylogeny. The problem can be expressed combinatorially as follows. Let us call a taxon coverage pattern decisive if, for any binary phylogenetic tree T for X, the collection of phylogenetic trees obtained by restricting T to the subset of X covered by each locus uniquely determines T. Here we relate the problem of checking whether a taxon coverage pattern is decisive to a hypergraph coloring problem. Using this connection, we (1) show that checking decisiveness is co-NP complete; (2) obtain lower bounds on the amount of coverage needed to achieve decisiveness; (3) devise an exact algorithm for decisiveness; (4) develop problem reduction rules, and use them to obtain efficient algorithms for inputs with few loci; and (5) devise Boolean satisfiability (SAT) and integer linear programming formulations (ILP) of decisiveness that allow us to analyze data sets that arise in practice. For data sets that are not decisive, we use our SATand ILP formulations to obtain decisive subsets of the data.
引用
收藏
页码:3307 / 3316
页数:10
相关论文
共 50 条
  • [1] Applying Theory to Practice
    Fagin, Ronald
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 3 - 4
  • [2] Intersectionality: From Theory to Practice
    Al-Faham, Hajer
    Davis, Angelique M.
    Ernst, Rose
    ANNUAL REVIEW OF LAW AND SOCIAL SCIENCE, VOL 15, 2019, 15 : 247 - 265
  • [3] Epilepsy in Cats: Theory and Practice
    Pakozdy, A.
    Halasz, P.
    Klang, A.
    JOURNAL OF VETERINARY INTERNAL MEDICINE, 2014, 28 (02): : 255 - 263
  • [4] Domain decomposition: Theory and practice
    Gerteisen, E
    Gruber, R
    Merazzi, S
    COMPUTERS IN PHYSICS, 1997, 11 (01): : 26 - 30
  • [5] Theory and Practice in Large Carpooling Problems
    Hartman, Irith Ben-Arroyo
    Keren, Daniel
    Abu Dbai, Abed
    Cohen, Elad
    Knapen, Luk
    Yasar, Ansar-Ul-Haque
    Janssens, Davy
    5TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2014), THE 4TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT-2014), 2014, 32 : 339 - 347
  • [6] Algorithmic Actants in Practice, Theory, and Method
    Zamith, Rodrigo
    Haim, Mario
    MEDIA AND COMMUNICATION, 2020, 8 (03): : 1 - 4
  • [7] Theory and Practice of Ultra-Perfection
    Ouangraoua, Aida
    Bergeron, Anne
    Swenson, Krister M.
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2011, 18 (09) : 1219 - 1230
  • [8] On terminological figurativeness From theory to practice
    Timofeeva-Timofeev, Larissa
    Vargas-Sierra, Chelo
    TERMINOLOGY, 2015, 21 (01): : 102 - 125
  • [9] Indexing for Summary Queries: Theory and Practice
    Yi, Ke
    Wang, Lu
    Wei, Zhewei
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2014, 39 (01):
  • [10] Structural stability: from theory to practice
    Chen, WF
    ENGINEERING STRUCTURES, 2000, 22 (02) : 116 - 122