Manhattan Path-Difference Median Trees

被引:1
作者
Markin, Alexey [1 ]
Eulenstein, Oliver [1 ]
机构
[1] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
来源
PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS | 2016年
基金
美国国家科学基金会;
关键词
Path-Difference distance; median trees; supertrees; Manhattan distance; local search; phylogenetics; ALGORITHMS; PHYLOGENY; SUPERTREE; EVOLUTION;
D O I
10.1145/2975167.2975209
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Median tree problems are a powerful tool for inferring large-scale phylogenetic trees that hold enormous promise for society at large. Such problems seek a median tree for a given collection of input trees under some problem-specific distance. Here, we introduce this problem for the classic Manhattan path-difference distance and show that this problem is NP-hard. To address this inherent time complexity we devise an ILP formulation, and an effective local search heuristic that is based on solving a local search problem exactly. Our algorithm for the local search problem improves asymptotically by a factor of n/log n on the best-known (naive) solution, where n is the size of the input trees. Finally, we demonstrate the ability of our heuristic in a comparative study using large-scale published empirical data sets, and showing its accuracy for small phylogenetic studies by using exact ILP solutions.
引用
收藏
页码:404 / 413
页数:10
相关论文
共 42 条
  • [1] [Anonymous], 2002, PAUP*. Phylogenetic Analysis Using Parsimony (*and other methods). Version 4
  • [2] [Anonymous], 1997, ACM SIGACT NEWS
  • [3] Efficient genome-scale phylogenetic analysis under the duplication-loss and deep coalescence cost models
    Bansal, Mukul S.
    Burleigh, J. Gordon
    Eulenstein, Oliver
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [4] Robinson-Foulds Supertrees
    Bansal, Mukul S.
    Burleigh, J. Gordon
    Eulenstein, Oliver
    Fernandez-Baca, David
    [J]. ALGORITHMS FOR MOLECULAR BIOLOGY, 2010, 5
  • [5] Markovian trees: properties and algorithms
    Bean, Nigel G.
    Kontoleon, Nectarios
    Taylor, Peter G.
    [J]. ANNALS OF OPERATIONS RESEARCH, 2008, 160 (01) : 31 - 50
  • [6] Bininda-Emonds OlafR. P., 2004, Computational Biology, V4
  • [7] Nodal distance algorithm: Calculating a phylogenetic tree comparison metric
    Bluis, J
    Shin, DG
    [J]. THIRD IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING - BIBE 2003, PROCEEDINGS, 2003, : 87 - 94
  • [8] Hunting for trees in binary character sets: Efficient algorithms for extraction, enumeration, and optimization
    Bryant, D
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 1996, 3 (02) : 275 - 288
  • [9] A species-level phylogenetic supertree of marsupials
    Cardillo, M
    Bininda-Emonds, ORP
    Boakes, E
    Purvis, A
    [J]. JOURNAL OF ZOOLOGY, 2004, 264 : 11 - 31
  • [10] An ILP solution for the gene duplication problem
    Chang, Wen-Chieh
    Burleigh, Gordon J.
    Fernandez-Baca, David F.
    Eulenstein, Oliver
    [J]. BMC BIOINFORMATICS, 2011, 12