Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss

被引:144
作者
Bansal, Mukul S. [1 ]
Alm, Eric J. [2 ]
Kellis, Manolis [1 ,3 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[2] MIT, Dept Biol Engn, Cambridge, MA 02139 USA
[3] Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
PHYLOGENETIC TREES; RECONSTRUCTION; HISTORY; ASSOCIATIONS; EVOLUTION; ORTHOLOGS; MODEL;
D O I
10.1093/bioinformatics/bts225
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Gene family evolution is driven by evolutionary events such as speciation, gene duplication, horizontal gene transfer and gene loss, and inferring these events in the evolutionary history of a given gene family is a fundamental problem in comparative and evolutionary genomics with numerous important applications. Solving this problem requires the use of a reconciliation framework, where the input consists of a gene family phylogeny and the corresponding species phylogeny, and the goal is to reconcile the two by postulating speciation, gene duplication, horizontal gene transfer and gene loss events. This reconciliation problem is referred to as duplication-transfer-loss (DTL) reconciliation and has been extensively studied in the literature. Yet, even the fastest existing algorithms for DTL reconciliation are too slow for reconciling large gene families and for use in more sophisticated applications such as gene tree or species tree reconstruction. Results: We present two new algorithms for the DTL reconciliation problem that are dramatically faster than existing algorithms, both asymptotically and in practice. We also extend the standard DTL reconciliation model by considering distance-dependent transfer costs, which allow for more accurate reconciliation and give an efficient algorithm for DTL reconciliation under this extended model. We implemented our new algorithms and demonstrated up to 100 000-fold speed-up over existing methods, using both simulated and biological datasets. This dramatic improvement makes it possible to use DTL reconciliation for performing rigorous evolutionary analyses of large gene families and enables its use in advanced reconciliation-based gene and species tree reconstruction methods.
引用
收藏
页码:I283 / I291
页数:9
相关论文
共 49 条
  • [1] Biased gene transfer in microbial evolution
    Andam, Cheryl P.
    Gogarten, J. Peter
    [J]. NATURE REVIEWS MICROBIOLOGY, 2011, 9 (07) : 543 - 555
  • [2] The Gene Evolution Model and Computing Its Associated Probabilities
    Arvestad, Lars
    Lagergren, Jens
    Sennblad, Bengt
    [J]. JOURNAL OF THE ACM, 2009, 56 (02)
  • [3] Bansal MS, 2007, LECT NOTES COMPUT SC, V4453, P238
  • [4] Lowest common ancestors in trees and directed acyclic graphs
    Bender, MA
    Farach-Colton, M
    Pemmasani, G
    Skiena, S
    Sumazin, P
    [J]. JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 2005, 57 (02): : 75 - 94
  • [5] Inferring and Validating Horizontal Gene Transfer Events Using Bipartition Dissimilarity
    Boc, Alix
    Philippe, Herve
    Makarenkov, Vladimir
    [J]. SYSTEMATIC BIOLOGY, 2010, 59 (02) : 195 - 211
  • [6] Reconciling a gene tree to a species tree under the duplication cost model
    Bonizzoni, P
    Della Vedova, G
    Dondi, R
    [J]. THEORETICAL COMPUTER SCIENCE, 2005, 347 (1-2) : 36 - 53
  • [7] Brodal GS, 2011, LECT NOTES COMPUT SC, V6844, P290, DOI 10.1007/978-3-642-22300-6_25
  • [8] Genome-Scale Phylogenetics: Inferring the Plant Tree of Life from 18,896 Gene Trees
    Burleigh, J. Gordon
    Bansal, Mukul S.
    Eulenstein, Oliver
    Hartmann, Stefanie
    Wehe, Andre
    Vision, Todd J.
    [J]. SYSTEMATIC BIOLOGY, 2011, 60 (02) : 117 - 125
  • [9] Jungles: a new solution to the host/parasite phylogeny reconciliation problem
    Charleston, MA
    [J]. MATHEMATICAL BIOSCIENCES, 1998, 149 (02) : 191 - 223
  • [10] Traversing the tangle: Algorithms and applications for cophylogenetic studies
    Charleston, NA
    Perkins, SL
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2006, 39 (01) : 62 - 71