Summarizing the solution space in tumor phylogeny inference by multiple consensus trees

Cited by: 17
Authors
Aguse, Nuraini [1 ]
Qi, Yuanyuan [1 ]
El-Kebir, Mohammed [1 ]
Affiliations
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
Funding
National Science Foundation (USA);
Keywords
EVOLUTION; TRACKING; HISTORY;
DOI
10.1093/bioinformatics/btz312
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification code
071010 ; 081704 ;
Abstract
Motivation: Cancer phylogenies are key to studying tumorigenesis and have clinical implications. Due to the heterogeneous nature of cancer and limitations in current sequencing technology, current cancer phylogeny inference methods identify a large solution space of plausible phylogenies. To facilitate further downstream analyses, methods that accurately summarize such a set T of cancer phylogenies are imperative. However, current summary methods are limited to a single consensus tree or graph and may miss important topological features that are present in different subsets of candidate trees.
Results: We introduce the Multiple Consensus Tree (MCT) problem to simultaneously cluster T and infer a consensus tree for each cluster. We show that MCT is NP-hard, and present an exact algorithm based on mixed integer linear programming (MILP). In addition, we introduce a heuristic algorithm that efficiently identifies high-quality consensus trees, recovering all optimal solutions identified by the MILP in simulated data at a fraction of the time. We demonstrate the applicability of our methods on both simulated and real data, showing that our approach selects the number of clusters depending on the complexity of the solution space T.
Availability and implementation: https://github.com/elkebir-group/MCT.
Supplementary information: Supplementary data are available at Bioinformatics online.
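Illustrative sketch (not from the paper): the idea of clustering a set T of trees and summarizing each cluster by one representative tree can be shown with a small alternating procedure. This is not the authors' MILP or their heuristic. It assumes that each tree is encoded as a set of (parent, child) edges, that the distance between two trees is the size of the symmetric difference of their edge sets, and that each cluster is summarized by a medoid (the member tree closest to the others) rather than by an inferred consensus tree. All function names are hypothetical.

```python
import random

def distance(t1, t2):
    """Number of (parent, child) edges present in exactly one of the two trees."""
    return len(t1 ^ t2)

def medoid(trees):
    """Member tree with the smallest total distance to all trees in the list."""
    return min(trees, key=lambda c: sum(distance(c, t) for t in trees))

def cluster_trees(trees, k, seed=0, max_iter=100):
    """Alternate between assigning trees to the nearest representative and
    recomputing each cluster's medoid (a stand-in for a consensus tree)."""
    rng = random.Random(seed)
    centers = rng.sample(trees, k)
    assignment = None
    for _ in range(max_iter):
        new_assignment = [
            min(range(k), key=lambda j: distance(t, centers[j])) for t in trees
        ]
        if new_assignment == assignment:
            break  # assignments are stable, so the representatives are too
        assignment = new_assignment
        for j in range(k):
            members = [t for t, a in zip(trees, assignment) if a == j]
            if members:  # keep the old representative if a cluster is empty
                centers[j] = medoid(members)
    cost = sum(distance(t, centers[a]) for t, a in zip(trees, assignment))
    return assignment, centers, cost

# Toy solution space: two pairs of similar trees over mutations a, b, c.
toy_trees = [
    frozenset({("r", "a"), ("a", "b"), ("b", "c")}),
    frozenset({("r", "a"), ("a", "b"), ("a", "c")}),
    frozenset({("r", "c"), ("c", "b"), ("b", "a")}),
    frozenset({("r", "c"), ("c", "b"), ("c", "a")}),
]
toy_assignment, toy_centers, toy_cost = cluster_trees(toy_trees, k=2)
print(toy_assignment, toy_cost)
```

On the toy input, the first two trees share a cluster and the last two share the other. A medoid is always one of the input trees, whereas a consensus tree in the sense of the abstract need not be, so this sketch only conveys the clustering intuition.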
Pages: I408-I416
Number of pages: 9