Accurate and efficient cell lineage tree inference from noisy single cell data: the maximum likelihood perfect phylogeny approach

被引:17
作者
Wu, Yufeng [1 ]
机构
[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA
基金
美国国家科学基金会;
关键词
HETEROGENEITY; NUCLEOTIDE; EVOLUTION;
D O I
10.1093/bioinformatics/btz676
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Cells in an organism share a common evolutionary history, called cell lineage tree. Cell lineage tree can be inferred from single cell genotypes at genomic variation sites. Cell lineage tree inference from noisy single cell data is a challenging computational problem. Most existing methods for cell lineage tree inference assume uniform uncertainty in genotypes. A key missing aspect is that real single cell data usually has non-uniform uncertainty in individual genotypes. Moreover, existing methods are often sampling based and can be very slow for large data. Results: In this article, we propose a new method called ScisTree, which infers cell lineage tree and calls genotypes from noisy single cell genotype data. Different from most existing approaches, ScisTree works with genotype probabilities of individual genotypes (which can be computed by existing single cell genotype callers). ScisTree assumes the infinite sites model. Given uncertain genotypes with individualized probabilities, ScisTree implements a fast heuristic for inferring cell lineage tree and calling the genotypes that allow the so-called perfect phylogeny and maximize the likelihood of the genotypes. Through simulation, we show that ScisTree performs well on the accuracy of inferred trees, and is much more efficient than existing methods. The efficiency of ScisTree enables new applications including imputation of the so-called doublets.
引用
收藏
页码:742 / 750
页数:9
相关论文
共 19 条
  • [1] [Anonymous], 1997, ACM SIGACT NEWS
  • [2] Linkage disequilibrium based genotype calling from low-coverage shotgun sequencing reads
    Duitama, Jorge
    Kennedy, Justin
    Dinakar, Sanjiv
    Hernandez, Yoezen
    Wu, Yufeng
    Mandoiu, Ion I.
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [3] Dissecting the clonal origins of childhood acute lymphoblastic leukemia by single-cell genomics
    Gawad, Charles
    Koh, Winston
    Quake, Stephen R.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 (50) : 17947 - 17952
  • [4] Intratumor Heterogeneity and Branched Evolution Revealed by Multiregion Sequencing
    Gerlinger, Marco
    Rowan, Andrew J.
    Horswell, Stuart
    Larkin, James
    Endesfelder, David
    Gronroos, Eva
    Martinez, Pierre
    Matthews, Nicholas
    Stewart, Aengus
    Tarpey, Patrick
    Varela, Ignacio
    Phillimore, Benjamin
    Begum, Sharmin
    McDonald, Neil Q.
    Butler, Adam
    Jones, David
    Raine, Keiran
    Latimer, Calli
    Santos, Claudio R.
    Nohadani, Mahrokh
    Eklund, Aron C.
    Spencer-Dene, Bradley
    Clark, Graham
    Pickering, Lisa
    Stamp, Gordon
    Gore, Martin
    Szallasi, Zoltan
    Downward, Julian
    Futreal, P. Andrew
    Swanton, Charles
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2012, 366 (10) : 883 - 892
  • [5] The evolutionary history of lethal metastatic prostate cancer
    Gundem, Gunes
    Van Loo, Peter
    Kremeyer, Barbara
    Alexandrov, Ludmil B.
    Tubio, Jose M. C.
    Papaemmanuil, Elli
    Brewer, Daniel S.
    Kallio, Heini M. L.
    Hoegnas, Gunilla
    Annala, Matti
    Kivinummi, Kati
    Goody, Victoria
    Latimer, Calli
    O'Meara, Sarah
    Dawson, Kevin J.
    Isaacs, William
    Emmert-Buck, Michael R.
    Nykter, Matti
    Foster, Christopher
    Kote-Jarai, Zsofia
    Easton, Douglas
    Whitaker, Hayley C.
    Neal, David E.
    Cooper, Colin S.
    Eeles, Rosalind A.
    Visakorpi, Tapio
    Campbell, Peter J.
    McDermott, Ultan
    Wedge, David C.
    Bova, G. Steven
    [J]. NATURE, 2015, 520 (7547) : 353 - +
  • [6] EFFICIENT ALGORITHMS FOR INFERRING EVOLUTIONARY TREES
    GUSFIELD, D
    [J]. NETWORKS, 1991, 21 (01) : 19 - 28
  • [7] Gusfield D, 2014, RECOMBINATORICS: THE ALGORITHMICS OF ANCESTRAL RECOMBINATION GRAPHS AND EXPLICIT PHYLOGENETIC NETWORKS, P1
  • [8] Tree inference for single-cell data
    Jahn, Katharina
    Kuipers, Jack
    Beerenwinkel, Niko
    [J]. GENOME BIOLOGY, 2016, 17
  • [9] Single-cell sequencing data reveal widespread recurrence and loss of mutational hits in the life histories of tumors
    Kuipers, Jack
    Jahn, Katharina
    Raphael, Benjamin J.
    Beerenwinkel, Niko
    [J]. GENOME RESEARCH, 2017, 27 (11) : 1885 - 1894
  • [10] Computational enhancement of single-cell sequences for inferring tumor evolution
    Miura, Sayaka
    Huuki, Louise A.
    Buturla, Tiffany
    Vu, Tracy
    Gomez, Karen
    Kumar, Sudhir
    [J]. BIOINFORMATICS, 2018, 34 (17) : 917 - 926