SiCloneFit: Bayesian inference of population structure, genotype, and phylogeny of tumor clones from single-cell genome sequencing data

被引:68
|
作者
Zafar, Hamim [1 ,2 ]
Navin, Nicholas [3 ]
Chen, Ken [2 ]
Nakhleh, Luay [1 ]
机构
[1] Rice Univ, Dept Comp Sci, Houston, TX 77005 USA
[2] Univ Texas MD Anderson Canc Ctr, Dept Bioinformat & Computat Biol, Houston, TX 77030 USA
[3] Univ Texas MD Anderson Canc Ctr, Dept Genet, Houston, TX 77030 USA
基金
美国国家科学基金会;
关键词
INTRATUMOR HETEROGENEITY; CANCER; EVOLUTION; SELECTION; HISTORY; MODEL;
D O I
10.1101/gr.243121.118
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Accumulation and selection of somatic mutations in a Darwinian framework result in intra-tumor heterogeneity (ITH) that poses significant challenges to the diagnosis and clinical therapy of cancer. Identification of the tumor cell populations (clones) and reconstruction of their evolutionary relationship can elucidate this heterogeneity. Recently developed single-cell DNA sequencing (SCS) technologies promise to resolve ITH to a single-cell level. However, technical errors in SCS data sets, including false-positives (FP) and false-negatives (FN) due to allelic dropout, and cell doublets, significantly complicate these tasks. Here, we propose a nonparametric Bayesian method that reconstructs the clonal populations as clusters of single cells, genotypes of each clone, and the evolutionary relationship between the clones. It employs a tree-structured Chinese restaurant process as the prior on the number and composition of clonal populations. The evolution of the clonal populations is modeled by a clonal phylogeny and a finite-site model of evolution to account for potential mutation recurrence and losses. We probabilistically account for FP and FN errors, and cell doublets are modeled by employing a Beta-binomial distribution. We develop a Gibbs sampling algorithm comprising partial reversible-jump and partial Metropolis-Hastings updates to explore the joint posterior space of all parameters. The performance of our method on synthetic and experimental data sets suggests that joint reconstruction of tumor clones and clonal phylogeny under a finite-site model of evolution leads to more accurate inferences. Our method is the first to enable this joint reconstruction in a fully Bayesian framework, thus providing measures of support of the inferences it makes.
引用
收藏
页码:1847 / 1859
页数:13
相关论文
共 50 条
  • [41] Approximate Bayesian computation for inferring Waddington landscapes from single-cell data
    Liu, Yujing
    Zhang, Stephen Y.
    Kleijn, Istvan T.
    Stumpf, Michael P. H.
    ROYAL SOCIETY OPEN SCIENCE, 2024, 11 (07):
  • [42] ddClone: joint statistical inference of clonal populations from single cell and bulk tumour sequencing data
    Salehi, Sohrab
    Steif, Adi
    Roth, Andrew
    Aparicio, Samuel
    Bouchard-Cote, Alexandre
    Shah, Sohrab P.
    GENOME BIOLOGY, 2017, 18
  • [43] Evaluation of genetic demultiplexing of single-cell sequencing data from model species
    Cardiello, Joseph F.
    Araus, Alberto Joven
    Giatrellis, Sarantis
    Helsens, Clement
    Simon, Andras
    Leigh, Nicholas
    LIFE SCIENCE ALLIANCE, 2023, 6 (08)
  • [44] Latent periodic process inference from single-cell RNA-seq data
    Liang, Shaoheng
    Wang, Fang
    Han, Jincheng
    Chen, Ken
    NATURE COMMUNICATIONS, 2020, 11 (01)
  • [45] NestedBD: Bayesian inference of phylogenetic trees from single-cell copy number profiles under a birth-death model
    Liu, Yushu
    Edrisi, Mohammadamin
    Yan, Zhi
    Ogilvie, Huw A.
    Nakhleh, Luay
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2024, 19 (01)
  • [46] Benchmarking algorithms for gene regulatory network inference from single-cell transcriptomic data
    Pratapa, Aditya
    Jalihal, Amogh P.
    Law, Jeffrey N.
    Bharadwaj, Aditya
    Murali, T. M.
    NATURE METHODS, 2020, 17 (02) : 147 - +
  • [47] Inferring replication timing and proliferation dynamics from single-cell DNA sequencing data
    Weiner, Adam C.
    Williams, Marc J.
    Shi, Hongyu
    Vazquez-Garcia, Ignacio
    Salehi, Sohrab
    Rusk, Nicole
    Aparicio, Samuel
    Shah, Sohrab P.
    McPherson, Andrew
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [48] Learning mutational graphs of individual tumour evolution from single-cell and multi-region sequencing data
    Ramazzotti, Daniele
    Graudenzi, Alex
    De Sano, Luca
    Antoniotti, Marco
    Caravagna, Giulio
    BMC BIOINFORMATICS, 2019, 20 (1)
  • [49] Differential Variation Analysis Enables Detection of Tumor Heterogeneity Using Single-Cell RNA-Sequencing Data
    Davis-Marcisak, Emily F.
    Sherman, Thomas D.
    Orugunta, Pranay
    Stein-O'Brien, Genevieve L.
    Puram, Sidharth V.
    Torres, Evanthia T. Roussos
    Hopkins, Alexander C.
    Jaffee, Elizabeth M.
    Favorov, Alexander V.
    Afsari, Bahman
    Goff, Loyal A.
    Fertig, Elana J.
    CANCER RESEARCH, 2019, 79 (19) : 5102 - 5112
  • [50] A Simple Strategy for Reducing False Negatives in Calling Variants from Single-Cell Sequencing Data
    Ji, Cong
    Miao, Zong
    He, Xionglei
    PLOS ONE, 2015, 10 (04):