SPhyR: tumor phylogeny estimation from single-cell sequencing data under loss and error

被引:66
作者
El-Kebir, Mohammed [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
基金
美国国家科学基金会;
关键词
PERFECT PHYLOGENY; INFERENCE; CANCER; ALGORITHM; EVOLUTION; SAMPLES; TREES; MODEL;
D O I
10.1093/bioinformatics/bty589
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Cancer is characterized by intra-tumor heterogeneity, the presence of distinct cell populations with distinct complements of somatic mutations, which include single-nucleotide variants (SNVs) and copy-number aberrations (CNAs). Single-cell sequencing technology enables one to study these cell populations at single-cell resolution. Phylogeny estimation algorithms that employ appropriate evolutionary models are key to understanding the evolutionary mechanisms behind intra-tumor heterogeneity. Results: We introduce Single-cell Phylogeny Reconstruction (SPhyR), a method for tumor phylogeny estimation from single-cell sequencing data. In light of frequent loss of SNVs due to CNAs in cancer, SPhyR employs the k-Dollo evolutionary model, where a mutation can only be gained once but lost k times. Underlying SPhyR is a novel combinatorial characterization of solutions as constrained integer matrix completions, based on a connection to the cladistic multi-state perfect phylogeny problem. SPhyR outperforms existing methods on simulated data and on a metastatic colorectal cancer.
引用
收藏
页码:671 / 679
页数:9
相关论文
共 31 条
[11]   Inferring parsimonious migration histories for metastatic cancers [J].
El-Kebir, Mohammed ;
Satas, Gryte ;
Raphael, Benjamin J. .
NATURE GENETICS, 2018, 50 (05) :718-+
[12]   Reconstruction of clonal trees and tumor composition from multi-sample sequencing data [J].
El-Kebir, Mohammed ;
Oesper, Layla ;
Acheson-Field, Hannah ;
Raphael, Benjamin J. .
BIOINFORMATICS, 2015, 31 (12) :62-70
[13]  
ESTABROOK G F, 1975, Mathematical Biosciences, V23, P263, DOI 10.1016/0025-5564(75)90040-1
[14]  
Fernandez-Baca D., 2000, STEINER TREES IND
[15]   EFFICIENT ALGORITHMS FOR INFERRING EVOLUTIONARY TREES [J].
GUSFIELD, D .
NETWORKS, 1991, 21 (01) :19-28
[16]  
Gusfield Dan., 2015, Proceedings of the 6th ACM Conference on Bioinformatics, Computational Biology and Health Informatics, BCB'15, P443
[17]   Generating samples under a Wright-Fisher neutral model of genetic variation [J].
Hudson, RR .
BIOINFORMATICS, 2002, 18 (02) :337-338
[18]   Tree inference for single-cell data [J].
Jahn, Katharina ;
Kuipers, Jack ;
Beerenwinkel, Niko .
GENOME BIOLOGY, 2016, 17
[19]   A fast algorithm for the computation and enumeration of perfect phylogenies [J].
Kannan, S ;
Warnow, T .
SIAM JOURNAL ON COMPUTING, 1997, 26 (06) :1749-1763
[20]   Single-cell sequencing data reveal widespread recurrence and loss of mutational hits in the life histories of tumors [J].
Kuipers, Jack ;
Jahn, Katharina ;
Raphael, Benjamin J. ;
Beerenwinkel, Niko .
GENOME RESEARCH, 2017, 27 (11) :1885-1894