Scelestial: Fast and accurate single-cell lineage tree inference based on a Steiner tree approximation algorithm

被引:0
作者
Foroughmand-Araabi, Mohammad-Hadi [1 ,2 ]
Goliaei, Sama [1 ,2 ]
Mchardy, Alice C. [1 ,2 ]
机构
[1] Helmholtz Ctr Infect Res, Computat Biol Infect Res, Braunschweig, Germany
[2] Tech Univ Carolo Wilhelmina Braunschweig, Braunschweig Integrated Ctr Syst Biol BRICS, Braunschweig, Germany
关键词
MAXIMUM-PARSIMONY; EVOLUTION;
D O I
10.1371/journal.pcbi.1009100
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell genome sequencing provides a highly granular view of biological systems but is affected by high error rates, allelic amplification bias, and uneven genome coverage. This creates a need for data-specific computational methods, for purposes such as for cell lineage tree inference. The objective of cell lineage tree reconstruction is to infer the evolutionary process that generated a set of observed cell genomes. Lineage trees may enable a better understanding of tumor formation and growth, as well as of organ development for healthy body cells. We describe a method, Scelestial, for lineage tree reconstruction from single-cell data, which is based on an approximation algorithm for the Steiner tree problem and is a generalization of the neighbor-joining method. We adapt the algorithm to efficiently select a limited subset of potential sequences as internal nodes, in the presence of missing values, and to minimize cost by lineage tree-based missing value imputation. In a comparison against seven state-of-the-art single-cell lineage tree reconstruction algorithms-BitPhylogeny, OncoNEM, SCITE, SiFit, SASC, SCIPhI, and SiCloneFit-on simulated and real single-cell tumor samples, Scelestial performed best at reconstructing trees in terms of accuracy and run time. Scelestial has been implemented in C++. It is also available as an R package named RScelestial.
引用
收藏
页数:27
相关论文
共 37 条
[1]   Approximate Maximum Parsimony and Ancestral Maximum Likelihood [J].
Alon, Noga ;
Chor, Benny ;
Pardi, Fabio ;
Rapoport, Anat .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2010, 7 (01) :183-187
[2]  
[Anonymous], 2018, INFERRING CANC PROGR
[3]   IMPROVED APPROXIMATIONS FOR THE STEINER TREE PROBLEM [J].
BERMAN, P ;
RAMAIYER, V .
JOURNAL OF ALGORITHMS, 1994, 17 (03) :381-408
[4]   The k-Steiner ratio in graphs [J].
Borchers, A ;
Du, DZ .
SIAM JOURNAL ON COMPUTING, 1997, 26 (03) :857-869
[5]  
Bourque M, 1978, THESIS U MONTREAL MO
[6]   On wirelength estimations for row-based placement [J].
Caldwell, AE ;
Kahng, AB ;
Mantik, S ;
Markov, IL ;
Zelikovsky, A .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 1999, 18 (09) :1265-1278
[7]   A mixed integer linear programming model to reconstruct phylogenies from single nucleotide polymorphism haplotypes under the maximum parsimony criterion [J].
Catanzaro, Daniele ;
Ravi, Ramamoorthi ;
Schwartz, Russell .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2013, 8
[8]   The Steiner tree problem on graphs: Inapproximability results [J].
Chlebik, Miroslav ;
Chlebikova, Janka .
THEORETICAL COMPUTER SCIENCE, 2008, 406 (03) :207-214
[9]  
Cieslik D, 2013, STEINER RATIO, V10
[10]   miRNAs in the spotlight: Understanding cancer gene dependency [J].
Croce, Carlo M. .
NATURE MEDICINE, 2011, 17 (08) :935-936