Resolving single-cell copy number profiling for large datasets

被引:6
作者
Wang Ruohan [1 ]
Zhang Yuwei [1 ]
Wang Mengbo [1 ]
Feng Xikang [2 ]
Wang Jianping [1 ]
Li Shuai Cheng [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Northwestern Polytech Univ, Sch Software, Xian, Shaanxi, Peoples R China
关键词
single-cell sequencing; copy number variation; cross-sample breakpoint detection; structural information theory;
D O I
10.1093/bib/bbac264
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The advances of single-cell DNA sequencing (scDNA-seq) enable us to characterize the genetic heterogeneity of cancer cells. However, the high noise and low coverage of scDNA-seq impede the estimation of copy number variations (CNVs). In addition, existing tools suffer from intensive execution time and often fail on large datasets. Here, we propose SeCNV, an efficient method that leverages structural entropy, to profile the copy numbers. SeCNV adopts a local Gaussian kernel to construct a matrix, depth congruent map (DCM), capturing the similarities between any two bins along the genome. Then, SeCNV partitions the genome into segments by minimizing the structural entropy from the DCM. With the partition, SeCNV estimates the copy numbers within each segment for cells. We simulate nine datasets with various breakpoint distributions and amplitudes of noise to benchmark SeCNV. SeCNV achieves a robust performance, i.e. the Fl-scores are higher than 0.95 for breakpoint detections, significantly outperforming state-of-the-art methods. SeCNV successfully processes large datasets (>50 000 cells) within 4 min, while other tools fail to finish within the time limit, i.e. 120 h. We apply SeCNV to single-nucleus sequencing datasets from two breast cancer patients and acoustic cell tagmentation sequencing datasets from eight breast cancer patients. SeCNV successfully reproduces the distinct subclones and infers tumor heterogeneity. SeCNV is available at https://github.com/deepomicslab/SeCNV.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] [Anonymous], 2019, Picard Toolkit
  • [2] Genome-wide copy number analysis of single cells
    Baslan, Timour
    Kendall, Jude
    Rodgers, Linda
    Cox, Hilary
    Riggs, Mike
    Stepansky, Asya
    Troge, Jennifer
    Ravi, Kandasamy
    Esposito, Diane
    Lakshmi, B.
    Wigler, Michael
    Navin, Nicholas
    Hicks, James
    [J]. NATURE PROTOCOLS, 2012, 7 (06) : 1024 - 1041
  • [3] Somatic variant analysis suite: copy number variation clonal visualization online platform for large-scale single-cell genomics
    Chen, Lingxi
    Qing, Yuhao
    Li, Ruikang
    Li, Chaohui
    Li, Hechen
    Feng, Xikang
    Li, Shuai Cheng
    [J]. BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [4] Human genes involved in copy number variation: mechanisms of origin, functional effects and implications for disease
    de Smith, A. J.
    Walters, R. G.
    Froguel, P.
    Blakemore, A. I.
    [J]. CYTOGENETIC AND GENOME RESEARCH, 2008, 123 (1-4) : 17 - 26
  • [5] Ester M., 1996, P 2 INT C KNOWL DISC, DOI DOI 10.5555/3001460.3001507
  • [6] Everitt B. S., 2010, CAMBRIDGE DICT STAT
  • [7] Gene copy number variation and common human disease
    Fanciulli, M.
    Petretto, E.
    Aitman, T. J.
    [J]. CLINICAL GENETICS, 2010, 77 (03) : 201 - 213
  • [8] Transcriptome and metabolome analysis reveals anthocyanin biosynthesis pathway associated with ramie (Boehmeria nivea (L.) Gaud.) leaf color formation
    Feng, Xinkang
    Gao, Gang
    Yu, Chunming
    Zhu, Aiguo
    Chen, Jikang
    Chen, Kunmei
    Wang, Xiaofei
    Abubakar, Aminu Shehu
    Chen, Ping
    [J]. BMC GENOMICS, 2021, 22 (01)
  • [9] Noncoding copy-number variations are associated with congenital limb malformation
    Floettmann, Ricarda
    Kragesteen, Bjort K.
    Geuer, Sinje
    Socha, Magdalena
    Allou, Lila
    Sowinska-Seidler, Anna
    de Jarcy, Laure Bosquillon
    Wagner, Johannes
    Jamsheer, Aleksander
    Oehl-Jaschkowitz, Barbara
    Wittler, Lars
    de Silva, Deepthi
    Kurth, Ingo
    Maya, Idit
    Santos-Simarro, Fernando
    Huelsemann, Wiebke
    Klopocki, Eva
    Mountford, Roger
    Fryer, Alan
    Borck, Guntram
    Horn, Denise
    Lapunzina, Pablo
    Wilson, Meredith
    Mascrez, Benedicte
    Duboule, Denis
    Mundlos, Stefan
    Spielmann, Malte
    [J]. GENETICS IN MEDICINE, 2018, 20 (06) : 599 - 607
  • [10] Copy number variation: New insights in genome diversity
    Freeman, Jennifer L.
    Perry, George H.
    Feuk, Lars
    Redon, Richard
    McCarroll, Steven A.
    Altshuler, David M.
    Aburatani, Hiroyuki
    Jones, Keith W.
    Tyler-Smith, Chris
    Hurles, Matthew E.
    Carter, Nigel P.
    Scherer, Stephen W.
    Lee, Charles
    [J]. GENOME RESEARCH, 2006, 16 (08) : 949 - 961