Robust single-cell Hi-C clustering by convolution- and random-walk-based imputation

被引:87
作者
Zhou, Jingtian [1 ,2 ]
Ma, Jianzhu [3 ]
Chen, Yusi [4 ,5 ]
Cheng, Chuankai [6 ]
Bao, Bokan [2 ]
Peng, Jian [7 ]
Sejnowski, Terrence J. [4 ,5 ]
Dixon, Jesse R. [8 ]
Ecker, Joseph R. [1 ,9 ]
机构
[1] Salk Inst Biol Studies, Genom Anal Lab, La Jolla, CA 92037 USA
[2] Univ Calif San Diego, Bioinformat & Syst Biol Program, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Dept Med, La Jolla, CA 92093 USA
[4] Salk Inst Biol Studies, Computat Neurobiol Lab, La Jolla, CA 92037 USA
[5] Univ Calif San Diego, Div Biol Sci, La Jolla, CA 92093 USA
[6] Univ Calif San Diego, Dept Bioengn, La Jolla, CA 92093 USA
[7] Univ Illinois, gDept Comp Sci, Urbana, IL 61801 USA
[8] Salk Inst Biol Studies, Peptide Biol Lab, La Jolla, CA 92037 USA
[9] Salk Inst Biol Studies, Howard Hughes Med Inst, La Jolla, CA 92037 USA
关键词
single cell; Hi-C; 3D chromosome structure; random walk; CHROMATIN ACCESSIBILITY; REVEALS PRINCIPLES; GENOME; DYNAMICS; REORGANIZATION; ORGANIZATION; DOMAINS;
D O I
10.1073/pnas.1901423116
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Three-dimensional genome structure plays a pivotal role in gene regulation and cellular function. Single-cell analysis of genome architecture has been achieved using imaging and chromatin conformation capture methods such as Hi-C. To study variation in chromosome structure between different cell types, computational approaches are needed that can utilize sparse and heterogeneous single-cell Hi-C data. However, few methods exist that are able to accurately and efficiently cluster such data into constituent cell types. Here, we describe scHiCluster, a single-cell clustering algorithm for Hi-C contact matrices that is based on imputations using linear convolution and random walk. Using both simulated and real single-cell Hi-C data as benchmarks, scHiCluster significantly improves clustering accuracy when applied to low coverage datasets compared with existing methods. After imputation by scHiCluster, topologically associating domain (TAD)-like structures (TLSs) can be identified within single cells, and their consensus boundaries were enriched at the TAD boundaries observed in bulk cell Hi-C samples. In summary, scHiCluster facilitates visualization and comparison of single-cell 3D genomes.
引用
收藏
页码:14011 / 14018
页数:8
相关论文
共 41 条
  • [1] [Anonymous], 2004, ACM SIGKDD
  • [2] Super-resolution chromatin tracing reveals domains and cooperative interactions in single cells
    Bintu, Bogdan
    Mateo, Leslie J.
    Su, Jun-Han
    Sinnott-Armstrong, Nicholas A.
    Parker, Mirae
    Kinrot, Seon
    Yamaya, Kei
    Boettiger, Alistair N.
    Zhuang, Xiaowei
    [J]. SCIENCE, 2018, 362 (6413) : 419 - +
  • [3] Multiscale 3D Genome Rewiring during Mouse Neural Development
    Bonev, Boyan
    Cohen, Netta Mendelson
    Szabo, Quentin
    Fritsch, Lauriane
    Papadopoulos, Giorgio L.
    Lubling, Yaniv
    Xu, Xiaole
    Lv, Xiaodan
    Hugnot, Jean-Philippe
    Tanay, Amos
    Cavalli, Giacomo
    [J]. CELL, 2017, 171 (03) : 557 - +
  • [4] Single-cell chromatin accessibility reveals principles of regulatory variation
    Buenostro, Jason D.
    Wu, Beijing
    Litzenburger, Ulrike M.
    Ruff, Dave
    Gonzales, Michael L.
    Snyder, Michael P.
    Chang, Howard Y.
    Greenleaf, William J.
    [J]. NATURE, 2015, 523 (7561) : 486 - U264
  • [5] The CXCR4 chemokine receptor in acute and chronic leukaemia:: a marrow homing receptor and potential therapeutic target
    Burger, Jan A.
    Buerkle, Andrea
    [J]. BRITISH JOURNAL OF HAEMATOLOGY, 2007, 137 (04) : 288 - 296
  • [6] Network propagation: a universal amplifier of genetic associations
    Cowen, Lenore
    Ideker, Trey
    Raphael, Benjamin J.
    Sharan, Roded
    [J]. NATURE REVIEWS GENETICS, 2017, 18 (09) : 551 - 562
  • [7] The cis-regulatory dynamics of embryonic development at single-cell resolution
    Cusanovich, Darren A.
    Reddington, James P.
    Garfield, David A.
    Daza, Riza M.
    Aghamirzaie, Delasa
    Marco-Ferreres, Raquel
    Pliner, Hannah A.
    Christiansen, Lena
    Qiu, Xiaojie
    Steemers, Frank J.
    Trapnell, Cole
    Shendure, Jay
    Furlong, Eileen E. M.
    [J]. NATURE, 2018, 555 (7697) : 538 - +
  • [8] Multiplex single-cell profiling of chromatin accessibility by combinatorial cellular indexing
    Cusanovich, Darren A.
    Daza, Riza
    Adey, Andrew
    Pliner, Hannah A.
    Christiansen, Lena
    Gunderson, Kevin L.
    Steemers, Frank J.
    Trapnell, Cole
    Shendure, Jay
    [J]. SCIENCE, 2015, 348 (6237) : 910 - 914
  • [9] Chromatin architecture reorganization during stem cell differentiation
    Dixon, Jesse R.
    Jung, Inkyung
    Selvaraj, Siddarth
    Shen, Yin
    Antosiewicz-Bourget, Jessica E.
    Lee, Ah Young
    Ye, Zhen
    Kim, Audrey
    Rajagopal, Nisha
    Xie, Wei
    Diao, Yarui
    Liang, Jing
    Zhao, Huimin
    Lobanenkov, Victor V.
    Ecker, Joseph R.
    Thomson, James A.
    Ren, Bing
    [J]. NATURE, 2015, 518 (7539) : 331 - 336
  • [10] Topological domains in mammalian genomes identified by analysis of chromatin interactions
    Dixon, Jesse R.
    Selvaraj, Siddarth
    Yue, Feng
    Kim, Audrey
    Li, Yan
    Shen, Yin
    Hu, Ming
    Liu, Jun S.
    Ren, Bing
    [J]. NATURE, 2012, 485 (7398) : 376 - 380