Unsupervised topological alignment for single-cell multi-omics integration

被引:69
作者
Cao, Kai [1 ,2 ]
Bai, Xiangqi [1 ,2 ]
Hong, Yiguang [1 ,2 ]
Wan, Lin [1 ,2 ]
机构
[1] Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
D O I
10.1093/bioinformatics/btaa443
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Single-cell multi-omics data provide a comprehensive molecular view of cells. However, single-cell multi-omics datasets consist of unpaired cells measured with distinct unmatched features across modalities, making data integration challenging. Results: In this study, we present a novel algorithm, termed UnionCom, for the unsupervised topological alignment of single-cell multi-omics integration. UnionCom does not require any correspondence information, either among cells or among features. It first embeds the intrinsic low-dimensional structure of each single-cell dataset into a distance matrix of cells within the same dataset and then aligns the cells across single-cell multi-omics datasets by matching the distance matrices via a matrix optimization method. Finally, it projects the distinct unmatched features across single-cell datasets into a common embedding space for feature comparability of the aligned cells. To match the complex non-linear geometrical distorted low-dimensional structures across datasets, UnionCom proposes and adjusts a global scaling parameter on distance matrices for aligning similar topological structures. It does not require one-to-one correspondence among cells across datasets, and it can accommodate samples with dataset-specific cell types. UnionCom outperforms state-of-the-art methods on both simulated and real single-cell multi-omics datasets. UnionCom is robust to parameter choices, as well as subsampling of features.
引用
收藏
页码:48 / 56
页数:9
相关论文
共 27 条
  • [1] Amodio M., 2018, P MACHINE LEARNING R, P215
  • [2] Dimensionality reduction for visualizing single-cell data using UMAP
    Becht, Etienne
    McInnes, Leland
    Healy, John
    Dutertre, Charles-Antoine
    Kwok, Immanuel W. H.
    Ng, Lai Guan
    Ginhoux, Florent
    Newell, Evan W.
    [J]. NATURE BIOTECHNOLOGY, 2019, 37 (01) : 38 - +
  • [3] DensityPath: an algorithm to visualize and reconstruct cell state-transition path on density landscape for single-cell RNA sequencing data
    Chen, Ziwei
    An, Shaokun
    Bai, Xiangqi
    Gong, Fuzhou
    Ma, Liang
    Wan, Lin
    [J]. BIOINFORMATICS, 2019, 35 (15) : 2593 - 2601
  • [4] Single-cell multimodal profiling reveals cellular epigenetic heterogeneity
    Cheow, Lih Feng
    Courtois, Elise T.
    Tan, Yuliana
    Viswanathan, Ramya
    Xing, Qiaorui
    Tan, Rui Zhen
    Tan, Daniel S. W.
    Robson, Paul
    Loh, Yuin-Han
    Quake, Stephen R.
    Burkholder, William F.
    [J]. NATURE METHODS, 2016, 13 (10) : 833 - +
  • [5] scNMT-seq enables joint profiling of chromatin accessibility DNA methylation and transcription in single cells
    Clark, Stephen J.
    Argelaguet, Ricard
    Kapourani, Chantriolnt-Andreas
    Stubbs, Thomas M.
    Lee, Heather J.
    Alda-Catalinas, Celia
    Krueger, Felix
    Sanguinetti, Guido
    Kelsey, Gavin
    Marioni, John C.
    Stegle, Oliver
    Reik, Wolf
    [J]. NATURE COMMUNICATIONS, 2018, 9
  • [6] Cui Z., 2014, Advances in Neural Information Processing Systems
  • [7] Cui Z, 2012, PROC CVPR IEEE, P2626, DOI 10.1109/CVPR.2012.6247982
  • [8] Tuning photoluminescence energy and fine structure splitting in single quantum dots by uniaxial stress
    SKLSM, Institute of Semiconductors, Chinese Academy of Sciences, PO Box 912, Beijing 100083, China
    [J]. Chin. Phys. Lett., 2008, 3 (1120-1123):
  • [9] Computational methods for single-cell omics across modalities
    Efremova, Mirjana
    Teichmann, Sarah A.
    [J]. NATURE METHODS, 2020, 17 (01) : 14 - 17
  • [10] Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors
    Haghverdi, Laleh
    Lun, Aaron T. L.
    Morgan, Michael D.
    Marioni, John C.
    [J]. NATURE BIOTECHNOLOGY, 2018, 36 (05) : 421 - +