SECANT: a biology-guided semi-supervised method for clustering, classification, and annotation of single-cell multi-omics

被引:0
|
作者
Wang, Xinjun [1 ,2 ]
Xu, Zhongli [3 ,4 ]
Hu, Haoran [1 ]
Zhou, Xueping [1 ]
Zhang, Yanfu [5 ]
Lafyatis, Robert [6 ]
Chen, Kong [6 ]
Huang, Heng [5 ]
Ding, Ying [1 ]
Duerr, Richard H. [6 ]
Chen, Wei [1 ,3 ]
机构
[1] Univ Pittsburgh, Dept Biostat, Pittsburgh, PA 15213 USA
[2] Mem Sloan Kettering Canc Ctr, Dept Epidemiol & Biostat, New York, NY 10065 USA
[3] Univ Pittsburgh, Dept Pediat, Pittsburgh, PA 15224 USA
[4] Tsinghua Univ, Sch Med, Beijing 100084, Peoples R China
[5] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15261 USA
[6] Univ Pittsburgh, Dept Med, Pittsburgh, PA 15261 USA
来源
PNAS NEXUS | 2022年 / 1卷 / 04期
基金
美国国家卫生研究院;
关键词
scRNA-Seq; CITE-Seq; single-cell multi-omics; semi-supervised learning; MESSENGER-RNA; CHROMATIN ACCESSIBILITY; INTEGRATED ANALYSIS; EXPRESSION; QUANTIFICATION; IDENTIFICATION; PROTEIN;
D O I
10.1093/pnasnexus/pgac165
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The recent advance of single cell sequencing (scRNA-seq) technology such as Cellular Indexing of Transcriptomes and Epitopes by Sequencing (CITE-seq) allows researchers to quantify cell surface protein abundance and RNA expression simultaneously at single cell resolution. Although CITE-seq and other similar technologies have gained enormous popularity, novel methods for analyzing this type of single cell multi-omics data are in urgent need. A limited number of available tools utilize data-driven approach, which may undermine the biological importance of surface protein data. In this study, we developed SECANT, a biology-guided SEmi-supervised method for Clustering, classification, and ANnoTation of single-cell multi-omics. SECANT is used to analyze CITE-seq data, or jointly analyze CITE-seq and scRNA-seq data. The novelties of SECANT include (1) using confident cell type label identified from surface protein data as guidance for cell clustering, (2) providing general annotation of confident cell types for each cell cluster, (3) utilizing cells with uncertain or missing cell type label to increase performance, and (4) accurate prediction of confident cell types for scRNA-seq data. Besides, as a model-based approach, SECANT can quantify the uncertainty of the results through easily interpretable posterior probability, and our framework can be potentially extended to handle other types of multi-omics data. We successfully demonstrated the validity and advantages of SECANT via simulation studies and analysis of public and in-house datasets from multiple tissues. We believe this new method will be complementary to existing tools for characterizing novel cell types and make new biological discoveries using single-cell multi-omics data.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Single-cell multi-omics sequencing of human early embryos
    Li, Lin
    Guo, Fan
    Gao, Yun
    Ren, Yixin
    Yuan, Peng
    Yan, Liying
    Li, Rong
    Lian, Ying
    Li, Jingyun
    Hu, Boqiang
    Gao, Junpeng
    Wen, Lu
    Tang, Fuchou
    Qiao, Jie
    NATURE CELL BIOLOGY, 2018, 20 (07) : 847 - +
  • [22] Paired single-cell multi-omics data integration with Mowgli
    Huizing, Geert-Jan
    Deutschmann, Ina Maria
    Peyre, Gabriel
    Cantini, Laura
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [23] Multi-omics integration in the age of million single-cell data
    Miao, Zhen
    Humphreys, Benjamin D.
    McMahon, Andrew P.
    Kim, Junhyong
    NATURE REVIEWS NEPHROLOGY, 2021, 17 (11) : 710 - 724
  • [24] Molecular mechanisms reconstruction from single-cell multi-omics data with HuMMuS
    Trimbour, Remi
    Deutschmann, Ina Maria
    Cantini, Laura
    BIOINFORMATICS, 2024, 40 (05)
  • [25] AdImpute: An Imputation Method for Single-Cell RNA-Seq Data Based on Semi-Supervised Autoencoders
    Xu, Li
    Xu, Yin
    Xue, Tong
    Zhang, Xinyu
    Li, Jin
    FRONTIERS IN GENETICS, 2021, 12
  • [26] Advances in Single-Cell Multi-Omics and Application in Cardiovascular Research
    Zhang, Xingwu
    Qiu, Hui
    Zhang, Fengzhi
    Ding, Shuangyuan
    FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2022, 10
  • [27] Single-Cell Multi-omics Clustering Algorithm Based on Adaptive Weighted Hyper-laplacian Regularization
    Lan, Wei
    Huang, Shengzu
    Sun, Xun
    Liao, Haibo
    Chen, Qingfeng
    Cao, Junyue
    BIOINFORMATICS RESEARCH AND APPLICATIONS, PT II, ISBRA 2024, 2024, 14955 : 373 - 382
  • [28] Progress in single-cell multimodal sequencing and multi-omics data integration
    Wang, Xuefei
    Wu, Xinchao
    Hong, Ni
    Jin, Wenfei
    BIOPHYSICAL REVIEWS, 2024, 16 (01) : 13 - 28
  • [29] Integration of single-cell multi-omics for gene regulatory network inference
    Hu, Xinlin
    Hu, Yaohua
    Wu, Fanjie
    Leung, Ricky Wai Tak
    Qin, Jing
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 : 1925 - 1938
  • [30] Single-cell multi-omics identify novel regulators required for osteoclastogenesis during aging
    Li, Hao
    Xu, Wan-Xing
    Tan, Jing-Cong
    Hong, Yue-Mei
    He, Jian
    Zhao, Ben-Peng
    Zhou, Jin-An
    Zheng, Yu-Min
    Lei, Ming
    Zheng, Xiao-Qi
    Ding, Jun
    Lu, Ning-Ning
    Gao, Jun-Jie
    Zhang, Chang-Qing
    Wang, Hui
    ISCIENCE, 2024, 27 (09)