SECANT: a biology-guided semi-supervised method for clustering, classification, and annotation of single-cell multi-omics

被引:0
|
作者
Wang, Xinjun [1 ,2 ]
Xu, Zhongli [3 ,4 ]
Hu, Haoran [1 ]
Zhou, Xueping [1 ]
Zhang, Yanfu [5 ]
Lafyatis, Robert [6 ]
Chen, Kong [6 ]
Huang, Heng [5 ]
Ding, Ying [1 ]
Duerr, Richard H. [6 ]
Chen, Wei [1 ,3 ]
机构
[1] Univ Pittsburgh, Dept Biostat, Pittsburgh, PA 15213 USA
[2] Mem Sloan Kettering Canc Ctr, Dept Epidemiol & Biostat, New York, NY 10065 USA
[3] Univ Pittsburgh, Dept Pediat, Pittsburgh, PA 15224 USA
[4] Tsinghua Univ, Sch Med, Beijing 100084, Peoples R China
[5] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15261 USA
[6] Univ Pittsburgh, Dept Med, Pittsburgh, PA 15261 USA
来源
PNAS NEXUS | 2022年 / 1卷 / 04期
基金
美国国家卫生研究院;
关键词
scRNA-Seq; CITE-Seq; single-cell multi-omics; semi-supervised learning; MESSENGER-RNA; CHROMATIN ACCESSIBILITY; INTEGRATED ANALYSIS; EXPRESSION; QUANTIFICATION; IDENTIFICATION; PROTEIN;
D O I
10.1093/pnasnexus/pgac165
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The recent advance of single cell sequencing (scRNA-seq) technology such as Cellular Indexing of Transcriptomes and Epitopes by Sequencing (CITE-seq) allows researchers to quantify cell surface protein abundance and RNA expression simultaneously at single cell resolution. Although CITE-seq and other similar technologies have gained enormous popularity, novel methods for analyzing this type of single cell multi-omics data are in urgent need. A limited number of available tools utilize data-driven approach, which may undermine the biological importance of surface protein data. In this study, we developed SECANT, a biology-guided SEmi-supervised method for Clustering, classification, and ANnoTation of single-cell multi-omics. SECANT is used to analyze CITE-seq data, or jointly analyze CITE-seq and scRNA-seq data. The novelties of SECANT include (1) using confident cell type label identified from surface protein data as guidance for cell clustering, (2) providing general annotation of confident cell types for each cell cluster, (3) utilizing cells with uncertain or missing cell type label to increase performance, and (4) accurate prediction of confident cell types for scRNA-seq data. Besides, as a model-based approach, SECANT can quantify the uncertainty of the results through easily interpretable posterior probability, and our framework can be potentially extended to handle other types of multi-omics data. We successfully demonstrated the validity and advantages of SECANT via simulation studies and analysis of public and in-house datasets from multiple tissues. We believe this new method will be complementary to existing tools for characterizing novel cell types and make new biological discoveries using single-cell multi-omics data.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Clustering of single-cell multi-omics data with a multimodal deep learning method
    Lin, Xiang
    Tian, Tian
    Wei, Zhi
    Hakonarson, Hakon
    NATURE COMMUNICATIONS, 2022, 13 (01)
  • [2] Single-cell sequencing to multi-omics: technologies and applications
    Wu, Xiangyu
    Yang, Xin
    Dai, Yunhan
    Zhao, Zihan
    Zhu, Junmeng
    Guo, Hongqian
    Yang, Rong
    BIOMARKER RESEARCH, 2024, 12 (01)
  • [3] scMNMF: a novel method for single-cell multi-omics clustering based on matrix factorization
    Qiu, Yushan
    Guo, Dong
    Zhao, Pu
    Zou, Quan
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (03)
  • [4] Advances in bulk and single-cell multi-omics approaches for systems biology and precision medicine
    Li, Yunjin
    Ma, Lu
    Wu, Duojiao
    Chen, Geng
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
  • [5] CASSL: A cell-type annotation method for single cell transcriptomics data using semi-supervised learning
    Seal, Dibyendu Bikash
    Das, Vivek
    De, Rajat K.
    APPLIED INTELLIGENCE, 2023, 53 (02) : 1287 - 1305
  • [6] The technological landscape and applications of single-cell multi-omics
    Baysoy, Alev
    Bai, Zhiliang
    Satija, Rahul
    Fan, Rong
    NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2023, 24 (10) : 695 - 713
  • [7] Sequencing-based methods for single-cell multi-omics studies
    Qin, Shanshan
    Liu, Songmei
    Weng, Xiaocheng
    SCIENCE CHINA-CHEMISTRY, 2023, 66 (11) : 3024 - 3043
  • [8] DEMOC: a deep embedded multi-omics learning approach for clustering single-cell CITE-seq data
    Zou, Guanhua
    Lin, Yilong
    Han, Tianyang
    Le Ou-Yang
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (05)
  • [9] Advances in single-cell multi-omics profiling
    Bai, Dongsheng
    Peng, Jinying
    Yi, Chengqi
    RSC CHEMICAL BIOLOGY, 2021, 2 (02): : 441 - 449
  • [10] Single-cell multi-omics advances in lymphoma research (Review)
    Jin, Chanjuan
    Zhou, Di
    Li, Jun
    Bi, Lintao
    Li, Lisha
    ONCOLOGY REPORTS, 2023, 50 (04)