SECANT: a biology-guided semi-supervised method for clustering, classification, and annotation of single-cell multi-omics

被引:0
|
作者
Wang, Xinjun [1 ,2 ]
Xu, Zhongli [3 ,4 ]
Hu, Haoran [1 ]
Zhou, Xueping [1 ]
Zhang, Yanfu [5 ]
Lafyatis, Robert [6 ]
Chen, Kong [6 ]
Huang, Heng [5 ]
Ding, Ying [1 ]
Duerr, Richard H. [6 ]
Chen, Wei [1 ,3 ]
机构
[1] Univ Pittsburgh, Dept Biostat, Pittsburgh, PA 15213 USA
[2] Mem Sloan Kettering Canc Ctr, Dept Epidemiol & Biostat, New York, NY 10065 USA
[3] Univ Pittsburgh, Dept Pediat, Pittsburgh, PA 15224 USA
[4] Tsinghua Univ, Sch Med, Beijing 100084, Peoples R China
[5] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15261 USA
[6] Univ Pittsburgh, Dept Med, Pittsburgh, PA 15261 USA
来源
PNAS NEXUS | 2022年 / 1卷 / 04期
基金
美国国家卫生研究院;
关键词
scRNA-Seq; CITE-Seq; single-cell multi-omics; semi-supervised learning; MESSENGER-RNA; CHROMATIN ACCESSIBILITY; INTEGRATED ANALYSIS; EXPRESSION; QUANTIFICATION; IDENTIFICATION; PROTEIN;
D O I
10.1093/pnasnexus/pgac165
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The recent advance of single cell sequencing (scRNA-seq) technology such as Cellular Indexing of Transcriptomes and Epitopes by Sequencing (CITE-seq) allows researchers to quantify cell surface protein abundance and RNA expression simultaneously at single cell resolution. Although CITE-seq and other similar technologies have gained enormous popularity, novel methods for analyzing this type of single cell multi-omics data are in urgent need. A limited number of available tools utilize data-driven approach, which may undermine the biological importance of surface protein data. In this study, we developed SECANT, a biology-guided SEmi-supervised method for Clustering, classification, and ANnoTation of single-cell multi-omics. SECANT is used to analyze CITE-seq data, or jointly analyze CITE-seq and scRNA-seq data. The novelties of SECANT include (1) using confident cell type label identified from surface protein data as guidance for cell clustering, (2) providing general annotation of confident cell types for each cell cluster, (3) utilizing cells with uncertain or missing cell type label to increase performance, and (4) accurate prediction of confident cell types for scRNA-seq data. Besides, as a model-based approach, SECANT can quantify the uncertainty of the results through easily interpretable posterior probability, and our framework can be potentially extended to handle other types of multi-omics data. We successfully demonstrated the validity and advantages of SECANT via simulation studies and analysis of public and in-house datasets from multiple tissues. We believe this new method will be complementary to existing tools for characterizing novel cell types and make new biological discoveries using single-cell multi-omics data.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Unraveling Heterogeneity in Transcriptome and Its Regulation Through Single-Cell Multi-Omics Technologies
    Xing, Qiao Rui
    Cipta, Nadia Omega
    Hamashima, Kiyofumi
    Liou, Yih-Cherng
    Koh, Cheng Gee
    Loh, Yuin-Han
    FRONTIERS IN GENETICS, 2020, 11
  • [32] The frontier of precision medicine: application of single-cell multi-omics in preimplantation genetic diagnosis
    Zhang, Jinglei
    Zhang, Nan
    Mai, Qingyun
    Zhou, Canquan
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2024, 23 (06) : 726 - 732
  • [33] Leveraging Single-Cell Multi-Omics to Decode Tumor Microenvironment Diversity and Therapeutic Resistance
    Sabit, Hussein
    Arneth, Borros
    Pawlik, Timothy M.
    Abdel-Ghany, Shaimaa
    Ghazy, Aysha
    Abdelazeem, Rawan M.
    Alqosaibi, Amany
    Al-Dhuayan, Ibtesam S.
    Almulhim, Jawaher
    Alrabiah, Noof A.
    Hashash, Ahmed
    PHARMACEUTICALS, 2025, 18 (01)
  • [34] Single-cell multi-omics sequencing and its application in tumor heterogeneity
    Sun, Yuqing
    Liu, Zhiyu
    Fu, Yue
    Yang, Yuwei
    Lu, Junru
    Pan, Min
    Wen, Tian
    Xie, Xueying
    Bai, Yunfei
    Ge, Qinyu
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2023, 22 (04) : 313 - 328
  • [35] Gene regulatory network inference in the era of single-cell multi-omics
    Badia-i-Mompel, Pau
    Wessels, Lorna
    Mueller-Dott, Sophia
    Trimbour, Remi
    Flores, Ricardo Ramirez O.
    Argelaguet, Ricard
    Saez-Rodriguez, Julio
    NATURE REVIEWS GENETICS, 2023, 24 (11) : 739 - 754
  • [36] Deconvolution of single-cell multi-omics layers reveals regulatory heterogeneity
    Liu, Longqi
    Liu, Chuanyu
    Quintero, Andres
    Wu, Liang
    Yuan, Yue
    Wang, Mingyue
    Cheng, Mengnan
    Leng, Lizhi
    Xu, Liqin
    Dong, Guoyi
    Li, Rui
    Liu, Yang
    Wei, Xiaoyu
    Xu, Jiangshan
    Chen, Xiaowei
    Lu, Haorong
    Chen, Dongsheng
    Wang, Quanlei
    Zhou, Qing
    Lin, Xinxin
    Li, Guibo
    Liu, Shiping
    Wang, Qi
    Wang, Hongru
    Fink, J. Lynn
    Gao, Zhengliang
    Liu, Xin
    Hou, Yong
    Zhu, Shida
    Yang, Huanming
    Ye, Yunming
    Lin, Ge
    Chen, Fang
    Herrmann, Carl
    Eils, Roland
    Shang, Zhouchun
    Xu, Xun
    NATURE COMMUNICATIONS, 2019, 10 (1)
  • [37] Charting plant gene functions in the multi-omics and single-cell era
    Depuydt, Thomas
    De Rybel, Bert
    Vandepoele, Klaas
    TRENDS IN PLANT SCIENCE, 2023, 28 (03) : 283 - 296
  • [38] Sequencing-based methods for single-cell multi-omics studies
    Shanshan Qin
    Songmei Liu
    Xiaocheng Weng
    Science China Chemistry, 2023, 66 : 3024 - 3043
  • [39] LMSVCR: novel effective method of semi-supervised multi-classification
    Dong, Zijie
    Qin, Yimo
    Zou, Bin
    Xu, Jie
    Tang, Yuan Yan
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (05) : 3857 - 3873
  • [40] LMSVCR: novel effective method of semi-supervised multi-classification
    Zijie Dong
    Yimo Qin
    Bin Zou
    Jie Xu
    Yuan Yan Tang
    Neural Computing and Applications, 2022, 34 : 3857 - 3873