SECANT: a biology-guided semi-supervised method for clustering, classification, and annotation of single-cell multi-omics

Cited by: 0
Authors
Wang, Xinjun [1, 2]
Xu, Zhongli [3, 4]
Hu, Haoran [1 ]
Zhou, Xueping [1 ]
Zhang, Yanfu [5 ]
Lafyatis, Robert [6 ]
Chen, Kong [6 ]
Huang, Heng [5 ]
Ding, Ying [1 ]
Duerr, Richard H. [6 ]
Chen, Wei [1, 3]
Affiliations
[1] Univ Pittsburgh, Dept Biostat, Pittsburgh, PA 15213 USA
[2] Mem Sloan Kettering Canc Ctr, Dept Epidemiol & Biostat, New York, NY 10065 USA
[3] Univ Pittsburgh, Dept Pediat, Pittsburgh, PA 15224 USA
[4] Tsinghua Univ, Sch Med, Beijing 100084, Peoples R China
[5] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15261 USA
[6] Univ Pittsburgh, Dept Med, Pittsburgh, PA 15261 USA
Source
PNAS NEXUS | 2022 / Vol. 1 / Issue 04
Funding
National Institutes of Health (NIH), USA;
Keywords
scRNA-Seq; CITE-Seq; single-cell multi-omics; semi-supervised learning; MESSENGER-RNA; CHROMATIN ACCESSIBILITY; INTEGRATED ANALYSIS; EXPRESSION; QUANTIFICATION; IDENTIFICATION; PROTEIN;
DOI
10.1093/pnasnexus/pgac165
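Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code
07; 0710; 09;
Abstract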
Recent advances in single-cell RNA sequencing (scRNA-seq) technologies, such as Cellular Indexing of Transcriptomes and Epitopes by Sequencing (CITE-seq), allow researchers to quantify cell surface protein abundance and RNA expression simultaneously at single-cell resolution. Although CITE-seq and similar technologies have gained enormous popularity, novel methods for analyzing this type of single-cell multi-omics data are urgently needed. The few available tools take a data-driven approach, which may undermine the biological importance of surface protein data. In this study, we developed SECANT, a biology-guided SEmi-supervised method for Clustering, classification, and ANnoTation of single-cell multi-omics. SECANT can be used to analyze CITE-seq data alone, or to jointly analyze CITE-seq and scRNA-seq data. The novelties of SECANT include (1) using confident cell type labels identified from surface protein data as guidance for cell clustering, (2) providing a general annotation of confident cell types for each cell cluster, (3) utilizing cells with uncertain or missing cell type labels to increase performance, and (4) accurately predicting confident cell types for scRNA-seq data. In addition, as a model-based approach, SECANT can quantify the uncertainty of its results through easily interpretable posterior probabilities, and the framework can potentially be extended to handle other types of multi-omics data. We demonstrated the validity and advantages of SECANT via simulation studies and analyses of public and in-house datasets from multiple tissues. We believe this new method will be complementary to existing tools for characterizing novel cell types and making new biological discoveries using single-cell multi-omics data.
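The general idea of letting confident labels guide a model-based clustering, while still reporting posterior probabilities for unlabeled cells, can be illustrated with a small sketch. This is not the SECANT model or its API: it is a generic semi-supervised Gaussian mixture with diagonal covariances, written under the simplifying assumptions that each label maps to exactly one cluster and that every cluster has at least one labeled cell. The function name `semi_supervised_gmm`, its parameters, and the toy data are hypothetical.

```python
import numpy as np

def semi_supervised_gmm(X, labels, n_clusters, n_iter=100, var_floor=1e-6):
    """Illustrative EM for a diagonal Gaussian mixture with partial labels.

    labels[i] >= 0 marks a confidently labeled cell (assumed: label == cluster);
    labels[i] == -1 marks a cell with an uncertain or missing label.
    Assumes every cluster has at least one labeled cell.
    """
    n, _ = X.shape
    labeled = labels >= 0
    # Responsibilities start as one-hot rows for labeled cells and zero rows
    # for unlabeled cells, so the first M-step uses labeled cells only.
    resp = np.zeros((n, n_clusters))
    resp[labeled] = np.eye(n_clusters)[labels[labeled]]
    for _ in range(n_iter):
        # M-step: weighted mixing proportions, means, and per-feature variances.
        weight = resp.sum(axis=0)
        pi = weight / weight.sum()
        mu = (resp.T @ X) / weight[:, None]
        var = np.maximum((resp.T @ X**2) / weight[:, None] - mu**2, 0.0) + var_floor
        # E-step: posterior cluster probabilities, computed in log space.
        diff = X[:, None, :] - mu[None, :, :]
        log_p = -0.5 * (np.log(2.0 * np.pi * var)[None, :, :]
                        + diff**2 / var[None, :, :]).sum(axis=2)
        log_p += np.log(pi)[None, :]
        log_p -= log_p.max(axis=1, keepdims=True)
        post = np.exp(log_p)
        post /= post.sum(axis=1, keepdims=True)
        # Labeled cells keep their one-hot assignment; the rest take the posterior.
        resp = np.where(labeled[:, None], resp, post)
    return resp, mu

# Toy data (hypothetical): two well-separated groups, five labeled cells each.
rng = np.random.default_rng(0)
truth = np.repeat([0, 1], 100)
centers = np.array([[0.0, 0.0], [5.0, 5.0]])
X = centers[truth] + rng.normal(size=(200, 2))
labels = np.full(200, -1)
labels[:5] = 0
labels[100:105] = 1

resp, mu = semi_supervised_gmm(X, labels, n_clusters=2)
acc = (resp.argmax(axis=1) == truth).mean()
print("agreement with simulated truth:", acc)
```

Each row of `resp` is a posterior probability vector over clusters, which is the kind of uncertainty summary a model-based method can report. A purely data-driven clustering would omit the step that pins labeled cells to their known cluster.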
Pages: 14
Related papers
50 in total
  • [41] scMultiSim: simulation of single-cell multi-omics and spatial data guided by gene regulatory networks and cell-cell interactions
    Li, Hechen
    Zhang, Ziqi
    Squires, Michael
    Chen, Xi
    Zhang, Xiuwei
    NATURE METHODS, 2025, : 982 - 993
  • [42] scSSA: A clustering method for single cell RNA-seq data based on semi-supervised autoencoder
    Zhao, Jian-Ping
    Hou, Tong-Shuai
    Su, Yansen
    Zheng, Chun-Hou
    METHODS, 2022, 208 : 66 - 74
  • [43] New Anti-Fibrotic Strategies for Keloids: Insights From Single-Cell Multi-Omics
    Zhao, Songyun
    Xie, Jiaheng
    Zhang, Qian
    Ni, Tianyi
    Lin, Jinde
    Gao, Weicheng
    Zhao, Liping
    Yi, Min
    Tu, Liying
    Zhang, Pengpeng
    Wu, Dan
    Tang, Qikai
    Ma, Chenfeng
    He, Yucang
    Li, Liqun
    Wu, Guoping
    Yan, Wei
    CELL PROLIFERATION, 2025,
  • [44] Current and future perspectives of single-cell multi-omics technologies in cardiovascular research
    Tan, Wilson Lek Wen
    Seow, Wei Qiang
    Zhang, Angela
    Rhee, Siyeon
    Wong, Wing H.
    Greenleaf, William J.
    Wu, Joseph C.
    NATURE CARDIOVASCULAR RESEARCH, 2023, 2 (01): : 20 - 34
  • [45] Single-cell multi-omics of human preimplantation embryos shows susceptibility to glucocorticoids
    Zhao, Cheng
    Biondic, Savana
    Vandal, Katherine
    Bjoerklund, Asa K.
    Hagemann-Jensen, Michael
    Sommer, Theresa Maria
    Canizo, Jesica
    Clark, Stephen
    Raymond, Pascal
    Zenklusen, Daniel R. R.
    Rivron, Nicolas
    Reik, Wolf
    Petropoulos, Sophie
    GENOME RESEARCH, 2022, 32 (09) : 1627 - 1641
  • [46] ScImmOmics: a manually curated resource of single-cell multi-omics immune data
    Li, Yan-Yu
    Zhou, Li-Wei
    Qian, Feng-Cui
    Fang, Qiao-Li
    Yu, Zheng-Min
    Cui, Ting
    Dong, Fu-Juan
    Cai, Fu-Hong
    Yu, Ting-Ting
    Li, Li-Dong
    Wang, Qiu-Yu
    Zhu, Yan-Bing
    Tang, Hui-Fang
    Hu, Bao-Yang
    Li, Chun-Quan
    NUCLEIC ACIDS RESEARCH, 2024, 53 (D1) : D1162 - D1172
  • [48] Single-Cell Multi-omics: An Engine for New Quantitative Models of Gene Regulation
    Packer, Jonathan
    Trapnell, Cole
    TRENDS IN GENETICS, 2018, 34 (09) : 653 - 665
  • [49] Multi-omics at single-cell resolution: comparison of experimental and data fusion approaches
    Leonavicius, Karolis
    Nainys, Juozas
    Kuciauskas, Dalius
    Mazutis, Linas
    CURRENT OPINION IN BIOTECHNOLOGY, 2019, 55 : 159 - 166
  • [50] Integration of single-cell multi-omics data by regression analysis on unpaired observations
    Yuan, Qiuyue
    Duren, Zhana
    GENOME BIOLOGY, 2022, 23 (01)