scMAE: a masked autoencoder for single-cell RNA-seq clustering

被引:7
|
作者
Fang, Zhaoyu [1 ]
Zheng, Ruiqing [1 ]
Li, Min [1 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, 932 South Lushan Rd, Changsha 410083, Peoples R China
基金
中国国家自然科学基金;
关键词
HETEROGENEITY; MODEL;
D O I
10.1093/bioinformatics/btae020
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Single-cell RNA sequencing has emerged as a powerful technology for studying gene expression at the individual cell level. Clustering individual cells into distinct subpopulations is fundamental in scRNA-seq data analysis, facilitating the identification of cell types and exploration of cellular heterogeneity. Despite the recent development of many deep learning-based single-cell clustering methods, few have effectively exploited the correlations among genes, resulting in suboptimal clustering outcomes.Results Here, we propose a novel masked autoencoder-based method, scMAE, for cell clustering. scMAE perturbs gene expression and employs a masked autoencoder to reconstruct the original data, learning robust and informative cell representations. The masked autoencoder introduces a masking predictor, which captures relationships among genes by predicting whether gene expression values are masked. By integrating this masking mechanism, scMAE effectively captures latent structures and dependencies in the data, enhancing clustering performance. We conducted extensive comparative experiments using various clustering evaluation metrics on 15 scRNA-seq datasets from different sequencing platforms. Experimental results indicate that scMAE outperforms other state-of-the-art methods on these datasets. In addition, scMAE accurately identifies rare cell types, which are challenging to detect due to their low abundance. Furthermore, biological analyses confirm the biological significance of the identified cell subpopulations.Availability and implementation The source code of scMAE is available at: https://zenodo.org/records/10465991.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] A hybrid deep clustering approach for robust cell type profiling using single-cell RNA-seq data
    Srinivasan, Suhas
    Leshchyk, Anastasia
    Johnson, Nathan T.
    Korkin, Dmitry
    RNA, 2020, 26 (10) : 1303 - 1319
  • [42] scMUG: deep clustering analysis of single-cell RNA-seq data on multiple gene functional modules
    Liang, De-Min
    Du, Pu-Feng
    BRIEFINGS IN BIOINFORMATICS, 2025, 26 (02)
  • [43] GRACE: A Graph-Based Cluster Ensemble Approach for Single-Cell RNA-Seq Data Clustering
    Guan, Jihong
    Li, Rui-Yi
    Wang, Jiasheng
    IEEE ACCESS, 2020, 8 : 166730 - 166741
  • [44] LAK: Lasso and K-Means Based Single-Cell RNA-Seq Data Clustering Analysis
    Hua, Jiao
    Liu, Hongkun
    Zhang, Boyang
    Jin, Shuilin
    IEEE ACCESS, 2020, 8 : 129679 - 129688
  • [45] scHFC: a hybrid fuzzy clustering method for single-cell RNA-seq data optimized by natural computation
    Wang, Jing
    Xia, Junfeng
    Tan, Dayu
    Lin, Rongxin
    Su, Yansen
    Zheng, Chun-Hou
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (02)
  • [46] Single-cell RNA-seq and bulk RNA-seq explore the prognostic value of exhausted T cells in hepatocellular carcinoma
    Tang, Xiaolong
    Miao, Yandong
    Yang, Lixia
    Ha, Wuhua
    Li, Zheng
    Mi, Denghai
    IET SYSTEMS BIOLOGY, 2023, 17 (04) : 228 - 244
  • [47] scBKAP: A Clustering Model for Single-Cell RNA-Seq Data Based on Bisecting K-Means
    Wang, Xiaolin
    Gao, Hongli
    Qi, Ren
    Zheng, Ruiqing
    Gao, Xin
    Yu, Bin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (03) : 2007 - 2015
  • [48] Identification of cancer subtypes from single-cell RNA-seq data using a consensus clustering method
    Gan, Yanglan
    Li, Ning
    Zou, Guobing
    Xin, Yongchang
    Guan, Jihong
    BMC MEDICAL GENOMICS, 2018, 11
  • [49] Deep Batch Integration and Denoise of Single-Cell RNA-Seq Data
    Qin, Lu
    Zhang, Guangya
    Zhang, Shaoqiang
    Chen, Yong
    ADVANCED SCIENCE, 2024, 11 (29)
  • [50] Systematic comparison of high-throughput single-cell RNA-seq
    Zhong, Yulong
    Wang, Linyan
    Yu, Xianhong
    GENE REPORTS, 2025, 39