CIForm as a Transformer-based model for cell-type annotation of large-scale single-cell RNA-seq data

被引:21
|
作者
Xu, Jing [1 ,2 ]
Zhang, Aidi [1 ]
Liu, Fang [1 ]
Chen, Liang [1 ]
Zhang, Xiujun [1 ]
机构
[1] Chinese Acad Sci, Key Lab Plant Germplasm Enhancement & Specialty Ag, Wuhan Bot Garden, Wuhan 430074, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
cell-type annotation; deep learning; Transformer; scRNA-seq; large-scale dataset; HETEROGENEITY; ATLAS;
D O I
10.1093/bib/bbad195
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell omics technologies have made it possible to analyze the individual cells within a biological sample, providing a more detailed understanding of biological systems. Accurately determining the cell type of each cell is a crucial goal in single-cell RNA-seq (scRNA-seq) analysis. Apart from overcoming the batch effects arising from various factors, single-cell annotation methods also face the challenge of effectively processing large-scale datasets. With the availability of an increase in the scRNA-seq datasets, integrating multiple datasets and addressing batch effects originating from diverse sources are also challenges in cell-type annotation. In this work, to overcome the challenges, we developed a supervised method called CIForm based on the Transformer for cell-type annotation of large-scale scRNA-seq data. To assess the effectiveness and robustness of CIForm, we have compared it with some leading tools on benchmark datasets. Through the systematic comparisons under various cell-type annotation scenarios, we exhibit that the effectiveness of CIForm is particularly pronounced in cell-type annotation. The source code and data are available at .
引用
收藏
页数:11
相关论文
共 50 条
  • [1] TransCluster: A Cell-Type Identification Method for single-cell RNA-Seq data using deep learning based on transformer
    Song, Tao
    Dai, Huanhuan
    Wang, Shuang
    Wang, Gan
    Zhang, Xudong
    Zhang, Ying
    Jiao, Linfang
    FRONTIERS IN GENETICS, 2022, 13
  • [2] Review of single-cell RNA-seq data clustering for cell-type identification and characterization
    Zhang, Shixiong
    Li, Xiangtao
    Lin, Jiecong
    Lin, Qiuzhen
    Wong, Ka-Chun
    RNA, 2023, 29 (05) : 517 - 530
  • [3] SCSA: A Cell Type Annotation Tool for Single-Cell RNA-seq Data
    Cao, Yinghao
    Wang, Xiaoyue
    Peng, Gongxin
    FRONTIERS IN GENETICS, 2020, 11
  • [4] transCAE: Enhancing Cell Type Annotation in Single-cell RNA-seq Data with Transfer Learning and Convolutional Autoencoder
    Liu, Qingchun
    Xu, Yan
    JOURNAL OF MOLECULAR BIOLOGY, 2025, 437 (04)
  • [5] Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data
    Huang, Qianhui
    Liu, Yu
    Du, Yuheng
    Garmire, Lana X.
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2021, 19 (02) : 267 - 281
  • [6] scPred: accurate supervised method for cell-type classification from single-cell RNA-seq data
    Alquicira-Hernandez, Jose
    Sathe, Anuja
    Ji, Hanlee P.
    Quan Nguyen
    Powell, Joseph E.
    GENOME BIOLOGY, 2019, 20 (01)
  • [7] scReClassify: post hoc cell type classification of single-cell rNA-seq data
    Kim, Taiyun
    Lo, Kitty
    Geddes, Thomas A.
    Kim, Hani Jieun
    Yang, Jean Yee Hwa
    Yang, Pengyi
    BMC GENOMICS, 2019, 20 (Suppl 9)
  • [8] Model-based autoencoders for imputing discrete single-cell RNA-seq data
    Tian, Tian
    Min, Martin Renqiang
    Wei, Zhi
    METHODS, 2021, 192 : 112 - 119
  • [9] De novo prediction of cell-type complexity in single-cell RNA-seq and tumor microenvironments
    Woo, Jun
    Winterhoff, Boris J.
    Starr, Timothy K.
    Aliferis, Constantin
    Wang, Jinhua
    LIFE SCIENCE ALLIANCE, 2019, 2 (04)
  • [10] Deep Batch Integration and Denoise of Single-Cell RNA-Seq Data
    Qin, Lu
    Zhang, Guangya
    Zhang, Shaoqiang
    Chen, Yong
    ADVANCED SCIENCE, 2024, 11 (29)