CIForm as a Transformer-based model for cell-type annotation of large-scale single-cell RNA-seq data

被引:21
|
作者
Xu, Jing [1 ,2 ]
Zhang, Aidi [1 ]
Liu, Fang [1 ]
Chen, Liang [1 ]
Zhang, Xiujun [1 ]
机构
[1] Chinese Acad Sci, Key Lab Plant Germplasm Enhancement & Specialty Ag, Wuhan Bot Garden, Wuhan 430074, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
cell-type annotation; deep learning; Transformer; scRNA-seq; large-scale dataset; HETEROGENEITY; ATLAS;
D O I
10.1093/bib/bbad195
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell omics technologies have made it possible to analyze the individual cells within a biological sample, providing a more detailed understanding of biological systems. Accurately determining the cell type of each cell is a crucial goal in single-cell RNA-seq (scRNA-seq) analysis. Apart from overcoming the batch effects arising from various factors, single-cell annotation methods also face the challenge of effectively processing large-scale datasets. With the availability of an increase in the scRNA-seq datasets, integrating multiple datasets and addressing batch effects originating from diverse sources are also challenges in cell-type annotation. In this work, to overcome the challenges, we developed a supervised method called CIForm based on the Transformer for cell-type annotation of large-scale scRNA-seq data. To assess the effectiveness and robustness of CIForm, we have compared it with some leading tools on benchmark datasets. Through the systematic comparisons under various cell-type annotation scenarios, we exhibit that the effectiveness of CIForm is particularly pronounced in cell-type annotation. The source code and data are available at .
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Testing for Phylogenetic Signal in Single-Cell RNA-Seq Data
    Moravec, Jiri C.
    Lanfear, Robert
    Spector, David L.
    Diermeier, Sarah D.
    Gavryushkin, Alex
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2023, 30 (04) : 518 - 537
  • [32] Deep Learning for Clustering Single-cell RNA-seq Data
    Zhu, Yuan
    Bai, Litai
    Ning, Zilin
    Fu, Wenfei
    Liu, Jie
    Jiang, Linfeng
    Fei, Shihuang
    Gong, Shiyun
    Lu, Lulu
    Deng, Minghua
    Yi, Ming
    CURRENT BIOINFORMATICS, 2024, 19 (03) : 193 - 210
  • [33] SCnorm: robust normalization of single-cell RNA-seq data
    Bacher, Rhonda
    Chu, Li-Fang
    Leng, Ning
    Gasch, Audrey P.
    Thomson, James A.
    Stewart, Ron M.
    Newton, Michael
    Kendziorski, Christina
    NATURE METHODS, 2017, 14 (06) : 584 - +
  • [34] sc-ImmuCC: hierarchical annotation for immune cell types in single-cell RNA-seq
    Jiang, Ying
    Chen, Ziyi
    Han, Na
    Shang, Jingzhe
    Wu, Aiping
    FRONTIERS IN IMMUNOLOGY, 2023, 14
  • [35] Improving replicability in single-cell RNA-Seq cell type discovery with Dune
    de Bezieux, Hector Roux
    Street, Kelly
    Fischer, Stephan
    Van den Berge, Koen
    Chance, Rebecca
    Risso, Davide
    Gillis, Jesse
    Ngai, John
    Purdom, Elizabeth
    Dudoit, Sandrine
    BMC BIOINFORMATICS, 2024, 25 (01):
  • [36] PhytoCluster: a generative deep learning model for clustering plant single-cell RNA-seq data
    Wang, Hao
    Fu, Xiangzheng
    Liu, Lijia
    Wang, Yi
    Hong, Jingpeng
    Pan, Bintao
    Cao, Yaning
    Chen, Yanqing
    Cao, Yongsheng
    Ma, Xiaoding
    Fang, Wei
    Yan, Shen
    ABIOTECH, 2025,
  • [37] HArmonized single-cell RNA-seq Cell type Assisted Deconvolution (HASCAD)
    Chiu, Yen-Jung
    Ni, Chung-En
    Huang, Yen-Hua
    BMC MEDICAL GENOMICS, 2023, 16 (SUPPL 2)
  • [38] scASGC: An adaptive simplified graph convolution model for clustering single-cell RNA-seq data
    Wang, Shudong
    Zhang, Yu
    Zhang, Yulin
    Wu, Wenhao
    Ye, Lan
    Li, Yunyin
    Su, Jionglong
    Pang, Shanchen
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 163
  • [39] HArmonized single-cell RNA-seq Cell type Assisted Deconvolution (HASCAD)
    Yen-Jung Chiu
    Chung-En Ni
    Yen-Hua Huang
    BMC Medical Genomics, 16
  • [40] An optimized graph-based structure for single-cell RNA-seq cell-type classification based on non-linear dimension reduction
    Abadi, Saeedeh Akbari Rokn
    Laghaee, Seyed Pouria
    Koohi, Somayyeh
    BMC GENOMICS, 2023, 24 (01)