SCIBER: a simple method for removing batch effects from single-cell RNA-sequencing data

被引:3
|
作者
Gan, Dailin [1 ]
Li, Jun [1 ]
机构
[1] Univ Notre Dame, Dept Appl & Computat Math & Stat, Notre Dame, IN 46556 USA
基金
美国国家卫生研究院;
关键词
SEQ; EXPRESSION; TRANSCRIPTOMICS; PROGENITOR; ATLAS; STEM; MAP;
D O I
10.1093/bioinformatics/btac819
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Integrative analysis of multiple single-cell RNA-sequencing datasets allows for more comprehensive characterizations of cell types, but systematic technical differences between datasets, known as 'batch effects', need to be removed before integration to avoid misleading interpretation of the data. Although many batch-effect-removal methods have been developed, there is still a large room for improvement: most existing methods only give dimension-reduced data instead of expression data of individual genes, are based on computationally demanding models and are black-box models and thus difficult to interpret or tune. Results: Here, we present a new batch-effect-removal method called SCIBER (Single-Cell Integrator and Batch Effect Remover) and study its performance on real datasets. SCIBER matches cell clusters across batches according to the overlap of their differentially expressed genes. As a simple algorithm that has better scalability to data with a large number of cells and is easy to tune, SCIBER shows comparable and sometimes better accuracy in removing batch effects on real datasets compared to the state-of-the-art methods, which are much more complicated. Moreover, SCIBER outputs expression data in the original space, that is, the expression of individual genes, which can be used directly for downstream analyses. Additionally, SCIBER is a reference-based method, which assigns one of the batches as the reference batch and keeps it untouched during the process, making it especially suitable for integrating user-generated datasets with standard reference data such as the Human Cell Atlas.
引用
收藏
页数:8
相关论文
共 50 条
  • [11] Cell type matching in single-cell RNA-sequencing data using FR-Match
    Zhang, Yun
    Aevermann, Brian
    Gala, Rohan
    Scheuermann, Richard H.
    SCIENTIFIC REPORTS, 2022, 12 (01):
  • [12] Inferring the kinetics of stochastic gene expression from single-cell RNA-sequencing data
    Kim, Jong Kyoung
    Marioni, John C.
    GENOME BIOLOGY, 2013, 14 (01): : 1 - 12
  • [13] Comparison of Computational Methods for Imputing Single-Cell RNA-Sequencing Data
    Zhang, Lihua
    Zhang, Shihua
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (02) : 376 - 389
  • [14] Single-cell RNA-sequencing of mammalian brain development: insights and future directions
    Vinsland, Elin
    Linnarsson, Sten
    DEVELOPMENT, 2022, 149 (10):
  • [15] Single-Cell RNA-Sequencing: Opening New Horizons for Breast Cancer Research
    Xiang, Lingyan
    Rao, Jie
    Yuan, Jingping
    Xie, Ting
    Yan, Honglin
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (17)
  • [16] Combining bulk RNA-sequencing and single-cell RNA-sequencing data to reveal the immune microenvironment and metabolic pattern of osteosarcoma
    Huang, Ruichao
    Wang, Xiaohu
    Yin, Xiangyun
    Zhou, Yaqi
    Sun, Jiansheng
    Yin, Zhongxiu
    Zhu, Zhi
    FRONTIERS IN GENETICS, 2022, 13
  • [17] Systematic comparison of single-cell and single-nucleus RNA-sequencing methods
    Ding, Jiarui
    Adiconis, Xian
    Simmons, Sean K.
    Kowalczyk, Monika S.
    Hession, Cynthia C.
    Marjanovic, Nemanja D.
    Hughes, Travis K.
    Wadsworth, Marc H.
    Burks, Tyler
    Nguyen, Lan T.
    Kwon, John Y. H.
    Baraks, Boaz
    Ge, William
    Kedaigle, Amanda J.
    Carroll, Shaina
    Li, Shuqiang
    Hacohen, Nir
    Rozenblatt-Rosen, Orit
    Shalek, Alex K.
    Villani, Alexandra-Chloe
    Regev, Aviv
    Levin, Joshua Z.
    NATURE BIOTECHNOLOGY, 2020, 38 (06) : 737 - +
  • [18] A benchmark of batch-effect correction methods for single-cell RNA sequencing data
    Hoa Thi Nhu Tran
    Ang, Kok Siong
    Chevrier, Marion
    Zhang, Xiaomeng
    Lee, Nicole Yee Shin
    Goh, Michelle
    Chen, Jinmiao
    GENOME BIOLOGY, 2020, 21 (01)
  • [19] scruff: an R/Bioconductor package for preprocessing single-cell RNA-sequencing data
    Wang, Zhe
    Hu, Junming
    Johnson, W. Evan
    Campbell, Joshua D.
    BMC BIOINFORMATICS, 2019, 20 (1)
  • [20] Joint learning dimension reduction and clustering of single-cell RNA-sequencing data
    Wu, Wenming
    Ma, Xiaoke
    BIOINFORMATICS, 2020, 36 (12) : 3825 - 3832