SCIBER: a simple method for removing batch effects from single-cell RNA-sequencing data

被引:3
|
作者
Gan, Dailin [1 ]
Li, Jun [1 ]
机构
[1] Univ Notre Dame, Dept Appl & Computat Math & Stat, Notre Dame, IN 46556 USA
基金
美国国家卫生研究院;
关键词
SEQ; EXPRESSION; TRANSCRIPTOMICS; PROGENITOR; ATLAS; STEM; MAP;
D O I
10.1093/bioinformatics/btac819
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Integrative analysis of multiple single-cell RNA-sequencing datasets allows for more comprehensive characterizations of cell types, but systematic technical differences between datasets, known as 'batch effects', need to be removed before integration to avoid misleading interpretation of the data. Although many batch-effect-removal methods have been developed, there is still a large room for improvement: most existing methods only give dimension-reduced data instead of expression data of individual genes, are based on computationally demanding models and are black-box models and thus difficult to interpret or tune. Results: Here, we present a new batch-effect-removal method called SCIBER (Single-Cell Integrator and Batch Effect Remover) and study its performance on real datasets. SCIBER matches cell clusters across batches according to the overlap of their differentially expressed genes. As a simple algorithm that has better scalability to data with a large number of cells and is easy to tune, SCIBER shows comparable and sometimes better accuracy in removing batch effects on real datasets compared to the state-of-the-art methods, which are much more complicated. Moreover, SCIBER outputs expression data in the original space, that is, the expression of individual genes, which can be used directly for downstream analyses. Additionally, SCIBER is a reference-based method, which assigns one of the batches as the reference batch and keeps it untouched during the process, making it especially suitable for integrating user-generated datasets with standard reference data such as the Human Cell Atlas.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Single-cell RNA-sequencing of human eosinophils in allergic inflammation in the esophagus
    Morgenstern, Netali Ben-Baruch
    Rochman, Mark
    Kotliar, Michael
    Dunn, Julia L. M.
    Mack, Lydia
    Besse, John
    Natale, Mia A.
    Klingler, Andrea M.
    Felton, Jennifer M.
    Caldwell, Julie M.
    Barski, Artem
    Rothenberg, Marc E.
    JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 2024, 154 (04) : 974 - 987
  • [42] scMODD: A model-driven algorithm for doublet identification in single-cell RNA-sequencing data
    Zhao, Xinye
    Du, Alexander
    Qiu, Peng
    FRONTIERS IN SYSTEMS BIOLOGY, 2023, 2
  • [43] IBRAP: integrated benchmarking single-cell RNA-sequencing analytical pipeline
    Knight, Connor H.
    Khan, Faraz
    Patel, Ankit
    Gill, Upkar S.
    Okosun, Jessica
    Wang, Jun
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (02)
  • [44] Characterization of iCell cardiomyocytes using single-cell RNA-sequencing methods
    Schmid, Christina
    Wohnhaas, Christian T.
    Hildebrandt, Tobias
    Baum, Patrick
    Rast, Georg
    JOURNAL OF PHARMACOLOGICAL AND TOXICOLOGICAL METHODS, 2020, 106
  • [45] Single-Cell RNA-Sequencing: Assessment of Differential Expression Analysis Methods
    Dal Molin, Alessandra
    Baruzzo, Giacomo
    Di Camillo, Barbara
    FRONTIERS IN GENETICS, 2017, 8
  • [46] PanoView: An iterative clustering method for single-cell RNA sequencing data
    Hu, Ming-Wen
    Kim, Dong Won
    Liu, Sheng
    Zack, Donald J.
    Blackshaw, Seth
    Qian, Jiang
    PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (08)
  • [47] PPMS: A framework to Profile Primary MicroRNAs from Single-cell RNA-sequencing datasets
    Ji, Jiahui
    Anwar, Maryam
    Petretto, Enrico
    Emanueli, Costanza
    Srivastava, Prashant Kumar
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (06)
  • [48] Single-cell RNA-sequencing analysis of the ciliary epithelium and contiguous tissues in the mouse eye
    Youkilis, J. C.
    Bassnett, S.
    EXPERIMENTAL EYE RESEARCH, 2021, 213
  • [49] Analysis of single-cell RNA sequencing data based on autoencoders
    Tangherloni, Andrea
    Ricciuti, Federico
    Besozzi, Daniela
    Lio, Pietro
    Cvejic, Ana
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [50] A Bayesian factorization method to recover single-cell RNA sequencing data
    Wen, Zi-Hang
    Langsam, Jeremy L.
    Zhang, Lu
    Shen, Wenjun
    Zhou, Xin
    CELL REPORTS METHODS, 2022, 2 (01):