SCIBER: a simple method for removing batch effects from single-cell RNA-sequencing data

Cited by: 3
Authors
Gan, Dailin [1]
Li, Jun [1]
Affiliations
[1] Univ Notre Dame, Dept Appl & Computat Math & Stat, Notre Dame, IN 46556 USA
Funding
National Institutes of Health (USA);
Keywords
SEQ; EXPRESSION; TRANSCRIPTOMICS; PROGENITOR; ATLAS; STEM; MAP;
DOI
10.1093/bioinformatics/btac819
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification codes
071010; 081704;
Abstract
Motivation: Integrative analysis of multiple single-cell RNA-sequencing datasets allows for more comprehensive characterization of cell types, but systematic technical differences between datasets, known as 'batch effects', need to be removed before integration to avoid misleading interpretation of the data. Although many batch-effect-removal methods have been developed, there is still considerable room for improvement: most existing methods output only dimension-reduced data rather than the expression of individual genes, rely on computationally demanding models, and are black boxes that are difficult to interpret or tune.
Results: Here, we present a new batch-effect-removal method called SCIBER (Single-Cell Integrator and Batch Effect Remover) and study its performance on real datasets. SCIBER matches cell clusters across batches according to the overlap of their differentially expressed genes. It is a simple algorithm that scales better to data with a large number of cells and is easy to tune, yet on real datasets it removes batch effects with accuracy comparable to, and sometimes better than, state-of-the-art methods that are much more complicated. Moreover, SCIBER outputs expression data in the original space, that is, the expression of individual genes, which can be used directly for downstream analyses. Additionally, SCIBER is a reference-based method: it designates one batch as the reference and leaves it untouched during the process, making it especially suitable for integrating user-generated datasets with standard reference data such as the Human Cell Atlas.
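The cluster-matching idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `jaccard` and `match_clusters`, the use of the Jaccard index as the overlap score, the `min_overlap` threshold, and the toy marker-gene sets are all assumptions made for illustration. The abstract states only that clusters are matched across batches by the overlap of their differentially expressed genes and that one batch serves as an untouched reference.

```python
def jaccard(a, b):
    """Jaccard index of two gene sets (an assumed overlap score, not from the paper)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def match_clusters(ref_markers, query_markers, min_overlap=0.1):
    """Pair each query-batch cluster with its best-overlapping reference cluster.

    ref_markers, query_markers: dict mapping a cluster label to the set of
    differentially expressed gene names for that cluster.
    Returns a dict mapping each query label to a reference label, or to None
    when no reference cluster reaches min_overlap (hypothetical threshold).
    """
    matches = {}
    for q_label, q_genes in query_markers.items():
        best_label, best_score = None, 0.0
        for r_label, r_genes in ref_markers.items():
            score = jaccard(q_genes, r_genes)
            if score > best_score:
                best_label, best_score = r_label, score
        matches[q_label] = best_label if best_score >= min_overlap else None
    return matches


# Toy marker sets, invented for illustration only.
ref = {
    "ref_T": {"CD3D", "CD3E", "IL7R"},
    "ref_B": {"CD79A", "MS4A1", "CD19"},
}
query = {
    "q0": {"CD79A", "MS4A1", "CD74"},
    "q1": {"CD3D", "IL7R", "CCR7"},
    "q2": {"HBB", "HBA1"},
}
matches = match_clusters(ref, query)
print(matches)
```

In this sketch only the query batch is mapped onto the reference, which mirrors the reference-based design described in the abstract. A query cluster with no sufficiently overlapping reference cluster (here `q2`) is left unmatched rather than forced onto a poor partner.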
Pages: 8