Split Pool Ligation-based Single-cell Transcriptome sequencing (SPLiT-seq) data processing pipeline comparison

Times cited: 1
Authors
Kuijpers, Lucas [1, 2]
Hornung, Bastian [2]
van den Hout-van Vroonhoven, Mirjam C. G. N. [2]
van IJcken, Wilfred F. J. [2]
Grosveld, Frank [1]
Mulugeta, Eskeatnaf [1]
Affiliations
[1] Erasmus Univ Med Ctr Rotterdam Erasmus MC, Dept Cell Biol, Wytemaweg 80, NL-3015 CN Rotterdam, Netherlands
[2] Erasmus Univ Med Ctr Rotterdam Erasmus MC, Ctr Biom, Rotterdam, Netherlands
Keywords
SPLiT-seq; Split-pool barcoding; Combinatorial barcoding; Data-preprocessing; Single cell RNA sequencing; RNA-SEQ; STAR
DOI
10.1186/s12864-024-10285-3
Chinese Library Classification
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Background: Single-cell sequencing techniques are revolutionizing every field of biology by providing the ability to measure the abundance of biological molecules at single-cell resolution. Although single-cell sequencing approaches have been developed for several molecular modalities, single-cell transcriptome sequencing is the most prevalent and widely applied technique. SPLiT-seq (split-pool ligation-based transcriptome sequencing) is one of these single-cell transcriptome techniques; it applies a unique combinatorial-barcoding approach by splitting and pooling cells into multi-well plates containing barcodes. This approach required the development of dedicated computational tools to preprocess the data and extract the count matrices. Here we compare eight bioinformatic pipelines (alevin-fry splitp, LR-splitpipe, SCSit, splitpipe, splitpipeline, SPLiTseq-demultiplex, STARsolo and zUMIs) that have been developed to process SPLiT-seq data. We provide an overview of the tools, their computational performance, functionality and impact on downstream processing of the single-cell data, which vary greatly depending on the tool used.
Results: We show that STARsolo, splitpipe and alevin-fry splitp can all handle large amounts of data within a reasonable time. In contrast, the other five pipelines are slow when handling large datasets. On smaller datasets, cell barcode results are similar across pipelines, with the exception of SPLiTseq-demultiplex and splitpipeline. LR-splitpipe, which was originally designed for processing long-read sequencing data, is the slowest of all pipelines. Alevin-fry produced different downstream results that are difficult to interpret. STARsolo functions nearly identically to splitpipe, and the two produce highly similar results. However, STARsolo lacks a function to collapse random hexamer reads, so some additional coding is required.
Conclusion: Our comprehensive comparative analysis aids users in selecting the most suitable analysis tool for efficient SPLiT-seq data processing, while also detailing the specific prerequisites for each of these pipelines. Of the available pipelines, we recommend splitpipe or STARsolo for SPLiT-seq data analysis.
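The random hexamer collapsing step mentioned in the Results can be illustrated with a minimal sketch. In SPLiT-seq, first-round wells typically contain both an oligo-dT primer and a random hexamer primer, each carrying its own barcode, so reads from one cell can appear under two different round-1 barcodes. The sketch below merges such pairs in a count table. It is not the code used by the authors or by any of the compared pipelines; the barcode sequences, the `HEXAMER_TO_OLIGO_DT` mapping and both function names are hypothetical and chosen only for illustration.

```python
from collections import Counter
from typing import Dict, Tuple

CellBarcode = Tuple[str, str, str]  # (round-1, round-2, round-3) barcodes

# Hypothetical pairing of round-1 barcodes that share a well:
# random hexamer barcode -> oligo-dT barcode. Real pairings come from
# the barcode layout of the kit or protocol in use.
HEXAMER_TO_OLIGO_DT: Dict[str, str] = {
    "CCCCAAAA": "AAAACCCC",
    "GGGGTTTT": "TTTTGGGG",
}


def collapse_round1(cell: CellBarcode) -> CellBarcode:
    """Replace a random hexamer round-1 barcode with its oligo-dT partner."""
    round1, round2, round3 = cell
    return (HEXAMER_TO_OLIGO_DT.get(round1, round1), round2, round3)


def collapse_counts(
    counts: Dict[Tuple[CellBarcode, str], int]
) -> Dict[Tuple[CellBarcode, str], int]:
    """Sum counts of (cell barcode, gene) entries after collapsing round 1."""
    merged: Counter = Counter()
    for (cell, gene), n in counts.items():
        merged[(collapse_round1(cell), gene)] += n
    return dict(merged)
```

Applied to a count table keyed by cell barcode and gene, `collapse_counts` returns one entry per collapsed cell and gene, with the oligo-dT and random hexamer counts summed.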
Pages: 15