FastCount: A Fast Gene Count Software for Single-cell RNA-seq Data

被引:1
|
作者
Liu, Jinpeng [1 ]
Liu, Xinan [1 ]
Yu, Ye [1 ]
Wang, Chi [1 ]
Liu, Jinze [2 ]
机构
[1] Univ Kentucky, Lexington, KY 40506 USA
[2] Virginia Commonwealth Univ, Richmond, VA USA
关键词
single cell RNA-seq; UMI quantification; alignment-free; k-mers; SEQUENCE;
D O I
10.1145/3459930.3469544
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Motivation: The advent of single cell RNA-seq (scRNA-seq) enables scientists to characterize the transcriptomic response of cells under different conditions and understand expression heterogeneity at single cell level. One of the fundamental steps in scRNA-seq analysis is to summarize raw sequencing reads into a list of gene counts for each individual cell. However, this step remains to be most time-consuming and resource intensive in the analysis workflow due to the large amount of data produced in a scRNA-seq experiment. It is further complicated by the special handling of cell barcodes and unique molecular identifiers (UMIs) information in the read sequences. For example, the gene count summarization of 10X Chromium sequencing by standard Cell Ranger count often takes many hours to finish when running on a computing cluster. Although several alignment-free algorithms have been developed to improve efficiency, their derived gene count suffer from poor concordance with Cell Ranger count and algorithm-specific bias[1]. Results: In this work, we present a light-weight k-mer based gene count algorithm, FastCount, to support efficient UMI counts from single cell RNA-seq data. We demonstrate that FastCount is over an order of magnitude faster than Cell Ranger count while achieving competitive accuracy on 10X Genomics single cell RNAseq data. FastCount is a stand-alone program implemented in C++.
引用
收藏
页数:8
相关论文
共 50 条
  • [11] Comparison of Gene Selection Methods for Clustering Single-cell RNA-seq Data
    Zhu, Xiaoshu
    Wang, Jianxin
    Li, Rongruan
    Peng, Xiaoqing
    CURRENT BIOINFORMATICS, 2023, 18 (01) : 1 - 11
  • [12] FlowGrid enables fast clustering of very large single-cell RNA-seq data
    Fang, Xiunan
    Ho, Joshua W. K.
    BIOINFORMATICS, 2022, 38 (01) : 282 - 283
  • [13] scDIOR: single cell RNA-seq data IO software
    Feng, Huijian
    Lin, Lihui
    Chen, Jiekai
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [14] scDIOR: single cell RNA-seq data IO software
    Huijian Feng
    Lihui Lin
    Jiekai Chen
    BMC Bioinformatics, 23
  • [15] Computational analysis of alternative polyadenylation from standard RNA-seq and single-cell RNA-seq data
    Gao, Yipeng
    Li, Wei
    MRNA 3' END PROCESSING AND METABOLISM, 2021, 655 : 225 - 243
  • [16] Evaluating imputation methods for single-cell RNA-seq data
    Yi Cheng
    Xiuli Ma
    Lang Yuan
    Zhaoguo Sun
    Pingzhang Wang
    BMC Bioinformatics, 24
  • [17] Analysis of Single-Cell RNA-seq Data by Clustering Approaches
    Zhu, Xiaoshu
    Li, Hong-Dong
    Guo, Lilu
    Wu, Fang-Xiang
    Wang, Jianxin
    CURRENT BIOINFORMATICS, 2019, 14 (04) : 314 - 322
  • [18] SCnorm: robust normalization of single-cell RNA-seq data
    Bacher, Rhonda
    Chu, Li-Fang
    Leng, Ning
    Gasch, Audrey P.
    Thomson, James A.
    Stewart, Ron M.
    Newton, Michael
    Kendziorski, Christina
    NATURE METHODS, 2017, 14 (06) : 584 - +
  • [19] Quantifying the clusterness and trajectoriness of single-cell RNA-seq data
    Lim, Hong Seo
    Qiu, Peng
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (02)
  • [20] Evaluating imputation methods for single-cell RNA-seq data
    Cheng, Yi
    Ma, Xiuli
    Yuan, Lang
    Sun, Zhaoguo
    Wang, Pingzhang
    BMC BIOINFORMATICS, 2023, 24 (01)