FlowGrid enables fast clustering of very large single-cell RNA-seq data

被引:4
|
作者
Fang, Xiunan [1 ]
Ho, Joshua W. K. [1 ,2 ]
机构
[1] Univ Hong Kong, Li Ka Shing Fac Med, Sch Biomed Sci, Hong Kong, Peoples R China
[2] Lab Data Discovery Hlth Ltd D24H, Hong Kong Sci Pk, Hong Kong, Peoples R China
关键词
D O I
10.1093/bioinformatics/btab521
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Scalable clustering algorithms are needed to analyze millions of cells in single cell RNA-seq (scRNA-seq) data. Results: Here, we present an open source python package called FlowGrid that can integrate into the Scanpy workflow to perform clustering on very large scRNA-seq datasets. FlowGrid implements a fast density-based clustering algorithm originally designed for flow cytometry data analysis. We introduce a new automated parameter tuning procedure, and show that FlowGrid can achieve comparable clustering accuracy as state-of-the-art clustering algorithms but at a substantially reduced run time for very large single cell RNA-seq datasets. For example, FlowGrid can complete a one-hour clustering task for one million cells in about five min.
引用
收藏
页码:282 / 283
页数:2
相关论文
共 50 条
  • [41] scFseCluster: a feature selection-enhanced clustering for single-cell RNA-seq data
    Wang, Zongqin
    Xie, Xiaojun
    Liu, Shouyang
    Ji, Zhiwei
    LIFE SCIENCE ALLIANCE, 2023, 6 (12)
  • [42] scSemiAAE: a semi-supervised clustering model for single-cell RNA-seq data
    Wang, Zile
    Wang, Haiyun
    Zhao, Jianping
    Zheng, Chunhou
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [43] A deep matrix factorization based approach for single-cell RNA-seq data clustering
    Liang, Zhenlan
    Zheng, Ruiqing
    Chen, Siqi
    Yan, Xuhua
    Li, Min
    METHODS, 2022, 205 : 114 - 122
  • [44] Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis
    Xiangjie Li
    Kui Wang
    Yafei Lyu
    Huize Pan
    Jingxiao Zhang
    Dwight Stambolian
    Katalin Susztak
    Muredach P. Reilly
    Gang Hu
    Mingyao Li
    Nature Communications, 11
  • [45] Review of single-cell RNA-seq data clustering for cell-type identification and characterization
    Zhang, Shixiong
    Li, Xiangtao
    Lin, Jiecong
    Lin, Qiuzhen
    Wong, Ka-Chun
    RNA, 2023, 29 (05) : 517 - 530
  • [46] A Hybrid Clustering Algorithm for Identifying Cell Types from Single-Cell RNA-Seq Data
    Zhu, Xiaoshu
    Li, Hong-Dong
    Xu, Yunpei
    Guo, Lilu
    Wu, Fang-Xiang
    Duan, Guihua
    Wang, Jianxin
    GENES, 2019, 10 (02)
  • [47] Deep single-cell RNA-seq data clustering with graph prototypical contrastive learning
    Lee, Junseok
    Kim, Sungwon
    Hyun, Dongmin
    Lee, Namkyeong
    Kim, Yejin
    Park, Chanyoung
    BIOINFORMATICS, 2023, 39 (06)
  • [48] GSE: Graph similarity enhancement algorithm for single-cell RNA-seq data clustering
    Bu, Shugui
    Guo, Lilu
    Li, Rongyuan
    Lu, Jianbo
    Zhu, Xiaoshu
    2019 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION PROCESSING (ICIIP 2019), 2019, : 406 - 410
  • [49] FEATS: feature selection-based clustering of single-cell RNA-seq data
    Vans, Edwin
    Patil, Ashwini
    Sharma, Alok
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (04)
  • [50] Diffusion Kernel based Fast Adaptive Clustering of Single Cell RNA-seq Data
    Kausar, Samina
    Xu Huahu
    Mehmood, Rashid
    Iqbal, Muhammad Shahid
    ICBBT 2019: 2019 11TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL TECHNOLOGY, 2019, : 86 - 93