Analysis of Single-Cell RNA-seq Data by Clustering Approaches

被引:22
|
作者
Zhu, Xiaoshu [1 ,2 ,3 ]
Li, Hong-Dong [1 ]
Guo, Lilu [2 ,3 ]
Wu, Fang-Xiang [4 ,5 ]
Wang, Jianxin [1 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R China
[2] Yulin Normal Univ, Sch Comp Sci & Engn, Yulin 537000, Guangxi, Peoples R China
[3] Yulin Normal Univ, Guangxi Univ Key Lab Complex Syst Optimizat & Big, Yulin 537000, Guangxi, Peoples R China
[4] Univ Saskatchewan, Div Biomed Engn, Saskatoon, SK S7N 5A9, Canada
[5] Univ Saskatchewan, Dept Mech Engn, Saskatoon, SK S7N 5A9, Canada
基金
中国国家自然科学基金;
关键词
Single-cell sequencing technology; single-cell RNA-seq data; similarity measurement; clustering of cell types; cluster method; feature selection; TRANSCRIPTOMICS REVEALS; FATE DECISIONS; EXPRESSION; GENOME; CLASSIFICATION; DISCOVERY; IDENTIFICATION; HETEROGENEITY; POPULATIONS; DIVERSITY;
D O I
10.2174/1574893614666181120095038
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The recently developed single-cell RNA sequencing (scRNA-seq) has attracted a great amount of attention due to its capability to interrogate expression of individual cells, which is superior to traditional bulk cell sequencing that can only measure mean gene expression of a population of cells. scRNA-seq has been successfully applied in finding new cell subtypes. New computational challenges exist in the analysis of scRNA-seq data. Objective: We provide an overview of the features of different similarity calculation and clustering methods, in order to facilitate users to select methods that are suitable for their scRNA-seq. We would also like to show that feature selection methods are important to improve clustering performance. Results: We first described similarity measurement methods, followed by reviewing some new clustering methods, as well as their algorithmic details. This analysis revealed several new questions, including how to automatically estimate the number of clustering categories, how to discover novel subpopulation, and how to search for new marker genes by using feature selection methods. Conclusion: Without prior knowledge about the number of cell types, clustering or semisupervised learning methods are important tools for exploratory analysis of scRNA-seq data.
引用
收藏
页码:314 / 322
页数:9
相关论文
共 50 条
  • [41] Tumor genetic analysis from single-cell RNA-seq data
    Tal Nawy
    Nature Methods, 2018, 15 : 571 - 571
  • [42] ascend: R package for analysis of single-cell RNA-seq data
    Senabouth, Anne
    Lukowski, Samuel W.
    Hernandez, Jose Alquicira
    Andersen, Stacey B.
    Mei, Xin
    Nguyen, Quan H.
    Powell, Joseph E.
    GIGASCIENCE, 2019, 8 (08):
  • [43] scSemiAAE: a semi-supervised clustering model for single-cell RNA-seq data
    Zile Wang
    Haiyun Wang
    Jianping Zhao
    Chunhou Zheng
    BMC Bioinformatics, 24
  • [44] Impact of data preprocessing on cell-type clustering based on single-cell RNA-seq data
    Wang, Chunxiang
    Gao, Xin
    Liu, Juntao
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [45] Impact of data preprocessing on cell-type clustering based on single-cell RNA-seq data
    Chunxiang Wang
    Xin Gao
    Juntao Liu
    BMC Bioinformatics, 21
  • [46] CIDR: Ultrafast and accurate clustering through imputation for single-cell RNA-seq data
    Lin, Peijie
    Troup, Michael
    Ho, Joshua W. K.
    GENOME BIOLOGY, 2017, 18
  • [47] FlowGrid enables fast clustering of very large single-cell RNA-seq data
    Fang, Xiunan
    Ho, Joshua W. K.
    BIOINFORMATICS, 2022, 38 (01) : 282 - 283
  • [48] CIDR: Ultrafast and accurate clustering through imputation for single-cell RNA-seq data
    Peijie Lin
    Michael Troup
    Joshua W. K. Ho
    Genome Biology, 18
  • [49] DTWscore: differential expression and cell clustering analysis for time-series single-cell RNA-seq data
    Wang, Zhuo
    Jin, Shuilin
    Liu, Guiyou
    Zhang, Xiurui
    Wang, Nan
    Wu, Deliang
    Hu, Yang
    Zhang, Chiping
    Jiang, Qinghua
    Xu, Li
    Wang, Yadong
    BMC BIOINFORMATICS, 2017, 18
  • [50] DTWscore: differential expression and cell clustering analysis for time-series single-cell RNA-seq data
    Zhuo Wang
    Shuilin Jin
    Guiyou Liu
    Xiurui Zhang
    Nan Wang
    Deliang Wu
    Yang Hu
    Chiping Zhang
    Qinghua Jiang
    Li Xu
    Yadong Wang
    BMC Bioinformatics, 18