A Comparison for Dimensionality Reduction Methods of Single-Cell RNA-seq Data

被引:67
|
作者
Xiang, Ruizhi [1 ]
Wang, Wencan [2 ,3 ]
Yang, Lei [1 ]
Wang, Shiyuan [1 ]
Xu, Chaohan [1 ]
Chen, Xiaowen [1 ]
机构
[1] Harbin Med Univ, Coll Bioinformat Sci & Technol, Harbin, Peoples R China
[2] Wenzhou Med Univ, Sch Optometry & Ophthalmol, Wenzhou, Peoples R China
[3] Wenzhou Med Univ, Eye Hosp, Wenzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
single-cell RNA-seq; dimension reduction; benchmark; sequences analysis; deep learning; GENE-EXPRESSION;
D O I
10.3389/fgene.2021.646936
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Single-cell RNA sequencing (scRNA-seq) is a high-throughput sequencing technology performed at the level of an individual cell, which can have a potential to understand cellular heterogeneity. However, scRNA-seq data are high-dimensional, noisy, and sparse data. Dimension reduction is an important step in downstream analysis of scRNA-seq. Therefore, several dimension reduction methods have been developed. We developed a strategy to evaluate the stability, accuracy, and computing cost of 10 dimensionality reduction methods using 30 simulation datasets and five real datasets. Additionally, we investigated the sensitivity of all the methods to hyperparameter tuning and gave users appropriate suggestions. We found that t-distributed stochastic neighbor embedding (t-SNE) yielded the best overall performance with the highest accuracy and computing cost. Meanwhile, uniform manifold approximation and projection (UMAP) exhibited the highest stability, as well as moderate accuracy and the second highest computing cost. UMAP well preserves the original cohesion and separation of cell populations. In addition, it is worth noting that users need to set the hyperparameters according to the specific situation before using the dimensionality reduction methods based on non-linear model and neural network.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Dimensionality reduction and visualization of single-cell RNA-seq data with an improved deep variational autoencoder
    Jiang, Jing
    Xu, Junlin
    Liu, Yuansheng
    Song, Bosheng
    Guo, Xiulan
    Zeng, Xiangxiang
    Zou, Quan
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (03)
  • [2] Emerging deep learning methods for single-cell RNA-seq data analysis
    Zheng, Jie
    Wang, Ke
    QUANTITATIVE BIOLOGY, 2019, 7 (04) : 247 - 254
  • [3] An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data
    Sun, Xifang
    Sun, Shiquan
    Yang, Sheng
    CELLS, 2019, 8 (10)
  • [4] Single-cell RNA-seq data clustering: A survey with performance comparison study
    Li, Ruiyi
    Guan, Jihong
    Zhou, Shuigeng
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2020, 18 (04)
  • [5] scGADR: Dimensionality Reduction of Single-Cell RNA-seq Data with ZINB-Based Graph Attention Autoencoder
    Tang, Yongxuan
    Luo, Jiawei
    Xu, Zhongyuan
    Xiong, Zehao
    Wang, Bo
    Nguyen Hoang Tu
    ADVANCED INTELLIGENT COMPUTING IN BIOINFORMATICS, PT II, ICIC 2024, 2024, 14882 : 357 - 368
  • [6] Deep Learning for Clustering Single-cell RNA-seq Data
    Zhu, Yuan
    Bai, Litai
    Ning, Zilin
    Fu, Wenfei
    Liu, Jie
    Jiang, Linfeng
    Fei, Shihuang
    Gong, Shiyun
    Lu, Lulu
    Deng, Minghua
    Yi, Ming
    CURRENT BIOINFORMATICS, 2024, 19 (03) : 193 - 210
  • [7] SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data
    Peng, Tao
    Zhu, Qin
    Yin, Penghang
    Tan, Kai
    GENOME BIOLOGY, 2019, 20 (1)
  • [8] SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data
    Tao Peng
    Qin Zhu
    Penghang Yin
    Kai Tan
    Genome Biology, 20
  • [9] VASC: Dimension Reduction and Visualization of Single-cell RNA-seq Data by Deep Variational Autoencoder
    Wang, Dongfang
    Gu, Jin
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2018, 16 (05) : 320 - 331
  • [10] Identification of innate lymphoid cells in single-cell RNA-Seq data
    Suffiotti, Madeleine
    Carmona, Santiago J.
    Jandus, Camilla
    Gfeller, David
    IMMUNOGENETICS, 2017, 69 (07) : 439 - 450