Cell Subtype Classification via Representation Learning Based on a Denoising Autoencoder for Single-Cell RNA Sequencing

被引:4
|
作者
Choi, Joungmin [1 ]
Rhee, Je-Keun [2 ]
Chae, Heejoon [1 ]
机构
[1] Sookmyung Womens Univ, Div Comp Sci, Seoul 04310, South Korea
[2] Soongsil Univ, Sch Syst Biomed Sci, Seoul 06978, South Korea
基金
新加坡国家研究基金会;
关键词
Feature extraction; Gene expression; Biology; Data models; Biological system modeling; RNA; Neural networks; Cell subtype; classification; gene expression; scRNA-seq; single-cell; SEQ DATA; GENE-EXPRESSION; TUMOR; TRANSCRIPTOMES; HETEROGENEITY; HEALTH;
D O I
10.1109/ACCESS.2021.3052923
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Identification of single-cell subtypes is one of the fundamental processes required to understand a heterogeneous population composed of multiple cells, based on single-cell RNA sequencing data. Previously, cell subtype identification was mainly carried out by dimension reduction and clustering approaches that grouped cells with similar expressed profiles together. However, for high robustness to noises and systematic annotation of the subtype in each cell, supervised classification approaches have been widely used. Recently, deep neural network (DNN) models have been widely presented in various fields, including biology. By capturing the composite relationship between sample features and target outcomes, a DNN model enables significant performance improvements in biological data mining analyses. In this paper, we constructed a DNN model, called scDAE for single-cell subtype identification combined with representative feature extraction using a multilayer denoising autoencoder (DAE). The feature sets were learned by the DAE and were further tuned by fully connected layers using a softmax classifier. The model was compared against four state-of-the-art cell subtype identification methods and two conventional machine learning algorithms. From multiple tests, scDAE significantly outperformed competing methods especially on data sets having a large number of cell subtypes and noises. Extracted cell features from the proposed model were clearly clustered with respect to subtype. The results of the experiments indicated that our proposed model is effective in identifying single-cell subtypes and molecular signatures representative of each cell subtype. scDAE is publicly available at https://github.com/cbi-bioinfo/scDAE.
引用
收藏
页码:14540 / 14548
页数:9
相关论文
共 50 条
  • [1] Cell Classification Based on Stacked Autoencoder for Single-Cell RNA Sequencing
    Qi, Rong
    Zheng, Chun-Hou
    Ji, Cun-Mei
    Yu, Ning
    Ni, Jian-Cheng
    Wang, Yu-Tian
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2022, PT II, 2022, 13394 : 245 - 259
  • [2] Clustering and classification methods for single-cell RNA-sequencing data
    Qi, Ren
    Ma, Anjun
    Ma, Qin
    Zou, Quan
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (04) : 1196 - 1208
  • [3] Dimensionality Reduction of Single-Cell RNA Sequencing Data by Combining Entropy and Denoising AutoEncoder
    Zhu, Xiaoshu
    Li, Jian
    Lin, Yongchang
    Zhao, Liquan
    Wang, Jianxin
    Peng, Xiaoqing
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2022, 29 (10) : 1074 - 1084
  • [4] Graph attention autoencoder model with dual decoder for clustering single-cell RNA sequencing data
    Wang, Shudong
    Zhang, Yu
    Zhang, Yuanyuan
    Zhang, Yulin
    Pang, Shanchen
    Su, Jionglong
    Liu, Yingye
    APPLIED INTELLIGENCE, 2024, 54 (06) : 5136 - 5146
  • [5] RIA: a novel Regression-based Imputation Approach for single-cell RNA sequencing
    Bang Tran
    Duc Tran
    Hung Nguyen
    Nam Sy Vo
    Tin Nguyen
    PROCEEDINGS OF 2019 11TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2019), 2019, : 229 - 237
  • [6] Non-negative low-rank representation based on dictionary learning for single-cell RNA-sequencing data analysis
    Wang, Juan
    Zhang, Nana
    Yuan, Shasha
    Shang, Junliang
    Dai, Lingyun
    Li, Feng
    Liu, Jinxing
    BMC GENOMICS, 2022, 23 (01)
  • [7] Microfluidics Facilitates the Development of Single-Cell RNA Sequencing
    Pan, Yating
    Cao, Wenjian
    Mu, Ying
    Zhu, Qiangyuan
    BIOSENSORS-BASEL, 2022, 12 (07):
  • [8] Single-Cell RNA Sequencing to Understand Host-Pathogen Interactions
    Penaranda, Cristina
    Hung, Deborah T.
    ACS INFECTIOUS DISEASES, 2019, 5 (03): : 336 - +
  • [9] Effectively Clustering Single Cell RNA Sequencing Data by Sparse Representation
    Li, Rui-Yi
    Wang, Zhiye
    Guan, Jihong
    Zhou, Shuigeng
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3425 - 3434
  • [10] The new technologies of high-throughput single-cell RNA sequencing
    Vodiasova, E. A.
    Chelebieva, E. S.
    Kuleshova, O. N.
    VAVILOVSKII ZHURNAL GENETIKI I SELEKTSII, 2019, 23 (05): : 508 - 518