Cell Classification Based on Stacked Autoencoder for Single-Cell RNA Sequencing

被引:0
作者
Qi, Rong [1 ]
Zheng, Chun-Hou [2 ]
Ji, Cun-Mei [1 ]
Yu, Ning [1 ]
Ni, Jian-Cheng [3 ]
Wang, Yu-Tian [1 ]
机构
[1] Qufu Normal Univ, Sch Cyber Sci & Engn, Qufu, Shandong, Peoples R China
[2] Anhui Univ, Sch Artifial Intelligence, Hefei, Peoples R China
[3] Qufu Normal Univ, Network Informat Ctr, Qufu, Shandong, Peoples R China
来源
INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2022, PT II | 2022年 / 13394卷
基金
中国国家自然科学基金;
关键词
scRNA-seq; Representation learning; Stacked autoencoder; Classification; NETWORK; HEALTH; ATLAS;
D O I
10.1007/978-3-031-13829-4_20
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Single-cell transcriptome sequencing (scRNA-seq) allows researchers to investigate cellular heterogeneity in gene expression profiles, identify cell types and predict cell fate at the single-cell level. Analysis of scRNA-seq data can effectively extract unknown heterogeneity and functional diversity of cell populations. Cell classification is one of the most important tasks in scRNA-seq data analysis, which contains cell clustering and classification of cell subtypes. Before assigning a cell type to each cluster, the unsupervised clustering methods look for marker genes for each cluster. These approaches are susceptible to a number of drawbacks in terms of sources of variation, technology, etc. Meanwhile, as more cell subtypes are gradually discovered, cluster-based cell type identification methods have been gradually leaning towards classification-based cell type identification. In this paper, we proposed a new cell classification method based on stacked autoencoder for representation learning (scSAERLs), which enhances the accuracy of classification by learning the feature representation of the data through deep network models. The stacked autoencoder-based classification model employed an unsupervised greedy pre-training learning procedure, which was followed by supervised label-based fine-tuning of the entire classification model. We tested the model in the intra and inter datasets, evaluated its performance with a standard classification metric. Experimental results showed that scSAERLs outperformed other commonly used classification methods in terms of classification accuracy and F1-score.
引用
收藏
页码:245 / 259
页数:15
相关论文
共 37 条
[1]   scPred: accurate supervised method for cell-type classification from single-cell RNA-seq data [J].
Alquicira-Hernandez, Jose ;
Sathe, Anuja ;
Ji, Hanlee P. ;
Quan Nguyen ;
Powell, Joseph E. .
GENOME BIOLOGY, 2019, 20 (01)
[2]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]   Integrating single-cell transcriptomic data across different conditions, technologies, and species [J].
Butler, Andrew ;
Hoffman, Paul ;
Smibert, Peter ;
Papalexi, Efthymia ;
Satija, Rahul .
NATURE BIOTECHNOLOGY, 2018, 36 (05) :411-+
[5]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[6]   Learning for single-cell assignment [J].
Duan, Bin ;
Zhu, Chenyu ;
Chuai, Guohui ;
Tang, Chen ;
Chen, Xiaohan ;
Chen, Shaoqi ;
Fu, Shaliu ;
Li, Gaoyang ;
Liu, Qi .
SCIENCE ADVANCES, 2020, 6 (44)
[7]  
Hinton Geoffrey E, 2009, NIPS, P1607
[8]   Characterization of the single-cell transcriptional landscape by highly multiplex RNA-seq [J].
Islam, Saiful ;
Kjallquist, Una ;
Moliner, Annalena ;
Zajac, Pawel ;
Fan, Jian-Bing ;
Lonnerberg, Peter ;
Linnarsson, Sten .
GENOME RESEARCH, 2011, 21 (07) :1160-1167
[9]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[10]  
Laaksonen J, 1996, IEEE IJCNN, P1480, DOI 10.1109/ICNN.1996.549118