Discovering cell types using manifold learning and enhanced visualization of single-cell RNA-Seq data

被引:14
作者
Vasighizaker, Akram [1 ]
Danda, Saiteja [1 ]
Rueda, Luis [1 ]
机构
[1] Univ Windsor, Sch Comp Sci, Windsor, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
DIMENSIONALITY REDUCTION; ENRICHMENT ANALYSIS;
D O I
10.1038/s41598-021-03613-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Identifying relevant disease modules such as target cell types is a significant step for studying diseases. High-throughput single-cell RNA-Seq (scRNA-seq) technologies have advanced in recent years, enabling researchers to investigate cells individually and understand their biological mechanisms. Computational techniques such as clustering, are the most suitable approach in scRNA-seq data analysis when the cell types have not been well-characterized. These techniques can be used to identify a group of genes that belong to a specific cell type based on their similar gene expression patterns. However, due to the sparsity and high-dimensionality of scRNA-seq data, classical clustering methods are not efficient. Therefore, the use of non-linear dimensionality reduction techniques to improve clustering results is crucial. We introduce a method that is used to identify representative clusters of different cell types by combining non-linear dimensionality reduction techniques and clustering algorithms. We assess the impact of different dimensionality reduction techniques combined with the clustering of thirteen publicly available scRNA-seq datasets of different tissues, sizes, and technologies. We further performed gene set enrichment analysis to evaluate the proposed method's performance. As such, our results show that modified locally linear embedding combined with independent component analysis yields overall the best performance relative to the existing unsupervised methods across different datasets.
引用
收藏
页数:16
相关论文
共 41 条
[1]   A Single-Cell Transcriptomic Map of the Human and Mouse Pancreas Reveals Inter- and Intra-cell Population Structure [J].
Baron, Maayan ;
Veres, Adrian ;
Wolock, Samuel L. ;
Faust, Aubrey L. ;
Gaujoux, Renaud ;
Vetere, Amedeo ;
Ryu, Jennifer Hyoje ;
Wagner, Bridget K. ;
Shen-Orr, Shai S. ;
Klein, Allon M. ;
Melton, Douglas A. ;
Yanai, Itai .
CELL SYSTEMS, 2016, 3 (04) :346-+
[2]   Dimensionality reduction for visualizing single-cell data using UMAP [J].
Becht, Etienne ;
McInnes, Leland ;
Healy, John ;
Dutertre, Charles-Antoine ;
Kwok, Immanuel W. H. ;
Ng, Lai Guan ;
Ginhoux, Florent ;
Newell, Evan W. .
NATURE BIOTECHNOLOGY, 2019, 37 (01) :38-+
[3]   Laplacian eigenmaps for dimensionality reduction and data representation [J].
Belkin, M ;
Niyogi, P .
NEURAL COMPUTATION, 2003, 15 (06) :1373-1396
[4]  
Calinski T., 1974, COMMUN STAT-THEOR M, V3, P1, DOI DOI 10.1080/03610927408827101
[5]   ToppGene Suite for gene list enrichment analysis and candidate gene prioritization [J].
Chen, Jing ;
Bardes, Eric E. ;
Aronow, Bruce J. ;
Jegga, Anil G. .
NUCLEIC ACIDS RESEARCH, 2009, 37 :W305-W311
[6]   CLUSTER SEPARATION MEASURE [J].
DAVIES, DL ;
BOULDIN, DW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1979, 1 (02) :224-227
[7]   Recurrent herpes simplex virus-1 infection induces hallmarks of neurodegeneration and cognitive deficits in mice [J].
De Chiara, Giovanna ;
Piacentini, Roberto ;
Fabiani, Marco ;
Mastrodonato, Alessia ;
Marcocci, Maria Elena ;
Limongi, Dolores ;
Napoletani, Giorgia ;
Protto, Virginia ;
Coluccio, Paolo ;
Celestino, Ignacio ;
Li Puma, Domenica Donatella ;
Grassi, Claudio ;
Palamara, Anna Teresa .
PLOS PATHOGENS, 2019, 15 (03)
[8]   Comprehensive review of the identification of essential genes using computational methods: focusing on feature implementation and assessment [J].
Dong, Chuan ;
Jin, Yan-Ting ;
Hua, Hong-Li ;
Wen, Qing-Feng ;
Luo, Sen ;
Zheng, Wen-Xin ;
Guo, Feng-Biao .
BRIEFINGS IN BIOINFORMATICS, 2020, 21 (01) :171-181
[9]   Dimension Reduction and Clustering Models for Single-Cell RNA Sequencing Data: A Comparative Study [J].
Feng, Chao ;
Liu, Shufen ;
Zhang, Hao ;
Guan, Renchu ;
Li, Dan ;
Zhou, Fengfeng ;
Liang, Yanchun ;
Feng, Xiaoyue .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2020, 21 (06)
[10]  
Genomics, 2016, 3k PBMCs from a Healthy Donor