A comprehensive survey of dimensionality reduction and clustering methods for single-cell and spatial transcriptomics data

被引:6
作者
Sun, Yidi [1 ]
Kong, Lingling [1 ]
Huang, Jiayi [1 ]
Deng, Hongyan [1 ]
Bian, Xinling [1 ]
Li, Xingfeng [1 ]
Cui, Feifei [1 ]
Dou, Lijun [2 ]
Cao, Chen [3 ]
Zou, Quan [4 ,5 ]
Zhang, Zilong [1 ]
机构
[1] Hainan Univ, Sch Comp Sci & Technol, Haikou 570228, Peoples R China
[2] Lerner Res Inst, Genom Med Inst, Cleveland, OH 44106 USA
[3] Nanjing Med Univ, Sch Biomed Engn & Informat, Nanjing 210029, Peoples R China
[4] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu 610054, Peoples R China
[5] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Quzhou, Quzhou 324000, Peoples R China
基金
中国国家自然科学基金;
关键词
ScRNA-seq; spatial transcriptomics; dimensionality reduction; clustering; GENES;
D O I
10.1093/bfgp/elae023
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
In recent years, the application of single-cell transcriptomics and spatial transcriptomics analysis techniques has become increasingly widespread. Whether dealing with single-cell transcriptomic or spatial transcriptomic data, dimensionality reduction and clustering are indispensable. Both single-cell and spatial transcriptomic data are often high-dimensional, making the analysis and visualization of such data challenging. Through dimensionality reduction, it becomes possible to visualize the data in a lower-dimensional space, allowing for the observation of relationships and differences between cell subpopulations. Clustering enables the grouping of similar cells into the same cluster, aiding in the identification of distinct cell subpopulations and revealing cellular diversity, providing guidance for downstream analyses. In this review, we systematically summarized the most widely recognized algorithms employed for the dimensionality reduction and clustering analysis of single-cell transcriptomic and spatial transcriptomic data. This endeavor provides valuable insights and ideas that can contribute to the development of novel tools in this rapidly evolving field.
引用
收藏
页码:733 / 744
页数:12
相关论文
共 58 条
[1]   Stardust: improving spatial transcriptomics data analysis through space-aware modularity optimization-based clustering [J].
Avesani, Simone ;
Viesi, Eva ;
Alessandri, Luca ;
Motterle, Giovanni ;
Bonnici, Vincenzo ;
Beccuti, Marco ;
Calogero, Raffaele ;
Giugno, Rosalba .
GIGASCIENCE, 2022, 11
[2]   MetaCell: analysis of single-cell RNA-seq data using K-nn graph partitions [J].
Baran, Yael ;
Bercovich, Akhiad ;
Sebe-Pedros, Arnau ;
Lubling, Yaniv ;
Giladi, Amir ;
Chomsky, Elad ;
Meir, Zohar ;
Hoichman, Michael ;
Lifshitz, Aviezer ;
Tanay, Amos .
GENOME BIOLOGY, 2019, 20 (01)
[3]   Dimensionality reduction for visualizing single-cell data using UMAP [J].
Becht, Etienne ;
McInnes, Leland ;
Healy, John ;
Dutertre, Charles-Antoine ;
Kwok, Immanuel W. H. ;
Ng, Lai Guan ;
Ginhoux, Florent ;
Newell, Evan W. .
NATURE BIOTECHNOLOGY, 2019, 37 (01) :38-+
[4]   Metacell-2: a divide-and-conquer metacell algorithm for scalable scRNA-seq analysis [J].
Ben-Kiki, Oren ;
Bercovich, Akhiad ;
Lifshitz, Aviezer ;
Tanay, Amos .
GENOME BIOLOGY, 2022, 23 (01)
[5]   Fast unfolding of communities in large networks [J].
Blondel, Vincent D. ;
Guillaume, Jean-Loup ;
Lambiotte, Renaud ;
Lefebvre, Etienne .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
[6]   Interpretable dimensionality reduction of single cell transcriptome data with deep generative models [J].
Ding, Jiarui ;
Condon, Anne ;
Shah, Sohrab P. .
NATURE COMMUNICATIONS, 2018, 9
[7]   Deciphering spatial domains from spatially resolved transcriptomics with an adaptive graph attention auto-encoder [J].
Dong, Kangning ;
Zhang, Shihua .
NATURE COMMUNICATIONS, 2022, 13 (01)
[8]   Machine learning-based prediction model for distant metastasis of breast cancer [J].
Duan, Hao ;
Zhang, Yu ;
Qiu, Haoye ;
Fu, Xiuhao ;
Liu, Chunling ;
Zang, Xiaofeng ;
Xu, Anqi ;
Wu, Ziyue ;
Li, Xingfeng ;
Zhang, Qingchen ;
Zhang, Zilong ;
Cui, Feifei .
COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 169
[9]   DrImpute: imputing dropout events in single cell RNA sequencing data [J].
Gong, Wuming ;
Kwak, Il-Youp ;
Pota, Pruthvi ;
Koyano-Nakagawa, Naoko ;
Garry, Daniel J. .
BMC BIOINFORMATICS, 2018, 19
[10]   qSNE: quadratic rate t-SNE optimizer with automatic parameter tuning for large datasets [J].
Hakkinen, Antti ;
Koiranen, Juha ;
Casado, Julia ;
Kaipio, Katja ;
Lehtonen, Oskari ;
Petrucci, Eleonora ;
Hynninen, Johanna ;
Hietanen, Sakari ;
Carpen, Olli ;
Pasquini, Luca ;
Biffoni, Mauro ;
Lehtonen, Rainer ;
Hautaniemi, Sampsa .
BIOINFORMATICS, 2020, 36 (20) :5086-5092