Spectral Clustering of Single-Cell RNA-Sequencing Data by Multiple Feature Sets Affinity

被引:0
|
作者
Liu, Yang [1 ]
Li, Feng [1 ]
Shang, Junliang [1 ]
Ge, Daohui [1 ]
Ren, Qianqian [1 ]
Li, Shengjun [1 ]
机构
[1] Qufu Normal Univ, Sch Comp Sci, Rizhao 276826, Peoples R China
基金
中国国家自然科学基金;
关键词
scRNA-seq; feature extraction; fusion; clustering; GENE-EXPRESSION;
D O I
10.1007/978-981-99-4749-2_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A critical stage in the study of single-cell RNA-sequencing (scRNA-seq) data is cell clustering. The quality of feature selection, which comes first in unsupervised clustering, directly affects the quality of the analysis that follows. It is difficult to choose high-quality characteristics since the gene expression data from scRNA-seq are high dimensional. Feature extraction is often used on gene expression data to choose highly expressed features, that is, subsets of original features. The typical ways for feature selection are to either reserve by percentage or to simply establish a specified threshold number based on experience. It is challenging to guarantee that the first-rank clustering results can be procured using these methods because they are so subjective. In this study, we propose a feature selection method scMFSA to overcome the one-dimensional shortcoming of the traditional PCA method by selecting multiple top-level feature sets. The similarity matrix constructed from each feature set is enhanced by affinity to optimize the feature learning. Lastly, studies are carried out on the actual scRNA-seq datasets using the features discovered in scMFSA. The findings indicate that when paired with clustering methods, the features chosen by scMFSA can increase the accuracy of clustering results. As a result, scMFSA can be an effective tool for researchers to employ when analyzing scRNA-seq data.
引用
收藏
页码:268 / 278
页数:11
相关论文
共 50 条
  • [1] Significance analysis for clustering with single-cell RNA-sequencing data
    Grabski, Isabella N.
    Street, Kelly
    Irizarry, Rafael A.
    NATURE METHODS, 2023, 20 (08) : 1196 - +
  • [2] Significance analysis for clustering with single-cell RNA-sequencing data
    Isabella N. Grabski
    Kelly Street
    Rafael A. Irizarry
    Nature Methods, 2023, 20 : 1196 - 1202
  • [3] Clustering and classification methods for single-cell RNA-sequencing data
    Qi, Ren
    Ma, Anjun
    Ma, Qin
    Zou, Quan
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (04) : 1196 - 1208
  • [4] One-step spectral clustering of weighted variables on single-cell RNA-sequencing data
    Park, Min Young
    Park, Seyoung
    KOREAN JOURNAL OF APPLIED STATISTICS, 2020, 33 (04) : 511 - 526
  • [5] Consensus Nature Inspired Clustering of Single-Cell RNA-Sequencing Data
    Abou El-Naga, Amany H.
    Sayed, Sabah
    Salah, Akram
    Mohsen, Heba
    IEEE ACCESS, 2022, 10 : 98079 - 98094
  • [6] Clustering single-cell rna-sequencing data based on matching clusters structures
    Wang, Yizhang
    Zhou, You
    Pang, Wie
    Liang, Yanchun
    Wang, Shu
    Tehnicki Vjesnik, 2020, 27 (01): : 89 - 95
  • [7] A Data-Driven Clustering Recommendation Method for Single-Cell RNA-Sequencing Data
    Tian, Yu
    Zheng, Ruiqing
    Liang, Zhenlan
    Li, Suning
    Wu, Fang-Xiang
    Li, Min
    TSINGHUA SCIENCE AND TECHNOLOGY, 2021, 26 (05) : 772 - 789
  • [8] A Data-Driven Clustering Recommendation Method for Single-Cell RNA-Sequencing Data
    Yu Tian
    Ruiqing Zheng
    Zhenlan Liang
    Suning Li
    Fang-Xiang Wu
    Min Li
    TsinghuaScienceandTechnology, 2021, 26 (05) : 772 - 789
  • [9] Clustering Single-cell RNA-sequencing Data based on Matching Clusters Structures
    Wang, Yizhang
    Zhou, You
    Pang, Wie
    Liang, Yanchun
    Wang, Shu
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2020, 27 (01): : 89 - 95
  • [10] Machine learning and statistical methods for clustering single-cell RNA-sequencing data
    Petegrosso, Raphael
    Li, Zhuliu
    Kuang, Rui
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (04) : 1209 - 1223