Spectral Clustering of Single-Cell RNA-Sequencing Data by Multiple Feature Sets Affinity

被引:0
|
作者
Liu, Yang [1 ]
Li, Feng [1 ]
Shang, Junliang [1 ]
Ge, Daohui [1 ]
Ren, Qianqian [1 ]
Li, Shengjun [1 ]
机构
[1] Qufu Normal Univ, Sch Comp Sci, Rizhao 276826, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT III | 2023年 / 14088卷
基金
中国国家自然科学基金;
关键词
scRNA-seq; feature extraction; fusion; clustering; GENE-EXPRESSION;
D O I
10.1007/978-981-99-4749-2_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A critical stage in the study of single-cell RNA-sequencing (scRNA-seq) data is cell clustering. The quality of feature selection, which comes first in unsupervised clustering, directly affects the quality of the analysis that follows. It is difficult to choose high-quality characteristics since the gene expression data from scRNA-seq are high dimensional. Feature extraction is often used on gene expression data to choose highly expressed features, that is, subsets of original features. The typical ways for feature selection are to either reserve by percentage or to simply establish a specified threshold number based on experience. It is challenging to guarantee that the first-rank clustering results can be procured using these methods because they are so subjective. In this study, we propose a feature selection method scMFSA to overcome the one-dimensional shortcoming of the traditional PCA method by selecting multiple top-level feature sets. The similarity matrix constructed from each feature set is enhanced by affinity to optimize the feature learning. Lastly, studies are carried out on the actual scRNA-seq datasets using the features discovered in scMFSA. The findings indicate that when paired with clustering methods, the features chosen by scMFSA can increase the accuracy of clustering results. As a result, scMFSA can be an effective tool for researchers to employ when analyzing scRNA-seq data.
引用
收藏
页码:268 / 278
页数:11
相关论文
共 50 条
  • [1] Clustering and classification methods for single-cell RNA-sequencing data
    Qi, Ren
    Ma, Anjun
    Ma, Qin
    Zou, Quan
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (04) : 1196 - 1208
  • [2] A Data-Driven Clustering Recommendation Method for Single-Cell RNA-Sequencing Data
    Tian, Yu
    Zheng, Ruiqing
    Liang, Zhenlan
    Li, Suning
    Wu, Fang-Xiang
    Li, Min
    TSINGHUA SCIENCE AND TECHNOLOGY, 2021, 26 (05) : 772 - 789
  • [3] One-step spectral clustering of weighted variables on single-cell RNA-sequencing data
    Park, Min Young
    Park, Seyoung
    KOREAN JOURNAL OF APPLIED STATISTICS, 2020, 33 (04) : 511 - 526
  • [4] Machine learning and statistical methods for clustering single-cell RNA-sequencing data
    Petegrosso, Raphael
    Li, Zhuliu
    Kuang, Rui
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (04) : 1209 - 1223
  • [5] A HIERARCHICAL BAYESIAN MODEL FOR SINGLE-CELL CLUSTERING USING RNA-SEQUENCING DATA
    Liu, Yiyi
    Warren, Joshua L.
    Zhao, Hongyu
    ANNALS OF APPLIED STATISTICS, 2019, 13 (03) : 1733 - 1752
  • [6] Joint learning dimension reduction and clustering of single-cell RNA-sequencing data
    Wu, Wenming
    Ma, Xiaoke
    BIOINFORMATICS, 2020, 36 (12) : 3825 - 3832
  • [7] Clustering Single-cell RNA-sequencing Data based on Matching Clusters Structures
    Wang, Yizhang
    Zhou, You
    Pang, Wie
    Liang, Yanchun
    Wang, Shu
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2020, 27 (01): : 89 - 95
  • [8] An Introduction to the Analysis of Single-Cell RNA-Sequencing Data
    AlJanahi, Aisha A.
    Danielsen, Mark
    Dunbar, Cynthia E.
    MOLECULAR THERAPY-METHODS & CLINICAL DEVELOPMENT, 2018, 10 : 189 - 196
  • [9] Transcriptomics and single-cell RNA-sequencing
    Chambers, Daniel C.
    Carew, Alan M.
    Lukowski, Samuel W.
    Powell, Joseph E.
    RESPIROLOGY, 2019, 24 (01) : 29 - 36
  • [10] Single-cell RNA-sequencing in asthma research
    Tang, Weifeng
    Li, Mihui
    Teng, Fangzhou
    Cui, Jie
    Dong, Jingcheng
    Wang, Wenqian
    FRONTIERS IN IMMUNOLOGY, 2022, 13