Non-Negative Low-Rank Representation With Similarity Correction for Cell Type Identification in scRNA-Seq Data

Cited by: 0
Authors
Liu, Jing-Xing [1]
Zhang, Dai-Jun [1]
Zhao, Jing-Xiu [1]
Zheng, Chun-Hou [1]
Gao, Ying-Lian [2]
Affiliations
[1] Qufu Normal Univ, Sch Comp Sci, Rizhao 276826, Shandong, Peoples R China
[2] Qufu Normal Univ, Lib Qufu Normal Univ, Rizhao 276826, Shandong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Single-cell RNA sequencing data; clustering; locality-sensitive hashing; manifold learning; low-rank representation; gene marker
DOI
10.1109/TCBB.2023.3319375
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Single-cell RNA sequencing (scRNA-Seq) technology has emerged as a powerful tool for investigating cellular heterogeneity within tissues, organs, and organisms. A fundamental question in single-cell gene expression analysis is the identification of cell types, which is a critical step in the data processing workflow. However, existing methods that identify cell types by learning low-dimensional latent embeddings often overlook the structural relationships among cells. In this paper, we present a novel non-negative low-rank similarity correction model (NLRSIM) that leverages subspace clustering to preserve the global structure among cells. The model introduces a new manifold learning process that addresses the imbalanced spatial density of cell neighbourhoods and thereby preserves local geometric structure. This process uses a locality-sensitive hashing algorithm to construct the graph structure of the data. Experimental results show that NLRSIM outperforms other advanced models in both clustering and visualization experiments. The validity of the gene expression information after calibration by NLRSIM is supported by relevant biological studies. NLRSIM thus offers insight into gene expression, cell states, and structures at the level of individual cells.
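The abstract does not specify the hashing scheme, so the following is only an illustrative sketch of how a locality-sensitive hash can produce a cell-cell graph. It assumes random-hyperplane (sign) hashing on a cells-by-genes matrix; the function name `lsh_neighbor_graph` and the parameters `n_bits` and `n_tables` are hypothetical and are not taken from the paper.

```python
import numpy as np


def lsh_neighbor_graph(X, n_bits=8, n_tables=4, seed=0):
    """Link two cells if they share a hash bucket in at least one table.

    X is a (cells x genes) expression matrix. This is an assumed
    random-hyperplane LSH, not the scheme used by NLRSIM.
    """
    rng = np.random.default_rng(seed)
    n_cells, n_genes = X.shape
    # Centre the data so hyperplanes through the origin can split it.
    Xc = X - X.mean(axis=0)
    # Powers of two used to pack each sign pattern into one integer key.
    weights = 1 << np.arange(n_bits, dtype=np.int64)
    adjacency = np.zeros((n_cells, n_cells), dtype=bool)
    for _ in range(n_tables):
        # One random hyperplane per hash bit.
        planes = rng.standard_normal((n_genes, n_bits))
        bits = (Xc @ planes > 0).astype(np.int64)
        keys = bits @ weights
        # Cells with equal keys fall into the same bucket in this table.
        adjacency |= keys[:, None] == keys[None, :]
    np.fill_diagonal(adjacency, False)  # no self-loops
    return adjacency


# Toy usage: 20 synthetic cells, 50 genes, with cell 1 duplicating cell 0.
demo_rng = np.random.default_rng(1)
X_demo = demo_rng.random((20, 50))
X_demo[1] = X_demo[0]
A_demo = lsh_neighbor_graph(X_demo)
```

Identical expression profiles always hash to the same bucket, so `A_demo[0, 1]` is `True`. Using several tables makes it more likely that genuinely similar cells collide at least once, at the cost of more spurious edges.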
Pages: 3737-3747
Page count: 11