LRSK: a low-rank self-representation K-means method for clustering single-cell RNA-sequencing data
Cited by: 7
Authors:
Sun, Ye-Sen [1]
Le Ou-Yang [2]
Dai, Dao-Qing [1]
Affiliations:
[1] Sun Yat Sen Univ, Intelligent Data Ctr, Sch Math, Guangzhou, Peoples R China
[2] Shenzhen Univ, Shenzhen Key Lab Media Secur, Coll Elect & Informat Engn, Shenzhen, Peoples R China
Funding:
National Natural Science Foundation of China;
Keywords:
GENE-EXPRESSION;
HETEROGENEITY;
FATE;
DOI:
10.1039/d0mo00034e
Chinese Library Classification (CLC):
Q5 [Biochemistry];
Q7 [Molecular Biology];
Discipline classification codes:
071010 ;
081704 ;
Abstract:
The development of single-cell RNA-sequencing (scRNA-seq) technologies brings tremendous opportunities for quantitative research and analyses at the cellular level. In particular, as a crucial task of scRNA-seq analysis, single-cell clustering shines a light on natural groupings of cells to give new insights into the biological mechanisms and disease studies. However, it remains a challenge to identify cell clusters from lots of cell mixtures effectively and accurately. In this paper, we propose a novel adaptive joint clustering framework, named the low-rank self-representation K-means method (LRSK), to learn the data representation matrix and cluster indicator matrix jointly from scRNA-seq data. Specifically, instead of calculating the similarities among cells from the original data, we seek a low-rank representation of the original data to better reflect the underlying relationships among cells. Moreover, an Augmented Lagrangian Multiplier (ALM) based optimization algorithm is adopted to solve this problem. Experimental results on various scRNA-seq datasets and case studies demonstrate that our method performs better than other state-of-the-art single-cell clustering algorithms. The analysis of unlabeled large single-cell liver cancer sequencing data further shows that our prediction results are more reasonable and interpretable.
Pages: 465-473
Number of pages: 9