GSTRPCA: irregular tensor singular value decomposition for single-cell multi-omics data clustering

Cited by: 0
Authors
Cui, Lubin [1]
Guo, Guiliang [1]
Ng, Michael K. [2]
Zou, Quan [3]
Qiu, Yushan [4]
Affiliations
[1] Henan Normal Univ, Sch Math & Stat, Xinxiang 453007, Peoples R China
[2] Hong Kong Baptist Univ, Dept Math, Hong Kong 999077, Peoples R China
[3] Elect Sci & Technol Univ, Inst Fundamental & Frontier Sci, Chengdu 611731, Peoples R China
[4] Shenzhen Univ, Sch Math Sci, Shenzhen 518000, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
single-cell multi-omics data; irregular tensor decomposition; weighted threshold; joint tensor; PROTEINS;
DOI
10.1093/bib/bbae649
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Single-cell multi-omics refers to the various types of biological data measured at the single-cell level. These data have enabled high-resolution insight into cellular phenotypes, biological processes, and developmental stages. Current advances hold high potential for breakthroughs by integrating multiple omics layers. However, single-cell multi-omics data usually have different feature dimensions and direct or indirect relationships. Preserving the structure of these heterogeneous data while extracting hidden relationships is a major challenge for omics data integration, and effective integration models are urgently needed. In this paper, we propose an irregular tensor decomposition model (GSTRPCA) based on tensor robust principal component analysis (TRPCA). We developed a weighted threshold model for the decomposition of irregular tensor data by combining low-rank and sparsity constraints, which requires that the low-dimensional embeddings of the data remain low-rank and sparse. The major advantage of the GSTRPCA algorithm is its ability to keep the original data structure and explore hidden related features among omics data. For GSTRPCA, we also designed an effective algorithm that theoretically guarantees global convergence for the tensor decomposition. Computational experiments on irregular tensor datasets demonstrate that GSTRPCA significantly outperformed state-of-the-art methods, confirming the superiority of GSTRPCA in clustering single-cell multi-omics data. To our knowledge, this is the first tensor decomposition method for irregular tensor data that keeps the data structure and thereby improves clustering performance for single-cell multi-omics data. GSTRPCA is a Matlab-based algorithm, and the code is available from https://github.com/GGL-B/GSTRPCA.
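The abstract does not spell out the weighted threshold model, so the following is only a minimal sketch of the generic building block that such models usually rest on: weighted soft-thresholding, the proximal operator of a weighted l1 penalty. In robust-PCA-style methods this kind of shrinkage is commonly applied to singular values (to encourage low rank) and to individual entries (to encourage sparsity). The function name, the example values, and the weights below are all hypothetical and are not taken from the GSTRPCA code; Python is used here only for illustration, whereas the authors' implementation is in Matlab.

```python
def weighted_soft_threshold(values, weights, tau):
    """Shrink each value toward zero by tau * weight.

    This is the proximal operator of the weighted l1 penalty
    tau * sum(w_i * |x_i|). It is a generic illustration and
    not the authors' specific weighting scheme.
    """
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    shrunk = []
    for v, w in zip(values, weights):
        # Reduce the magnitude by the per-entry threshold, clipping at zero.
        magnitude = max(abs(v) - tau * w, 0.0)
        shrunk.append(magnitude if v >= 0 else -magnitude)
    return shrunk


# Hypothetical singular values in descending order.
singular_values = [5.0, 2.0, 0.5]
# Assumed weights: larger singular values are penalised less,
# so dominant structure is kept while small components are removed.
weights = [0.2, 0.5, 2.0]

print(weighted_soft_threshold(singular_values, weights, tau=1.0))
```

With these assumed weights, the two largest values are shrunk only slightly while the smallest is set to zero, which is how a weighted threshold can lower the effective rank without heavily distorting the leading components.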
Pages: 11