Truncated Robust Principal Component Analysis and Noise Reduction for Single Cell RNA Sequencing Data

被引:7
作者
Gogolewski, Krzysztof [1 ]
Sykulski, Maciej [2 ,3 ]
Chung, Neo Christopher [1 ]
Gambin, Anna [1 ]
机构
[1] Univ Warsaw, Inst Informat, Fac Math Informat & Mech, Banacha 2, PL-02097 Warsaw, Poland
[2] Warsaw Med Univ, Dept Med Genet, Warsaw, Poland
[3] GenXone Inc, Res & Dev Lab, Poznan, Poland
关键词
matrix decomposition; principal component analysis; robust PCA; single cell RNA-seq; truncated singular value decomposition; unsupervised learning; GENE-EXPRESSION; DECOMPOSITION;
D O I
10.1089/cmb.2018.0255
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The development of single cell RNA sequencing (scRNA-seq) has enabled innovative approaches to investigating mRNA abundances. In our study, we are interested in extracting the systematic patterns of scRNA-seq data in an unsupervised manner; thus, we have developed two extensions of robust principal component analysis (RPCA). First, we present a truncated version of RPCA (tRPCA), which is much faster and memory efficient. Second, we introduce a noise reduction in tRPCA with L-2 regularization. Unlike RPCA that only considers a low-rank L and sparse S matrices, the proposed method can also extract a noise E matrix inherent in modern genomic data. We demonstrate its usefulness by applying our methods on the peripheral blood mononuclear cell scRNA-seq data. Particularly, the clustering of a low-rank L matrix showcases better classification of unlabeled single cells. Overall, the proposed variants are well suited for high-dimensional and noisy data that are routinely generated in genomics.
引用
收藏
页码:782 / 793
页数:12
相关论文
共 50 条
  • [41] Coherence Pursuit: Fast, Simple, and Robust Principal Component Analysis
    Rahmani, Mostafa
    Atia, George K.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2017, 65 (23) : 6260 - 6275
  • [42] Online Robust Principal Component Analysis With Change Point Detection
    Xiao, Wei
    Huang, Xiaolin
    He, Fan
    Silva, Jorge
    Emrani, Saba
    Chaudhuri, Arin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (01) : 59 - 68
  • [43] Application of Deep Learning on Single-cell RNA Sequencing Data Analysis: A Review
    Brendel, Matthew
    Su, Chang
    Bai, Zilong
    Zhang, Hao
    Elemento, Olivier
    Wang, Fei
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2022, 20 (05) : 814 - 835
  • [44] Tools for the analysis of high-dimensional single-cell RNA sequencing data
    Wu, Yan
    Zhang, Kun
    NATURE REVIEWS NEPHROLOGY, 2020, 16 (07) : 408 - 421
  • [45] Modelling Non-stationarities in EEG Data with Robust Principal Component Analysis
    Pascual, Javier
    Kawanabe, Motoaki
    Vidaurre, Carmen
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PART II, 2011, 6679 : 51 - 58
  • [46] Data Subdivision Based Dual-Weighted Robust Principal Component Analysis
    Wang, Sisi
    Nie, Feiping
    Wang, Zheng
    Wang, Rong
    Li, Xuelong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 1271 - 1284
  • [47] Structure aware noise reduction of multi-channel ground penetrating radar data using Principal Component Analysis
    Linford, Neil
    ADVANCES IN ON- AND OFFSHORE ARCHAEOLOGICAL PROSPECTION: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ARCHAEOLOGICAL PROSPECTION, 2023, : 427 - 429
  • [48] Studies of High Spectral Resolution Atmospheric Sounding Data Compression and Noise Reduction Based on Principal Component Analysis Method
    Zhang Shuiping
    Zhang Shuiping
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4693 - +
  • [49] Robust Principal Component Analysis: An IRLS Approach
    Polyak, Boris T.
    Khlebnikov, Mikhail V.
    IFAC PAPERSONLINE, 2017, 50 (01): : 2762 - 2767
  • [50] Robust kernel principal component analysis and classification
    Michiel Debruyne
    Tim Verdonck
    Advances in Data Analysis and Classification, 2010, 4 : 151 - 167