Truncated Robust Principal Component Analysis and Noise Reduction for Single Cell RNA Sequencing Data

被引:7
作者
Gogolewski, Krzysztof [1 ]
Sykulski, Maciej [2 ,3 ]
Chung, Neo Christopher [1 ]
Gambin, Anna [1 ]
机构
[1] Univ Warsaw, Inst Informat, Fac Math Informat & Mech, Banacha 2, PL-02097 Warsaw, Poland
[2] Warsaw Med Univ, Dept Med Genet, Warsaw, Poland
[3] GenXone Inc, Res & Dev Lab, Poznan, Poland
关键词
matrix decomposition; principal component analysis; robust PCA; single cell RNA-seq; truncated singular value decomposition; unsupervised learning; GENE-EXPRESSION; DECOMPOSITION;
D O I
10.1089/cmb.2018.0255
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The development of single cell RNA sequencing (scRNA-seq) has enabled innovative approaches to investigating mRNA abundances. In our study, we are interested in extracting the systematic patterns of scRNA-seq data in an unsupervised manner; thus, we have developed two extensions of robust principal component analysis (RPCA). First, we present a truncated version of RPCA (tRPCA), which is much faster and memory efficient. Second, we introduce a noise reduction in tRPCA with L-2 regularization. Unlike RPCA that only considers a low-rank L and sparse S matrices, the proposed method can also extract a noise E matrix inherent in modern genomic data. We demonstrate its usefulness by applying our methods on the peripheral blood mononuclear cell scRNA-seq data. Particularly, the clustering of a low-rank L matrix showcases better classification of unlabeled single cells. Overall, the proposed variants are well suited for high-dimensional and noisy data that are routinely generated in genomics.
引用
收藏
页码:782 / 793
页数:12
相关论文
共 50 条
  • [1] Truncated Robust Principal Component Analysis and Noise Reduction for Single Cell RNA-seq Data
    Gogolewski, Krzysztof
    Sykulski, Maciej
    Chung, Neo Christopher
    Gambin, Anna
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2018, 2018, 10847 : 335 - 346
  • [2] Benchmarking principal component analysis for large-scale single-cell RNA-sequencing
    Tsuyuzaki, Koki
    Sato, Hiroyuki
    Sato, Kenta
    Nikaido, Itoshi
    GENOME BIOLOGY, 2020, 21 (01)
  • [3] Benchmarking principal component analysis for large-scale single-cell RNA-sequencing
    Koki Tsuyuzaki
    Hiroyuki Sato
    Kenta Sato
    Itoshi Nikaido
    Genome Biology, 21
  • [4] Noise Reduction and Brain Mapping based Robust Principal Component Analysis
    Turnip, Arjon
    2015 IEEE 12th International Conference on Networking, Sensing and Control (ICNSC), 2015, : 550 - 553
  • [5] Nonlocal Weighted Robust Principal Component Analysis for Seismic Noise Attenuation
    Liu, Xingye
    Chen, Xiaohong
    Li, Jingye
    Chen, Yangkang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (02): : 1745 - 1756
  • [6] Screen technical noise in single cell RNA sequencing data
    Bai, Yu-Long
    Baddoo, Melody
    Flemington, Erik K.
    Nakhoul, Hani N.
    Liu, Yao-Zhong
    GENOMICS, 2020, 112 (01) : 346 - 355
  • [7] Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data
    Jia, Cheng
    Hu, Yu
    Kelly, Derek
    Kim, Junhyong
    Li, Mingyao
    Zhang, Nancy R.
    NUCLEIC ACIDS RESEARCH, 2017, 45 (19) : 10978 - 10988
  • [8] Noise reduction in remote sensing imagery using data masking and principal component analysis
    Corner, BR
    Narayanan, RM
    Reichenbach, SE
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XXIII, 2000, 4115 : 1 - 11
  • [9] Online robust principal component analysis via truncated nuclear norm regularization
    Hong, Bin
    Wei, Long
    Hu, Yao
    Cai, Deng
    He, Xiaofei
    NEUROCOMPUTING, 2016, 175 : 216 - 222
  • [10] SIEVE: identifying robust single cell variable genes for single-cell RNA sequencing data
    Zhang, Yinan
    Xie, Xiaowei
    Wu, Peng
    Zhu, Ping
    BLOOD SCIENCE, 2021, 3 (02): : 35 - 39