netNMF-sc: leveraging gene-gene interactions for imputation and dimensionality reduction in single-cell expression analysis

被引:63
|
作者
Elyanow, Rebecca [1 ,2 ]
Dumitrascu, Bianca [3 ,5 ,6 ]
Engelhardt, Barbara E. [2 ,4 ,7 ]
Raphael, Benjamin J. [2 ]
机构
[1] Brown Univ, Ctr Computat Mol Biol, Providence, RI 02912 USA
[2] Princeton Univ, Dept Comp Sci, Princeton, NJ 08540 USA
[3] Princeton Univ, Lewis Sigler Inst Integrat Genom, Princeton, NJ 08540 USA
[4] Princeton Univ, Ctr Stat & Machine Learning, Princeton, NJ 08540 USA
[5] Duke Univ, SAMSI, Durham, NC 27706 USA
[6] Duke Univ, Dept Stat Sci, Durham, NC 27706 USA
[7] Genomics Plc, Oxford, England
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
RNA-SEQ; REVEALS; TRANSCRIPTOMICS; MICROARRAY; DATABASE;
D O I
10.1101/gr.251603.119
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell RNA-sequencing (scRNA-seq) enables high-throughput measurement of RNA expression in single cells. However, because of technical limitations, scRNA-seq data often contain zero counts for many transcripts in individual cells. These zero counts, or dropout events, complicate the analysis of scRNA-seq data using standard methods developed for bulk RNA-seq data. Current scRNA-seq analysis methods typically overcome dropout by combining information across cells in a lower-dimensional space, leveraging the observation that cells generally occupy a small number of RNA expression states. We introduce netNMF-sc, an algorithm for scRNA-seq analysis that leverages information across both cells and genes. netNMF-sc learns a low-dimensional representation of scRNA-seq transcript counts using network-regularized non-negative matrix factorization. The network regularization takes advantage of prior knowledge of gene-gene interactions, encouraging pairs of genes with known interactions to be nearby each other in the low-dimensional representation. The resulting matrix factorization imputes gene abundance for both zero and nonzero counts and can be used to cluster cells into meaningful subpopulations. We show that netNMF-sc outperforms existing methods at clustering cells and estimating gene-gene covariance using both simulated and real scRNA-seq data, with increasing advantages at higher dropout rates (e.g., >60%). We also show that the results from netNMF-sc are robust to variation in the input network, with more representative networks leading to greater performance gains.
引用
收藏
页码:195 / 204
页数:10
相关论文
共 50 条
  • [31] Detection of Imprinted Genes by Single-Cell Allele-Specific Gene Expression
    Santoni, Federico A.
    Stamoulis, Georgios
    Garieri, Marco
    Falconnet, Emilie
    Ribaux, Pascale
    Borel, Christelle
    Antonarakis, Stylianos E.
    AMERICAN JOURNAL OF HUMAN GENETICS, 2017, 100 (03) : 444 - 453
  • [32] Single-cell transcriptome analysis reveals widespread monoallelic gene expression in individual rice mesophyll cells
    Han, Yingying
    Chu, Xiao
    Yu, Haopeng
    Ma, Ying-Ke
    Wang, Xiu-Jie
    Qian, Wenfeng
    Jiao, Yuling
    SCIENCE BULLETIN, 2017, 62 (19) : 1304 - 1314
  • [33] Method for single-cell microarray analysis and application to gene-expression profiling of GABAergic neuron progenitors
    Esumi, Shigeyuki
    Wu, Sheng-Xi
    Yanagawa, Yuchio
    Obata, Kunihiko
    Suaimoto, Yukihiko
    Tamamaki, Nobuald
    NEUROSCIENCE RESEARCH, 2008, 60 (04) : 439 - 451
  • [34] Single-cell gene and isoform expression analysis reveals signatures of ageing in haematopoietic stem and progenitor cells
    Mincarelli, Laura
    Uzun, Vladimir
    Wright, David
    Scoones, Anita
    Rushworth, Stuart A.
    Haerty, Wilfried
    Macaulay, Iain C.
    COMMUNICATIONS BIOLOGY, 2023, 6 (01)
  • [35] MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data
    Feigelman, Justin
    Theis, Fabian J.
    Marr, Carsten
    BMC BIOINFORMATICS, 2014, 15
  • [36] A scalable SCENIC workflow for single-cell gene regulatory network analysis
    van de Sande, Bram
    Flerin, Christopher
    Davie, Kristofer
    De Waegeneer, Maxime
    Hulselmans, Gert
    Aibar, Sara
    Seurinck, Ruth
    Saelens, Wouter
    Cannoodt, Robrecht
    Rouchon, Quentin
    Verbeiren, Toni
    De Maeyer, Dries
    Reumers, Joke
    Saeys, Yvan
    Aerts, Stein
    NATURE PROTOCOLS, 2020, 15 (07) : 2247 - 2276
  • [37] Beyond comparisons of means: understanding changes in gene expression at the single-cell level
    Vallejos, Catalina A.
    Richardson, Sylvia
    Marioni, John C.
    GENOME BIOLOGY, 2016, 17
  • [38] Multi-modal generative modeling for joint analysis of single-cell T cell receptor and gene expression data
    Drost, Felix
    An, Yang
    Bonafonte-Pardas, Irene
    Dratva, Lisa M.
    Lindeboom, Rik G. H.
    Haniffa, Muzlifah
    Teichmann, Sarah A.
    Theis, Fabian
    Lotfollahi, Mohammad
    Schubert, Benjamin
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [39] Single-cell gene expression reveals a landscape of regulatory T cell phenotypes shaped by the TCR
    Zemmour, David
    Zilionis, Rapolas
    Kiner, Evgeny
    Klein, Allon M.
    Mathis, Diane
    Benoist, Christophe
    NATURE IMMUNOLOGY, 2018, 19 (03) : 291 - +
  • [40] From single-cell to cell-pool transcriptomes: Stochasticity in gene expression and RNA splicing
    Marinov, Georgi K.
    Williams, Brian A.
    McCue, Ken
    Schroth, Gary P.
    Gertz, Jason
    Myers, Richard M.
    Wold, Barbara J.
    GENOME RESEARCH, 2014, 24 (03) : 496 - 510