netNMF-sc: leveraging gene-gene interactions for imputation and dimensionality reduction in single-cell expression analysis

被引:63
|
作者
Elyanow, Rebecca [1 ,2 ]
Dumitrascu, Bianca [3 ,5 ,6 ]
Engelhardt, Barbara E. [2 ,4 ,7 ]
Raphael, Benjamin J. [2 ]
机构
[1] Brown Univ, Ctr Computat Mol Biol, Providence, RI 02912 USA
[2] Princeton Univ, Dept Comp Sci, Princeton, NJ 08540 USA
[3] Princeton Univ, Lewis Sigler Inst Integrat Genom, Princeton, NJ 08540 USA
[4] Princeton Univ, Ctr Stat & Machine Learning, Princeton, NJ 08540 USA
[5] Duke Univ, SAMSI, Durham, NC 27706 USA
[6] Duke Univ, Dept Stat Sci, Durham, NC 27706 USA
[7] Genomics Plc, Oxford, England
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
RNA-SEQ; REVEALS; TRANSCRIPTOMICS; MICROARRAY; DATABASE;
D O I
10.1101/gr.251603.119
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell RNA-sequencing (scRNA-seq) enables high-throughput measurement of RNA expression in single cells. However, because of technical limitations, scRNA-seq data often contain zero counts for many transcripts in individual cells. These zero counts, or dropout events, complicate the analysis of scRNA-seq data using standard methods developed for bulk RNA-seq data. Current scRNA-seq analysis methods typically overcome dropout by combining information across cells in a lower-dimensional space, leveraging the observation that cells generally occupy a small number of RNA expression states. We introduce netNMF-sc, an algorithm for scRNA-seq analysis that leverages information across both cells and genes. netNMF-sc learns a low-dimensional representation of scRNA-seq transcript counts using network-regularized non-negative matrix factorization. The network regularization takes advantage of prior knowledge of gene-gene interactions, encouraging pairs of genes with known interactions to be nearby each other in the low-dimensional representation. The resulting matrix factorization imputes gene abundance for both zero and nonzero counts and can be used to cluster cells into meaningful subpopulations. We show that netNMF-sc outperforms existing methods at clustering cells and estimating gene-gene covariance using both simulated and real scRNA-seq data, with increasing advantages at higher dropout rates (e.g., >60%). We also show that the results from netNMF-sc are robust to variation in the input network, with more representative networks leading to greater performance gains.
引用
收藏
页码:195 / 204
页数:10
相关论文
共 50 条
  • [1] Multivariate Quantitative Multifactor Dimensionality Reduction for Detecting Gene-Gene Interactions
    Yu, Wenbao
    Kwon, Min-Seok
    Park, Taesung
    HUMAN HEREDITY, 2015, 79 (3-4) : 168 - 181
  • [2] Single-Cell and Single-Molecule Analysis of Gene Expression Regulation
    Vera, Maria
    Biswas, Jeetayu
    Senecal, Adrien
    Singer, Robert H.
    Park, Hye Yoon
    ANNUAL REVIEW OF GENETICS, VOL 50, 2016, 50 : 267 - 291
  • [3] Spatial reconstruction of single-cell gene expression data
    Satija, Rahul
    Farrell, Jeffrey A.
    Gennert, David
    Schier, Alexander F.
    Regev, Aviv
    NATURE BIOTECHNOLOGY, 2015, 33 (05) : 495 - U206
  • [4] Single-cell gene expression of the bovine blastocyst
    Negron-Perez, Veronica M.
    Zhang, Yanping
    Hansen, Peter J.
    REPRODUCTION, 2017, 154 (05) : 627 - 644
  • [5] Single-Cell and Regional Gene Expression Analysis in Alzheimer’s Disease
    Ruby Kwong
    Michelle K. Lupton
    Michal Janitz
    Cellular and Molecular Neurobiology, 2012, 32 : 477 - 489
  • [6] Spatiotemporal single-cell analysis of gene expression in the mouse suprachiasmatic nucleus
    Wen, Shao'ang
    Ma, Danyi
    Zhao, Meng
    Xie, Lucheng
    Wu, Qingqin
    Gou, Lingfeng
    Zhu, Chuanzhen
    Fan, Yuqi
    Wang, Haifang
    Yan, Jun
    NATURE NEUROSCIENCE, 2020, 23 (03) : 456 - +
  • [7] Single-Cell and Regional Gene Expression Analysis in Alzheimer's Disease
    Kwong, Ruby
    Lupton, Michelle K.
    Janitz, Michal
    CELLULAR AND MOLECULAR NEUROBIOLOGY, 2012, 32 (04) : 477 - 489
  • [8] SC3-seq: a method for highly parallel and quantitative measurement of single-cell gene expression
    Nakamura, Tomonori
    Yabuta, Yukihiro
    Okamoto, Ikuhiro
    Aramaki, Shinya
    Yokobayashi, Shihori
    Kurimoto, Kazuki
    Sekiguchi, Kiyotoshi
    Nakagawa, Masato
    Yamamoto, Takuya
    Saitou, Mitinori
    NUCLEIC ACIDS RESEARCH, 2015, 43 (09)
  • [9] Temporal dynamics and transcriptional control using single-cell gene expression analysis
    Kouno, Tsukasa
    de Hoon, Michiel
    Mar, Jessica C.
    Tomaru, Yasuhiro
    Kawano, Mitsuoki
    Carninci, Piero
    Suzuki, Harukazu
    Hayashizaki, Yoshihide
    Shin, Jay W.
    GENOME BIOLOGY, 2013, 14 (10):
  • [10] Bifurcation analysis of single-cell gene expression data reveals epigenetic landscape
    Marco, Eugenio
    Karp, Robert L.
    Guo, Guoji
    Robson, Paul
    Hart, Adam H.
    Trippa, Lorenzo
    Yuan, Guo-Cheng
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 (52) : E5643 - E5650