An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data

被引:17
作者
Sun, Xifang [1 ]
Sun, Shiquan [2 ,3 ]
Yang, Sheng [4 ]
机构
[1] Xian Shiyou Univ, Sch Sci, Dept Math, Xian 710065, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[3] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[4] Nanjing Med Univ, Sch Publ Hlth, Dept Biostat, Nanjing 211166, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
cell-type compositions; deconvolution; single-cell RNA-seq; nonnegative matrix factorization; gene expression; HETEROGENEITY; ORIGIN;
D O I
10.3390/cells8101161
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
Estimating cell type compositions for complex diseases is an important step to investigate the cellular heterogeneity for understanding disease etiology and potentially facilitate early disease diagnosis and prevention. Here, we developed a computationally statistical method, referring to Multi-Omics Matrix Factorization (MOMF), to estimate the cell-type compositions of bulk RNA sequencing (RNA-seq) data by leveraging cell type-specific gene expression levels from single-cell RNA sequencing (scRNA-seq) data. MOMF not only directly models the count nature of gene expression data, but also effectively accounts for the uncertainty of cell type-specific mean gene expression levels. We demonstrate the benefits of MOMF through three real data applications, i.e., Glioblastomas (GBM), colorectal cancer (CRC) and type II diabetes (T2D) studies. MOMF is able to accurately estimate disease-related cell type proportions, i.e., oligodendrocyte progenitor cells and macrophage cells, which are strongly associated with the survival of GBM and CRC, respectively.
引用
收藏
页数:18
相关论文
共 58 条
[1]   Cellular Heterogeneity: Do Differences Make a Difference? [J].
Altschuler, Steven J. ;
Wu, Lani F. .
CELL, 2010, 141 (04) :559-563
[2]  
Amrhein L., 2019, bioRxiv, P657619, DOI DOI 10.1101/657619
[3]   Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[4]   A Single-Cell Transcriptomic Map of the Human and Mouse Pancreas Reveals Inter- and Intra-cell Population Structure [J].
Baron, Maayan ;
Veres, Adrian ;
Wolock, Samuel L. ;
Faust, Aubrey L. ;
Gaujoux, Renaud ;
Vetere, Amedeo ;
Ryu, Jennifer Hyoje ;
Wagner, Bridget K. ;
Shen-Orr, Shai S. ;
Klein, Allon M. ;
Melton, Douglas A. ;
Yanai, Itai .
CELL SYSTEMS, 2016, 3 (04) :346-+
[5]   Distributed optimization and statistical learning via the alternating direction method of multipliers [J].
Boyd S. ;
Parikh N. ;
Chu E. ;
Peleato B. ;
Eckstein J. .
Foundations and Trends in Machine Learning, 2010, 3 (01) :1-122
[6]   Pancreatic duct replication is increased with obesity and type 2 diabetes in humans [J].
Butler, A. E. ;
Galasso, R. ;
Matveyenko, A. ;
Rizza, R. A. ;
Dry, S. ;
Butler, P. C. .
DIABETOLOGIA, 2010, 53 (01) :21-26
[7]   Integrating single-cell transcriptomic data across different conditions, technologies, and species [J].
Butler, Andrew ;
Hoffman, Paul ;
Smibert, Peter ;
Papalexi, Efthymia ;
Satija, Rahul .
NATURE BIOTECHNOLOGY, 2018, 36 (05) :411-+
[8]   Comprehensive genomic characterization defines human glioblastoma genes and core pathways [J].
Chin, L. ;
Meyerson, M. ;
Aldape, K. ;
Bigner, D. ;
Mikkelsen, T. ;
VandenBerg, S. ;
Kahn, A. ;
Penny, R. ;
Ferguson, M. L. ;
Gerhard, D. S. ;
Getz, G. ;
Brennan, C. ;
Taylor, B. S. ;
Winckler, W. ;
Park, P. ;
Ladanyi, M. ;
Hoadley, K. A. ;
Verhaak, R. G. W. ;
Hayes, D. N. ;
Spellman, Paul T. ;
Absher, D. ;
Weir, B. A. ;
Ding, L. ;
Wheeler, D. ;
Lawrence, M. S. ;
Cibulskis, K. ;
Mardis, E. ;
Zhang, Jinghui ;
Wilson, R. K. ;
Donehower, L. ;
Wheeler, D. A. ;
Purdom, E. ;
Wallis, J. ;
Laird, P. W. ;
Herman, J. G. ;
Schuebel, K. E. ;
Weisenberger, D. J. ;
Baylin, S. B. ;
Schultz, N. ;
Yao, Jun ;
Wiedemeyer, R. ;
Weinstein, J. ;
Sander, C. ;
Gibbs, R. A. ;
Gray, J. ;
Kucherlapati, R. ;
Lander, E. S. ;
Myers, R. M. ;
Perou, C. M. ;
McLendon, Roger .
NATURE, 2008, 455 (7216) :1061-1068
[9]   Performance Assessment and Selection of Normalization Procedures for Single-Cell RNA-Seq [J].
Cole, Michael B. ;
Risso, Davide ;
Wagner, Allon ;
DeTomaso, David ;
Ngai, John ;
Purdom, Elizabeth ;
Dudoit, Sandrine ;
Yosef, Nir .
CELL SYSTEMS, 2019, 8 (04) :315-+
[10]   A survey of human brain transcriptome diversity at the single cell level [J].
Darmanis, Spyros ;
Sloan, Steven A. ;
Zhang, Ye ;
Enge, Martin ;
Caneda, Christine ;
Shuer, Lawrence M. ;
Gephart, Melanie G. Hayden ;
Barres, Ben A. ;
Quake, Stephen R. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (23) :7285-7290