An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data

被引:16
|
作者
Sun, Xifang [1 ]
Sun, Shiquan [2 ,3 ]
Yang, Sheng [4 ]
机构
[1] Xian Shiyou Univ, Sch Sci, Dept Math, Xian 710065, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[3] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[4] Nanjing Med Univ, Sch Publ Hlth, Dept Biostat, Nanjing 211166, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
cell-type compositions; deconvolution; single-cell RNA-seq; nonnegative matrix factorization; gene expression; HETEROGENEITY; ORIGIN;
D O I
10.3390/cells8101161
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
Estimating cell type compositions for complex diseases is an important step to investigate the cellular heterogeneity for understanding disease etiology and potentially facilitate early disease diagnosis and prevention. Here, we developed a computationally statistical method, referring to Multi-Omics Matrix Factorization (MOMF), to estimate the cell-type compositions of bulk RNA sequencing (RNA-seq) data by leveraging cell type-specific gene expression levels from single-cell RNA sequencing (scRNA-seq) data. MOMF not only directly models the count nature of gene expression data, but also effectively accounts for the uncertainty of cell type-specific mean gene expression levels. We demonstrate the benefits of MOMF through three real data applications, i.e., Glioblastomas (GBM), colorectal cancer (CRC) and type II diabetes (T2D) studies. MOMF is able to accurately estimate disease-related cell type proportions, i.e., oligodendrocyte progenitor cells and macrophage cells, which are strongly associated with the survival of GBM and CRC, respectively.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Normalization Methods on Single-Cell RNA-seq Data: An Empirical Survey
    Lytal, Nicholas
    Ran, Di
    An, Lingling
    FRONTIERS IN GENETICS, 2020, 11
  • [32] Zero-preserving imputation of single-cell RNA-seq data
    Linderman, George C.
    Zhao, Jun
    Roulis, Manolis
    Bielecki, Piotr
    Flavell, Richard A.
    Nadler, Boaz
    Kluger, Yuval
    NATURE COMMUNICATIONS, 2022, 13 (01)
  • [33] A Hybrid Clustering Algorithm for Identifying Cell Types from Single-Cell RNA-Seq Data
    Zhu, Xiaoshu
    Li, Hong-Dong
    Xu, Yunpei
    Guo, Lilu
    Wu, Fang-Xiang
    Duan, Guihua
    Wang, Jianxin
    GENES, 2019, 10 (02)
  • [34] A novel method for predicting cell abundance based on single-cell RNA-seq data
    Jiajie Peng
    Lu Han
    Xuequn Shang
    BMC Bioinformatics, 22
  • [35] A novel method for predicting cell abundance based on single-cell RNA-seq data
    Peng, Jiajie
    Han, Lu
    Shang, Xuequn
    BMC BIOINFORMATICS, 2021, 22 (SUPPL 9)
  • [36] Construction of cancer- associated fibroblasts related risk signature based on single-cell RNA-seq and bulk RNA-seq data in bladder urothelial carcinoma
    Liu, Yunxun
    Jian, Jun
    Zhang, Ye
    Wang, Lei
    Liu, Xiuheng
    Chen, Zhiyuan
    FRONTIERS IN ONCOLOGY, 2023, 13
  • [37] Imputation method for single-cell RNA-seq data using neural topic model
    Qi, Yueyang
    Han, Shuangkai
    Tang, Lin
    Liu, Lin
    GIGASCIENCE, 2023, 12
  • [38] Integrated analysis of single-cell RNA-seq and bulk RNA-seq unravels the heterogeneity of cancer-associated fibroblasts in TNBC
    Wu, Xiaoqing
    Lu, Wenping
    Zhang, Weixuan
    Zhang, Dongni
    Mei, Heting
    Zhang, Mengfan
    Cui, Yongjia
    Zhuo, Zhili
    AGING-US, 2023, 15 (21): : 12674 - 12697
  • [39] Joint CC and Bimax: A Biclustering Method for Single-Cell RNA-Seq Data Analysis
    Chu, He-Ming
    Kong, Xiang-Zhen
    Liu, Jin-Xing
    Wang, Juan
    Yuan, Sha-Sha
    Dai, Ling-Yun
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2021, 2021, 13064 : 499 - 510
  • [40] scINRB: single-cell gene expression imputation with network regularization and bulk RNA-seq data
    Kang, Yue
    Zhang, Hongyu
    Guan, Jinting
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (03)