Deconvolution from bulk gene expression by leveraging sample-wise and gene-wise similarities and single-cell RNA-Seq data

Times cited: 1
Authors
Wang, Chenqi [1]
Lin, Yifan [1]
Li, Shuchao [1]
Guan, Jinting [1, 2, 3]
Affiliations
[1] Xiamen Univ, Dept Automat, Xiamen, Peoples R China
[2] Minist Educ, Key Lab Syst Control & Informat Proc, Shanghai, Peoples R China
[3] Xiamen Univ, Natl Inst Data Sci Hlth & Med, Xiamen, Peoples R China
Source
BMC GENOMICS | 2024 / Volume 25 / Issue 01
Keywords
Deconvolution; Cell type abundance; Cell type-specific gene expression profile; Similarity matrix; Single-cell RNA-seq data; MOUSE; MAP; NORMALIZATION; HETEROGENEITY; DIVERSITY; ATLAS; STEM;
DOI
10.1186/s12864-024-10728-x
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: The widely adopted bulk RNA-seq measures the gene expression average of cells, masking cell type heterogeneity, which confounds downstream analyses. Therefore, identifying the cellular composition and cell type-specific gene expression profiles (GEPs) facilitates the study of the underlying mechanisms of various biological processes. Although single-cell RNA-seq focuses on cell type heterogeneity in gene expression, it requires specialized and expensive resources and currently is not practical for a large number of samples or a routine clinical setting. Recently, computational deconvolution methodologies have been developed, while many of them only estimate cell type composition or cell type-specific GEPs by requiring the other as input. The development of more accurate deconvolution methods to infer cell type abundance and cell type-specific GEPs is still essential.
Results: We propose a new deconvolution algorithm, DSSC, which infers cell type-specific gene expression and cell type proportions of heterogeneous samples simultaneously by leveraging gene-gene and sample-sample similarities in bulk expression and single-cell RNA-seq data. Through comparisons with the other existing methods, we demonstrate that DSSC is effective in inferring both cell type proportions and cell type-specific GEPs across simulated pseudo-bulk data (including intra-dataset and inter-dataset simulations) and experimental bulk data (including mixture data and real experimental data). DSSC shows robustness to the change of marker gene number and sample size and also has cost and time efficiencies.
Conclusions: DSSC provides a practical and promising alternative to the experimental techniques to characterize cellular composition and heterogeneity in the gene expression of heterogeneous samples.
Pages: 19