Sources of variation in cell-type RNA-Seq profiles

Cited by: 13
Authors
Gustafsson, Johan [1, 2]
Held, Felix [3, 4]
Robinson, Jonathan L. [1, 2]
Bjornson, Elias [1, 5]
Jornsten, Rebecka [3, 4]
Nielsen, Jens [1, 2, 6]
Affiliations
[1] Chalmers Univ Technol, Dept Biol & Biol Engn, Gothenburg, Sweden
[2] Chalmers Univ Technol, Wallenberg Ctr Prot Res, Gothenburg, Sweden
[3] Univ Gothenburg, Math Sci, Gothenburg, Sweden
[4] Chalmers Univ Technol, Gothenburg, Sweden
[5] Univ Gothenburg, Wallenberg Lab Cardiovasc & Metab Res, Dept Mol & Clin Med, Gothenburg, Sweden
[6] BioInnovat Inst, Copenhagen, Denmark
Source
PLOS ONE | 2020 / Volume 15 / Issue 09
Funding
National Institutes of Health (USA);
Keywords
MESSENGER-RNA; PACKAGE; GENES;
DOI
10.1371/journal.pone.0239495
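Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and Earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code
07; 0710; 09;
Abstract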
Cell-type specific gene expression profiles are needed for many computational methods operating on bulk RNA-Seq samples, such as deconvolution of cell-type fractions and digital cytometry. However, the gene expression profile of a cell type can vary substantially due to both technical factors and biological differences in cell state and surroundings, reducing the efficacy of such methods. Here, we investigated which factors contribute most to this variation. We evaluated different normalization methods, quantified the variance explained by different factors, evaluated the effect on deconvolution of cell-type fractions, and examined the differences between UMI-based single-cell RNA-Seq and bulk RNA-Seq. We investigated a collection of publicly available bulk and single-cell RNA-Seq datasets containing B and T cells, and found that the technical variation across laboratories is substantial, even for genes specifically selected for deconvolution, and that this variation has a confounding effect on deconvolution. Tissue of origin is also a substantial factor, highlighting the challenge of using cell-type profiles derived from blood with mixtures from other tissues. We also show that much of the difference between UMI-based single-cell and bulk RNA-Seq methods can be explained by the number of read duplicates per mRNA molecule in the single-cell sample. Our work shows the importance of either matching or correcting for technical factors when creating cell-type specific gene expression profiles that are to be used together with bulk samples.
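To make the confounding effect concrete, the sketch below deconvolves a two-cell-type mixture (B and T cells) by ordinary least squares and then repeats the estimate with reference profiles distorted by a gene-wise technical factor. It is an illustration only, not the method used in the paper: the function name `estimate_b_cell_fraction`, the four marker-gene values, and the `lab_effect` multipliers are all hypothetical.

```python
def estimate_b_cell_fraction(mixture, b_profile, t_profile):
    """Least-squares estimate of f in mixture ~ f*b + (1 - f)*t, clipped to [0, 1]."""
    diff = [b - t for b, t in zip(b_profile, t_profile)]
    resid = [m - t for m, t in zip(mixture, t_profile)]
    denom = sum(d * d for d in diff)
    if denom == 0:
        raise ValueError("profiles are identical; the fraction is not identifiable")
    f = sum(r * d for r, d in zip(resid, diff)) / denom
    return min(1.0, max(0.0, f))


# Hypothetical expression values (arbitrary units) for four marker genes.
b_profile = [100.0, 80.0, 5.0, 10.0]
t_profile = [5.0, 10.0, 90.0, 120.0]

# Simulated bulk sample containing 30% B cells and 70% T cells.
true_fraction = 0.3
mixture = [true_fraction * b + (1 - true_fraction) * t
           for b, t in zip(b_profile, t_profile)]

# Reference profiles generated under the same conditions as the mixture.
matched = estimate_b_cell_fraction(mixture, b_profile, t_profile)

# Hypothetical gene-wise technical effect (e.g. a different laboratory),
# applied to the reference profiles but not to the mixture.
lab_effect = [1.5, 0.7, 1.2, 0.6]
b_shifted = [b * e for b, e in zip(b_profile, lab_effect)]
t_shifted = [t * e for t, e in zip(t_profile, lab_effect)]
mismatched = estimate_b_cell_fraction(mixture, b_shifted, t_shifted)

print(f"true fraction:       {true_fraction:.3f}")
print(f"matched profiles:    {matched:.3f}")
print(f"mismatched profiles: {mismatched:.3f}")
```

With matched profiles the estimate recovers the simulated fraction. With the distorted profiles it drifts away from it, which is the kind of bias the abstract attributes to unmatched technical factors.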
Pages: 24