Sources of variation in cell-type RNA-Seq profiles

被引:13
|
作者
Gustafsson, Johan [1 ,2 ]
Held, Felix [3 ,4 ]
Robinson, Jonathan L. [1 ,2 ]
Bjornson, Elias [1 ,5 ]
Jornsten, Rebecka [3 ,4 ]
Nielsen, Jens [1 ,2 ,6 ]
机构
[1] Chalmers Univ Technol, Dept Biol & Biol Engn, Gothenburg, Sweden
[2] Chalmers Univ Technol, Wallenberg Ctr Prot Res, Gothenburg, Sweden
[3] Univ Gothenburg, Math Sci, Gothenburg, Sweden
[4] Chalmers Univ Technol, Gothenburg, Sweden
[5] Univ Gothenburg, Wallenberg Lab Cardiovasc & Metab Res, Dept Mol & Clin Med, Gothenburg, Sweden
[6] BioInnovat Inst, Copenhagen, Denmark
来源
PLOS ONE | 2020年 / 15卷 / 09期
基金
美国国家卫生研究院;
关键词
MESSENGER-RNA; PACKAGE; GENES;
D O I
10.1371/journal.pone.0239495
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cell-type specific gene expression profiles are needed for many computational methods operating on bulk RNA-Seq samples, such as deconvolution of cell-type fractions and digital cytometry. However, the gene expression profile of a cell type can vary substantially due to both technical factors and biological differences in cell state and surroundings, reducing the efficacy of such methods. Here, we investigated which factors contribute most to this variation. We evaluated different normalization methods, quantified the variance explained by different factors, evaluated the effect on deconvolution of cell type fractions, and examined the differences between UMI-based single-cell RNA-Seq and bulk RNA-Seq. We investigated a collection of publicly available bulk and single-cell RNA-Seq datasets containing B and T cells, and found that the technical variation across laboratories is substantial, even for genes specifically selected for deconvolution, and this variation has a confounding effect on deconvolution. Tissue of origin is also a substantial factor, highlighting the challenge of using cell type profiles derived from blood with mixtures from other tissues. We also show that much of the differences between UMI-based single-cell and bulk RNA-Seq methods can be explained by the number of read duplicates per mRNA molecule in the single-cell sample. Our work shows the importance of either matching or correcting for technical factors when creating cell-type specific gene expression profiles that are to be used together with bulk samples.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] Integrated analysis of single-cell RNA-seq, bulk RNA-seq, Mendelian randomization, and eQTL reveals T cell-related nomogram model and subtype classification in rheumatoid arthritis
    Ding, Qiang
    Xu, Qingyuan
    Hong, Yini
    Zhou, Honghai
    He, Xinyu
    Niu, Chicheng
    Tian, Zhao
    Li, Hao
    Zeng, Ping
    Liu, Jinfu
    FRONTIERS IN IMMUNOLOGY, 2024, 15
  • [42] Studying bacterial transcriptomes using RNA-seq
    Croucher, Nicholas J.
    Thomson, Nicholas R.
    CURRENT OPINION IN MICROBIOLOGY, 2010, 13 (05) : 619 - 624
  • [43] Dynamics in Transcriptomics: Advancements in RNA-seq Time Course and Downstream Analysis
    Spies, Daniel
    Ciaudo, Constance
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2015, 13 : 469 - 477
  • [44] RNA-seq reveals novel mechanistic targets of Livin in bladder cancer
    Li, Xianwen
    Fu, Chunhua
    Li, Guofeng
    He, Haolin
    BMC UROLOGY, 2023, 23 (01)
  • [45] Latent periodic process inference from single-cell RNA-seq data
    Liang, Shaoheng
    Wang, Fang
    Han, Jincheng
    Chen, Ken
    NATURE COMMUNICATIONS, 2020, 11 (01)
  • [46] Granatum: a graphical single-cell RNA-Seq analysis pipeline for genomics scientists
    Zhu, Xun
    Wolfgruber, Thomas K.
    Tasato, Austin
    Arisdakessian, Cedric
    Garmire, David G.
    Garmire, Lana X.
    GENOME MEDICINE, 2017, 9
  • [47] Single-cell RNA-seq highlights a specific carcinoembryonic cluster in ovarian cancer
    Zhao, Hongyu
    Gao, Yan
    Miao, Jinwei
    Chen, Suwen
    Li, Jie
    Li, Zhefeng
    Yin, Chenghong
    Yue, Wentao
    CELL DEATH & DISEASE, 2021, 12 (11)
  • [48] ComBat-seq: batch effect adjustment for RNA-seq count data
    Zhang, Yuqing
    Parmigiani, Giovanni
    Johnson, W. Evan
    NAR GENOMICS AND BIOINFORMATICS, 2020, 2 (03)
  • [49] Giardia lamblia Transcriptome Analysis Using TSS-Seq and RNA-Seq
    Tolba, Mohammed E. M.
    Kobayashi, Seiki
    Imada, Mihoko
    Suzuki, Yutaka
    Sugano, Sumio
    PLOS ONE, 2013, 8 (10):
  • [50] A single-cell RNA-seq survey of the developmental landscape of the human prefrontal cortex
    Zhong, Suijuan
    Zhang, Shu
    Fan, Xiaoying
    Wu, Qian
    Yan, Liying
    Dong, Ji
    Zhang, Haofeng
    Li, Long
    Sun, Le
    Pan, Na
    Xu, Xiaohui
    Tang, Fuchou
    Zhang, Jun
    Qiao, Jie
    Wang, Xiaoqun
    NATURE, 2018, 555 (7697) : 524 - +