Sources of variation in cell-type RNA-Seq profiles

Cited by: 13
Authors
Gustafsson, Johan [1, 2]
Held, Felix [3, 4]
Robinson, Jonathan L. [1, 2]
Bjornson, Elias [1, 5]
Jornsten, Rebecka [3, 4]
Nielsen, Jens [1, 2, 6]
Affiliations
[1] Chalmers Univ Technol, Dept Biol & Biol Engn, Gothenburg, Sweden
[2] Chalmers Univ Technol, Wallenberg Ctr Prot Res, Gothenburg, Sweden
[3] Univ Gothenburg, Math Sci, Gothenburg, Sweden
[4] Chalmers Univ Technol, Gothenburg, Sweden
[5] Univ Gothenburg, Wallenberg Lab Cardiovasc & Metab Res, Dept Mol & Clin Med, Gothenburg, Sweden
[6] BioInnovat Inst, Copenhagen, Denmark
Source
PLOS ONE | 2020 / Vol. 15 / Issue 09
Funding
National Institutes of Health (USA);
Keywords
MESSENGER-RNA; PACKAGE; GENES;
DOI
10.1371/journal.pone.0239495
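Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code
07; 0710; 09;
Abstract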
Cell-type-specific gene expression profiles are needed for many computational methods operating on bulk RNA-Seq samples, such as deconvolution of cell-type fractions and digital cytometry. However, the gene expression profile of a cell type can vary substantially due to both technical factors and biological differences in cell state and surroundings, reducing the efficacy of such methods. Here, we investigated which factors contribute most to this variation. We evaluated different normalization methods, quantified the variance explained by different factors, evaluated the effect on deconvolution of cell-type fractions, and examined the differences between UMI-based single-cell RNA-Seq and bulk RNA-Seq. We investigated a collection of publicly available bulk and single-cell RNA-Seq datasets containing B and T cells, and found that the technical variation across laboratories is substantial, even for genes specifically selected for deconvolution, and that this variation has a confounding effect on deconvolution. Tissue of origin is also a substantial factor, highlighting the challenge of using cell-type profiles derived from blood with mixtures from other tissues. We also show that many of the differences between UMI-based single-cell and bulk RNA-Seq methods can be explained by the number of read duplicates per mRNA molecule in the single-cell sample. Our work shows the importance of either matching or correcting for technical factors when creating cell-type-specific gene expression profiles that are to be used together with bulk samples.
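
The confounding effect on deconvolution described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes a two-cell-type mixture model, bulk ≈ f·B + (1 − f)·T, solved by ordinary least squares, and every expression value, the per-gene bias, and the function name `estimate_b_fraction` are hypothetical and chosen only for illustration.

```python
def estimate_b_fraction(bulk, b_profile, t_profile):
    """Least-squares estimate of f in bulk ≈ f*B + (1 - f)*T, clipped to [0, 1]."""
    diff = [b - t for b, t in zip(b_profile, t_profile)]
    denom = sum(d * d for d in diff)
    if denom == 0:
        raise ValueError("profiles are identical; the fraction is not identifiable")
    num = sum((m - t) * d for m, t, d in zip(bulk, t_profile, diff))
    return min(1.0, max(0.0, num / denom))


# Hypothetical marker-gene expression values (arbitrary units), not from the paper.
b_profile = [100.0, 80.0, 5.0, 2.0]
t_profile = [4.0, 6.0, 90.0, 120.0]

# Simulated bulk sample containing 30% B cells and 70% T cells.
true_fraction = 0.3
bulk = [true_fraction * b + (1 - true_fraction) * t
        for b, t in zip(b_profile, t_profile)]

# Reference profiles that match the bulk sample's technical conditions.
matched = estimate_b_fraction(bulk, b_profile, t_profile)

# Assumed per-gene technical bias, standing in for profiles from another laboratory.
gene_bias = [1.5, 0.7, 1.2, 0.6]
b_shifted = [b * g for b, g in zip(b_profile, gene_bias)]
t_shifted = [t * g for t, g in zip(t_profile, gene_bias)]
mismatched = estimate_b_fraction(bulk, b_shifted, t_shifted)

print(f"true fraction:       {true_fraction:.3f}")
print(f"matched profiles:    {matched:.3f}")
print(f"mismatched profiles: {mismatched:.3f}")
```

With matched profiles the estimate recovers the simulated fraction, while the biased reference profiles pull the estimate away from it. This is the kind of distortion that motivates matching or correcting for technical factors.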
Pages: 24