Analyzing RNA-Seq Data in Complex Study Designs

被引:0
作者
Wei, Xiaoyu [1 ]
Gabriel, Ben [2 ]
Rothman, Alan [2 ]
Wu, Zhijin [1 ]
机构
[1] Brown Univ, Dept Biostat, Providence, RI 02912 USA
[2] Univ Rhode Isl, Inst Immunol & Informat, Providence, RI 02908 USA
关键词
RNA-seq; Genomics; Longitudinal data; Repeated measures; EXPRESSION ANALYSIS;
D O I
10.1007/s12561-024-09446-5
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recently, RNA-seq experiments have become a routine technique in studying the transcriptomic regulations in various biomedical problems. The experimental design has extended from simple two-group comparisons with only a few replicates to complex study designs involving multiple treatment groups and repeated measurements. We present that even when systemic biases are removed by normalization procedures, there are systemic heteroscedasticity that makes observations from some samples noisier than from others. We develop a novel method that accurately estimates the heteroscedasticity. Accounting for it in weighted regression leads to increased true discovery in differential expression (DE) detection in both independent and clustered (longitudinal) samples. We also present a new measure of effect size in DE in longitudinal studies, when the treatment effect is expected to last over a period and peak at different time for different genes. We illustrate the advantage of the proposed method over existing approaches through extensive simulations and demonstrate the application in a real data from a longitudinal study on vaccine efficacy for Dengue fever where a two-dose heterologous prime-boost vaccination regime is utilized.
引用
收藏
页数:22
相关论文
共 21 条
  • [1] NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION
    AKAIKE, H
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) : 716 - 723
  • [2] Differential expression analysis for sequence count data
    Anders, Simon
    Huber, Wolfgang
    [J]. GENOME BIOLOGY, 2010, 11 (10):
  • [3] [Anonymous], 2004, Stati. Applic. Genet. Mol. Biol, DOI [DOI 10.2202/1544-6115.1027, 10.2202/1544-6115.1027]
  • [4] Trendy: segmented regression analysis of expression dynamics in high-throughput ordered profiling experiments
    Bacher, Rhonda
    Leng, Ning
    Chu, Li-Fang
    Ni, Zijian
    Thomson, James A.
    Kendziorski, Christina
    Stewart, Ron
    [J]. BMC BIOINFORMATICS, 2018, 19
  • [5] Bates D, 2014, Arxiv, DOI [arXiv:1406.5823, 10.18637/jss.v067.i01 1406.5823, DOI 10.18637/JSS.V067.I011406.5823]
  • [6] Multimodel inference - understanding AIC and BIC in model selection
    Burnham, KP
    Anderson, DR
    [J]. SOCIOLOGICAL METHODS & RESEARCH, 2004, 33 (02) : 261 - 304
  • [7] Removing technical variability in RNA-seq data using conditional quantile normalization
    Hansen, Kasper D.
    Irizarry, Rafael A.
    WU, Zhijin
    [J]. BIOSTATISTICS, 2012, 13 (02) : 204 - 216
  • [8] Dream: powerful differential expression analysis for repeated measures designs
    Hoffman, Gabriel E.
    Roussos, Panos
    [J]. BIOINFORMATICS, 2021, 37 (02) : 192 - 201
  • [9] voom: precision weights unlock linear model analysis tools for RNA-seq read counts
    Law, Charity W.
    Chen, Yunshun
    Shi, Wei
    Smyth, Gordon K.
    [J]. GENOME BIOLOGY, 2014, 15 (02):
  • [10] EBSeq-HMM: a Bayesian approach for identifying gene-expression changes in ordered RNA-seq experiments
    Leng, Ning
    Li, Yuan
    McIntosh, Brian E.
    Nguyen, Bao Kim
    Duffin, Bret
    Tian, Shulan
    Thomson, James A.
    Dewey, Colin N.
    Stewart, Ron
    Kendziorski, Christina
    [J]. BIOINFORMATICS, 2015, 31 (16) : 2614 - 2622