Methods for time series analysis of RNA-seq data with application to human Th17 cell differentiation

被引:69
作者
Aijo, Tarmo [1 ]
Butty, Vincent [2 ]
Chen, Zhi [3 ,4 ]
Salo, Verna [3 ,4 ]
Tripathi, Subhash [3 ,4 ]
Burge, Christopher B. [2 ]
Lahesmaa, Riitta [3 ,4 ]
Lahdesmaki, Harri [1 ,3 ,4 ]
机构
[1] Aalto Univ, Dept Informat & Comp Sci, FI-00076 Aalto, Finland
[2] MIT, Dept Biol, Cambridge, MA 02139 USA
[3] Univ Turku, Turku Ctr Biotechnol, FI-20520 Turku, Finland
[4] Abo Akad Univ, FI-20520 Turku, Finland
基金
芬兰科学院;
关键词
GENE-EXPRESSION;
D O I
10.1093/bioinformatics/btu274
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Gene expression profiling using RNA-seq is a powerful technique for screening RNA species' landscapes and their dynamics in an unbiased way. While several advanced methods exist for differential expression analysis of RNA-seq data, proper tools to analyze RNA-seq time-course have not been proposed. Results: In this study, we use RNA-seq to measure gene expression during the early human T helper 17 (Th17) cell differentiation and T-cell activation (Th0). To quantify Th17-specific gene expression dynamics, we present a novel statistical methodology, DyNB, for analyzing time-course RNA-seq data. We use non-parametric Gaussian processes to model temporal correlation in gene expression and combine that with negative binomial likelihood for the count data. To account for experiment-specific biases in gene expression dynamics, such as differences in cell differentiation efficiencies, we propose a method to rescale the dynamics between replicated measurements. We develop an MCMC sampling method to make inference of differential expression dynamics between conditions. DyNB identifies several known and novel genes involved in Th17 differentiation. Analysis of differentiation efficiencies revealed consistent patterns in gene expression dynamics between different cultures. We use qRT-PCR to validate differential expression and differentiation efficiencies for selected genes. Comparison of the results with those obtained via traditional timepoint-wise analysis shows that time-course analysis together with time rescaling between cultures identifies differentially expressed genes which would not otherwise be detected.
引用
收藏
页码:113 / 120
页数:8
相关论文
共 24 条
[1]  
Adams R. P., 2009, P 26 ANN INT C MACH, P9, DOI DOI 10.1145/1553374.1553376
[2]   An integrative computational systems biology approach identifies differentially regulated dynamic transcriptome signatures which drive the initiation of human T helper cell differentiation [J].
Aijo, Tarmo ;
Edelman, Sanna M. ;
Lonnberg, Tapio ;
Larjo, Antti ;
Kallionpaa, Henna ;
Tuomela, Soile ;
Engstrom, Emilia ;
Lahesmaa, Riitta ;
Lahdesmaki, Harri .
BMC GENOMICS, 2012, 13
[3]   Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[4]   Phenotypical Characterization of Human Th17 Cells Unambiguously Identified by Surface IL-17A Expression [J].
Brucklacher-Waldert, Verena ;
Steinbach, Karin ;
Lioznov, Michael ;
Kolster, Manuela ;
Hoelscher, Christoph ;
Tolosa, Eva .
JOURNAL OF IMMUNOLOGY, 2009, 183 (09) :5494-5501
[5]  
Conesa A., 2013, EMBNET J, V19, P42, DOI [10.14806/ej.19.A.672, DOI 10.14806/EJ.19.A.672]
[6]   Model-based geostatistics [J].
Diggle, PJ ;
Tawn, JA ;
Moyeed, RA .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 1998, 47 :299-326
[7]   TH 17 cells in development:: an updated view of their molecular identity and genetic programming [J].
Dong, Chen .
NATURE REVIEWS IMMUNOLOGY, 2008, 8 (05) :337-348
[8]  
Gelman A., 1992, Statist. Sci., V7, P457
[9]   The Rac activator Tiam1 controls efficient T-cell trafficking and route of transendothelial migration [J].
Gerard, Audrey ;
van der Kammen, Rob A. ;
Janssen, Hans ;
Ellenbroek, Saskia I. ;
Collard, John G. .
BLOOD, 2009, 113 (24) :6138-6147
[10]   baySeq: Empirical Bayesian methods for identifying differential expression in sequence count data [J].
Hardcastle, Thomas J. ;
Kelly, Krystyna A. .
BMC BIOINFORMATICS, 2010, 11