TTCA: an R package for the identification of differentially expressed genes in time course microarray data

被引:13
作者
Albrecht, Marco [1 ,2 ]
Stichel, Damian [1 ,3 ]
Mueller, Benedikt [4 ]
Merkle, Ruth [5 ,6 ]
Sticht, Carsten [7 ]
Gretz, Norbert [7 ]
Klingmueller, Ursula [5 ,6 ]
Breuhahn, Kai [4 ]
Matthaeus, Franziska [1 ,8 ]
机构
[1] Complex Biol Syst Grp BIOMS IWR, Neuenheimer Feld 294, D-69120 Heidelberg, Germany
[2] Univ Luxembourg, Syst Biol Grp, 7 Ave Swing, L-4367 Belvaux, Luxembourg
[3] German Canc Res Ctr, CCU Neuropathol Grp, Neuenheimer Feld 221, D-69120 Heidelberg, Germany
[4] Univ Heidelberg Hosp, Inst Pathol, Neuenheimer Feld 672, D-69120 Heidelberg, Germany
[5] German Canc Res Ctr, Syst Biol Signal Transduct Grp, Neuenheimer Feld 280, D-69120 Heidelberg, Germany
[6] German Ctr Lung Res DZL, TLRC, Neuenheimer Feld 430, D-69120 Heidelberg, Germany
[7] Heidelberg Univ, Med Fac Mannheim, Med Res Ctr, Theodor Kutzer Ufer 1-3, D-68167 Mannheim, Germany
[8] Goethe Univ Frankfurt, FIAS, Ruth Moufang Str 1, D-60438 Frankfurt, Germany
关键词
Differential expression; Time series; EGF; Stimulation experiments; Gene ontology; Gene set analysis; FUNCTIONAL DATA-ANALYSIS; PATTERNS; NORMALIZATION; BIOCONDUCTOR; COMPONENTS; BIOMART; CYCLE;
D O I
10.1186/s12859-016-1440-8
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The analysis of microarray time series promises a deeper insight into the dynamics of the cellular response following stimulation. A common observation in this type of data is that some genes respond with quick, transient dynamics, while other genes change their expression slowly over time. The existing methods for detecting significant expression dynamics often fail when the expression dynamics show a large heterogeneity. Moreover, these methods often cannot cope with irregular and sparse measurements. Results: The method proposed here is specifically designed for the analysis of perturbation responses. It combines different scores to capture fast and transient dynamics as well as slow expression changes, and performs well in the presence of low replicate numbers and irregular sampling times. The results are given in the form of tables including links to figures showing the expression dynamics of the respective transcript. These allow to quickly recognise the relevance of detection, to identify possible false positives and to discriminate early and late changes in gene expression. An extension of the method allows the analysis of the expression dynamics of functional groups of genes, providing a quick overview of the cellular response. The performance of this package was tested on microarray data derived from lung cancer cells stimulated with epidermal growth factor (EGF). Conclusion: Here we describe a new, efficient method for the analysis of sparse and heterogeneous time course data with high detection sensitivity and transparency. It is implemented as R package TTCA (transcript time course analysis) and can be installed from the Comprehensive R Archive Network, CRAN. The source code is provided with the Additional file 1.
引用
收藏
页数:11
相关论文
共 54 条
[21]   Principal components analysis based methodology to identify differentially expressed genes in time-course microarray data [J].
Jonnalagadda, Sudhakar ;
Srinivasan, Rajagopalan .
BMC BIOINFORMATICS, 2008, 9 (1)
[22]   A Simple Approach to Ranking Differentially Expressed Gene Expression Time Courses through Gaussian Process Regression [J].
Kalaitzis, Alfredo A. ;
Lawrence, Neil D. .
BMC BIOINFORMATICS, 2011, 12
[23]  
Kauffmann Audrey, 2009, Bioinformatics, V25, P415, DOI 10.1093/bioinformatics/btn647
[24]   Analysis of variance for gene expression microarray data [J].
Kerr, MK ;
Martin, M ;
Churchill, GA .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (06) :819-837
[25]  
Koenker R., 2013, R Package Version 5(5)
[26]  
Koenker R., 2005, QUANTILE REGRESSION, DOI [10.1017/CBO9780511754098, DOI 10.1017/CBO9780511754098]
[27]   Tackling the widespread and critical impact of batch effects in high-throughput data [J].
Leek, Jeffrey T. ;
Scharpf, Robert B. ;
Bravo, Hector Corrada ;
Simcha, David ;
Langmead, Benjamin ;
Johnson, W. Evan ;
Geman, Donald ;
Baggerly, Keith ;
Irizarry, Rafael A. .
NATURE REVIEWS GENETICS, 2010, 11 (10) :733-739
[28]   EDGE: extraction and analysis of differential gene expression [J].
Leek, JT ;
Monsen, E ;
Dabney, AR ;
Storey, JD .
BIOINFORMATICS, 2006, 22 (04) :507-508
[29]   Identifying temporally differentially expressed genes through functional principal components analysis [J].
Liu, Xueli ;
Yang, Mark C. K. .
BIOSTATISTICS, 2009, 10 (04) :667-679
[30]   Distance-based differential analysis of gene curves [J].
Minas, Christopher ;
Waddell, Simon J. ;
Montana, Giovanni .
BIOINFORMATICS, 2011, 27 (22) :3135-3141