Identifying Differentially Expressed Genes for Time-course Microarray Data through Functional Data Analysis

被引:4
|
作者
Chen K. [1 ,2 ]
Wang J.-L. [2 ]
机构
[1] Dana-Farber Cancer Institute, Boston
[2] University of California, Davis
基金
美国国家科学基金会;
关键词
Differentially expressed genes; False discovery rate; Functional data analysis; Functional principal component; Hybrid EM; Time-course gene expression;
D O I
10.1007/s12561-010-9024-z
中图分类号
学科分类号
摘要
Identification of differentially expressed (DE) genes across two conditions is a common task with microarray. Most existing approaches accomplish this goal by examining each gene separately based on a model and then control the false discovery rate over all genes. We took a different approach that employs a uniform platform to simultaneously depict the dynamics of the gene trajectories for all genes and select differently expressed genes. A new Functional Principal Component (FPC) approach is developed for time-course microarray data to borrow strength across genes. The approach is flexible as the temporal trajectory of the gene expressions is modeled nonparametrically through a set of orthogonal basis functions, and often fewer basis functions are needed to capture the shape of the gene expression trajectory than existing nonparametric methods. These basis functions are estimated from the data reflecting major modes of variation in the data. The correlation structure of the gene expressions over time is also incorporated without any parametric assumptions and estimated from all genes such that the information across other genes can be shared to infer one individual gene. Estimation of the parameters is carried out by an efficient hybrid EM algorithm. The performance of the proposed method across different scenarios was compared favorably in simulation to two-way mixed-effects ANOVA and the EDGE method using B-spline basis function. Application to the real data on C. elegans developmental stages also suggested that FPC analysis combined with hybrid EM algorithm provides a computationally fast and efficient method for identifying DE genes based on time-course microarray data. © 2010 The Author(s).
引用
收藏
页码:95 / 119
页数:24
相关论文
共 50 条
  • [31] A spline function approach for detecting differentially expressed genes in microarray data analysis
    He, WQ
    BIOINFORMATICS, 2004, 20 (17) : 2954 - 2963
  • [32] Identifying temporally differentially expressed genes through functional principal components analysis
    Liu, Xueli
    Yang, Mark C. K.
    BIOSTATISTICS, 2009, 10 (04) : 667 - 679
  • [33] A Study of HME Model in Time-Course Microarray Data
    Myoung, Sungmin
    Kim, Donggeon
    Jo, Jinnam
    KOREAN JOURNAL OF APPLIED STATISTICS, 2012, 25 (03) : 415 - 422
  • [34] Adaptive thresholds to detect differentially expressed genes in microarray data
    Fukuoka, Yutaka
    Inaoka, Hidenori
    Noshiro, Makoto
    BIOINFORMATION, 2011, 7 (01) : 33 - 37
  • [35] MicroArray data simulator for improved selection of differentially expressed genes
    Singhal, S
    Kyvernitis, CG
    Johnson, SW
    Kaiser, LR
    Liebman, MN
    Albelda, SM
    CANCER BIOLOGY & THERAPY, 2003, 2 (04) : 383 - 391
  • [36] A Comparison of Algorithms to Find Differentially Expressed Genes in Microarray Data
    Ultsch, Alfred
    Pallasch, Christian
    Bergmann, Eckhard
    Christiansen, Holger
    ADVANCES IN DATA ANALYSIS, DATA HANDLING AND BUSINESS INTELLIGENCE, 2010, : 685 - +
  • [37] Clustering of time-course gene expression data using functional data analysis
    Song, Joon Jin
    Lee, Ho-Jin
    Morris, Jeffrey S.
    Kang, Sanghoon
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2007, 31 (04) : 265 - 274
  • [38] Estimating the False Discovery Rate Using Mixed Normal Distribution for Identifying Differentially Expressed Genes in Microarray Data Analysis
    Hirakawa, Akihiro
    Sato, Yasunori
    Sozu, Takashi
    Hamada, Chikuma
    Yoshimura, Isao
    CANCER INFORMATICS, 2007, 3 : 140 - 148
  • [39] Identifying differentially expressed genes in cDNA microarray experiments
    Baggerly, KA
    Coombes, KR
    Hess, KR
    Stivers, DN
    Abruzzo, LV
    Zhang, W
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (06) : 639 - 659
  • [40] Model-based methods for identifying periodically expressed genes based on time course microarray gene expression data
    Luan, Y
    Li, H
    BIOINFORMATICS, 2004, 20 (03) : 332 - 339