Prediction of regulatory gene pairs using dynamic time warping and gene ontology

被引:3
作者
Yang, Andy C. [1 ]
Hsu, Hui-Huang [1 ]
Lu, Ming-Da [1 ]
Tseng, Vincent S. [2 ]
Shih, Timothy K. [3 ]
机构
[1] Tamkang Univ, Dept Comp Sci & Informat Engn, New Taipei City, Taiwan
[2] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
[3] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan, Taiwan
关键词
microarray time series data; missing value imputation; gene regulation prediction; DTW; dynamic time warping; gene ontology; MISSING VALUE ESTIMATION; MICROARRAY DATA; EXPRESSION; ALGORITHMS; IMPUTATION; NETWORKS;
D O I
10.1504/IJDMB.2014.064010
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Selecting informative genes is the most important task for data analysis on microarray gene expression data. In this work, we aim at identifying regulatory gene pairs from microarray gene expression data. However, microarray data often contain multiple missing expression values. Missing value imputation is thus needed before further processing for regulatory gene pairs becomes possible. We develop a novel approach to first impute missing values in microarray time series data by combining k-Nearest Neighbour (KNN), Dynamic Time Warping (DTW) and Gene Ontology (GO). After missing values are imputed, we then perform gene regulation prediction based on our proposed DTW-GO distance measurement of gene pairs. Experimental results show that our approach is more accurate when compared with existing missing value imputation methods on real microarray data sets. Furthermore, our approach can also discover more regulatory gene pairs that are known in the literature than other methods.
引用
收藏
页码:121 / 145
页数:25
相关论文
共 41 条
[21]   Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation [J].
Lord, PW ;
Stevens, RD ;
Brass, A ;
Goble, CA .
BIOINFORMATICS, 2003, 19 (10) :1275-1283
[22]   Estimating Missing Value in Microarray Data Using Fuzzy Clustering and Gene Ontology [J].
Mohammadi, Azadeh ;
Saraee, Mohammad Hossein .
2008 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, PROCEEDINGS, 2008, :382-385
[23]   PERFORMANCE TRADEOFFS IN DYNAMIC TIME WARPING ALGORITHMS FOR ISOLATED WORD RECOGNITION [J].
MYERS, C ;
RABINER, LR ;
ROSENBERG, AE .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (06) :623-635
[24]   A Bayesian missing value estimation method for gene expression profile data [J].
Oba, S ;
Sato, M ;
Takemasa, I ;
Monden, M ;
Matsubara, K ;
Ishii, S .
BIOINFORMATICS, 2003, 19 (16) :2088-2096
[25]   Gaussian mixture clustering and imputation of microarray data [J].
Ouyang, M ;
Welsh, WJ ;
Georgopoulos, P .
BIOINFORMATICS, 2004, 20 (06) :917-923
[26]   CONSIDERATIONS IN DYNAMIC TIME WARPING ALGORITHMS FOR DISCRETE WORD RECOGNITION [J].
RABINER, LR ;
ROSENBERG, AE ;
LEVINSON, SE .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1978, 26 (06) :575-582
[27]   DYNAMIC-PROGRAMMING ALGORITHM OPTIMIZATION FOR SPOKEN WORD RECOGNITION [J].
SAKOE, H ;
CHIBA, S .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1978, 26 (01) :43-49
[28]   Toward accurate dynamic time warping in linear time and space [J].
Salvadora, Stan ;
Chan, Philip .
INTELLIGENT DATA ANALYSIS, 2007, 11 (05) :561-580
[29]   Enhancing Automatic Biological Pathway Generation with GO-based Gene Similarity [J].
Sanfilippo, Antonio ;
Baddeley, Bob ;
Beagley, Nat ;
Riensche, Rick ;
Gopalan, Banu .
2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, :448-+
[30]   Kernel PCA Regression for Missing Data Estimation in DNA Microarray Analysis [J].
Shan, Ying ;
Deng, Guang .
ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009, :1477-1480