Analysis of Temporal High-Dimensional Gene Expression Data for Identifying Informative Biomarker Candidates

被引:2
作者
Lou, Qiang [1 ]
Obradovic, Zoran [1 ]
机构
[1] Temple Univ, Dept Comp & Informat Sci, Ctr Data Analyt & Biomed Informat, Philadelphia, PA 19122 USA
来源
12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012) | 2012年
关键词
high dimensional; temporal data; feature selection; margin; multivariate time series data;
D O I
10.1109/ICDM.2012.92
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identifying informative biomarkers from a large pool of candidates is the key step for accurate prediction of an individual's health status. In clinical applications traditional static feature selection methods that flatten the temporal data cannot be directly applied since the patient's observed clinical condition is a temporal multivariate time series where different variables can capture various stages of temporal change in the patient's health status. In this study, in order to identify informative genes in temporal microarray data, a margin based feature selection filter is proposed. The proposed method is based on well-established machine learning techniques without any assumptions about the data distribution. The objective function of temporal margin-based feature selection is defined to maximize each subject's temporal margin in its own relevant subspace. In the objective function, the uncertainty in calculating nearest neighbors is taken into account by considering the change in feature weights in each iteration. A fixed-point gradient descent method is proposed to solve the formulated objective function. The experimental results on both synthetic and real data provide evidence that the proposed method can identify more informative features than the alternatives that flatten the temporal data in advance.
引用
收藏
页码:996 / 1001
页数:6
相关论文
共 13 条
[1]  
Chen B., 2010, BMC BIOINFORMATICS
[2]  
Chen M., 2011, J AM STAT ASS, V106
[3]   Research on collaborative negotiation for e-commerce. [J].
Feng, YQ ;
Lei, Y ;
Li, Y ;
Cao, RZ .
2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, :2085-2088
[4]  
Gilad-Bachrach R., 2004, PROC 21 INT C MACH L, P43
[5]   Wrappers for feature subset selection [J].
Kohavi, R ;
John, GH .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :273-324
[6]  
Lou Q, 2012, IEEE INT C BIOINF BI
[7]  
Lou Q., 2012, P AAAI C ART INT TOR, VVolume 26
[8]  
Margaritis D., 1999, NEURAL INFORM PROCES
[9]  
Shen J., 2008, P AAAI C ART INT AAA
[10]  
Song L., 2007, P 24 INT C MACH LEAR