Hybrid Feature Selection Method using Gene Expression Data

被引:0
作者
Chuang, Li-Yeh [1 ]
Wu, Kuo-Chuan [2 ]
Yang, Cheng-Hong [3 ]
机构
[1] I Shou Univ, Dept Chem Engn, Kaohsiung, Taiwan
[2] Natl Kaohsiung Univ Appl Sci, Comp Sci & Informat Engn, Kaohsiung 80778, Taiwan
[3] Natl Kaohsiung Univ Appl Sci, Elect Engn, Kaohsiung 80778, Taiwan
来源
2008 IEEE CONFERENCE ON SOFT COMPUTING IN INDUSTRIAL APPLICATIONS SMCIA/08 | 2009年
关键词
FEATURE SUBSET-SELECTION; MICROARRAY DATA; CLASSIFICATION; ALGORITHMS; CLASSIFIERS; MACHINE;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Gene expression profiles, which represent the state of a cell at a molecular level, have great potential as a medical diagnosis tool. Compared to the number of genes involved available training data sets generally have a fairly small sample size in cancer type classification. These training data limitations constitute a challenge to certain classification methodologies. The gene (feature) selection can extract genes which influence classification accuracy effectively, to eliminate the useless genes, and to improve the calculate performance and the classification accuracy. This paper presents hybrid feature selection method Taguchi-Genetic algorithm to rind optimal feature subset, to appraise feature set using K-nearest neighbor with leave-one-out cross-validation based on Euclidean distance calculation. Experimental results show that our method simplifies features effectively and obtains a higher classification accuracy compared to other classification methods from the literature.
引用
收藏
页码:199 / +
页数:2
相关论文
共 24 条
[1]  
[Anonymous], 1991, NEAREST NEIGHB NORMS
[2]   USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING [J].
BATTITI, R .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04) :537-550
[3]   Efficient leave-one-out cross-validation of kernel Fisher discriminant classifiers [J].
Cawley, GC ;
Talbot, NLC .
PATTERN RECOGNITION, 2003, 36 (11) :2585-2592
[4]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[5]   Data mining in bioinformatics using Weka [J].
Frank, E ;
Hall, M ;
Trigg, L ;
Holmes, G ;
Witten, IH .
BIOINFORMATICS, 2004, 20 (15) :2479-2481
[6]  
Holland J.H., 1992, ADAPTATION NATURE AR, DOI 10.7551/mitpress/1090.001.0001
[7]   Selecting a minimal number of relevant genes from microarray data to design accurate tissue classifiers [J].
Huang, Hui-Ling ;
Lee, Chong-Cheng ;
Ho, Shinn-Ying .
BIOSYSTEMS, 2007, 90 (01) :78-86
[8]   Filter versus wrapper gene selection approaches in DNA microarray domains [J].
Inza, I ;
Larrañaga, P ;
Blanco, R ;
Cerrolaza, AJ .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2004, 31 (02) :91-103
[9]   Toward integrating feature selection algorithms for classification and clustering [J].
Liu, H ;
Yu, L .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (04) :491-502
[10]  
NARENDRA P, 1977, IEEE T COMPUT, V26, P917, DOI 10.1109/TC.1977.1674939