Integrating Biological Information for Feature Selection in Microarray Data Classification

被引:5
作者
Fang, Ong Huey [1 ]
Mustapha, Norwati [1 ]
Sulaiman, Md. Nasir [1 ]
机构
[1] Univ Putra Malaysia, Dept Comp Sci, Serdang, Malaysia
来源
2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: ICCEA 2010, PROCEEDINGS, VOL 2 | 2010年
关键词
Data Mining; Feature Selection; Microarray; Classification;
D O I
10.1109/ICCEA.2010.215
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Due to the high dimensionality of microarray data, feature selection is an indispensable task in classification to identify a smaller subset of relevant genes. However, feature selection techniques that consider solely on gene expression values might not be able to identify biologically meaningful genes. Thus, this paper presents an integrative feature selection method that is able to incorporate gene expression data with additional biological data for finding informative genes. The proposed approach is a two-stage method that combined the strength of both filter method and association analysis. The experimental results show that the selected gene subsets are able to improve classification accuracy.
引用
收藏
页码:330 / 334
页数:5
相关论文
共 22 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[3]  
Cheng-San Yang, 2008, IAENG International Journal of Computer Science, V35, P285
[4]  
Cios K., 2007, Data Mining A Knowledge Discovery
[5]  
FAYYAD, 1993, P INT JOINT C UNC AI, P1022
[6]  
Fellenberg Matthias, 2003, Biosilico, V1, P177, DOI 10.1016/S1478-5382(03)02372-2
[7]  
Han J., 2012, Data Mining, P393, DOI [DOI 10.1016/B978-0-12-381479-1.00009-5, 10.1016/B978-0-12-381479-1.00009-5]
[8]   An expert system to classify microarray gene expression data using gene selection by decision tree [J].
Horng, Jorng-Tzong ;
Wu, Li-Cheng ;
Liu, Baw-Juine ;
Kuo, Jun-Li ;
Kuo, Wen-Horng ;
Zhang, Jin-Jian .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (05) :9072-9081
[9]   From genomics to chemical genomics: new developments in KEGG [J].
Kanehisa, Minoru ;
Goto, Susumu ;
Hattori, Masahiro ;
Aoki-Kinoshita, Kiyoko F. ;
Itoh, Masumi ;
Kawashima, Shuichi ;
Katayama, Toshiaki ;
Araki, Michihiro ;
Hirakawa, Mika .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D354-D357
[10]   An integrated algorithm for gene selection and classification applied to microarray data of ovarian cancer [J].
Lee, Zne-Jung .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2008, 42 (01) :81-93