The application of ant colony optimization for gene selection in microarray-based cancer classification

被引:12
作者
Chiang, Yu-Min
Chiang, Huei-Min
Lin, Shang-Yi
机构
来源
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7 | 2008年
关键词
microarray; gene expression; ACO; data mining;
D O I
10.1109/ICMLC.2008.4621102
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
DNA microarrays technology enables us to obtain information about expression levels of thousands of genes at the same time. This technology promises to monitor the whole genome on a single chip so that researchers can have a better picture of the interactions among thousands of genes at the same time. It becomes a challenge to extract information from the large amount of data through data mining. One important application of gene expression microarray data is cancer classification. However, gene expression data collected for cancer classification usually has the property of the number of genes far exceeding the number of samples. Work in such a high dimensional space is extremely difficult. Previous researches have used two-stage classification method to deal with the gene expression data. Such approaches select genes to reduce problem dimension in the first stage and classify tumors in the second stage. In the study, the ant colony optimization (ACO) algorithm is introduced to select genes relevant to cancers first, then the multi-layer perceptrons (MLP) neural network and support vector machine (SVM) classifiers are used for cancer classification. Experimental results show that selecting genes by using ACO algorithm can improve the accuracy of BP and SVM classifiers. The optimal number of genes selected for cancer classification should be set according to both the microarray dataset and gene selection methods.
引用
收藏
页码:4001 / 4006
页数:6
相关论文
共 17 条
[1]   An epicurean learning approach to gene-expression data classification [J].
Albrecht, A ;
Vinterbo, SA ;
Ohno-Machado, L .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2003, 28 (01) :75-87
[2]   Effective dimension reduction methods for tumor classification using gene expression data [J].
Antoniadis, A ;
Lambert-Lacroix, S ;
Leblanc, F .
BIOINFORMATICS, 2003, 19 (05) :563-570
[3]   Data mining for gene expression profiles from DNA, microarray [J].
Cho, SB ;
Won, HH .
INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2003, 13 (06) :593-608
[4]   Ant algorithms and stigmergy [J].
Dorigo, M ;
Bonabeau, E ;
Theraulaz, G .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2000, 16 (08) :851-871
[5]   Ant system: Optimization by a colony of cooperating agents [J].
Dorigo, M ;
Maniezzo, V ;
Colorni, A .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1996, 26 (01) :29-41
[6]   Comparison of discrimination methods for the classification of tumors using gene expression data [J].
Dudoit, S ;
Fridlyand, J ;
Speed, TP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) :77-87
[7]   Support vector machine classification and validation of cancer tissue samples using microarray expression data [J].
Furey, TS ;
Cristianini, N ;
Duffy, N ;
Bednarski, DW ;
Schummer, M ;
Haussler, D .
BIOINFORMATICS, 2000, 16 (10) :906-914
[8]   Efficient selection of discriminative genes from microarray gene expression data for cancer diagnosis [J].
Huang, D ;
Chow, TWS ;
Ma, EWM ;
Li, JY .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2005, 52 (09) :1909-1918
[9]  
Keedwell E, 2005, INTELLIGENT BIOINFORMATICS: THE APPLICATION OF ARTIFICIAL INTELLIGENCE TECHNIQUES TO BIOINFORMATICS PROBLEMS, P1, DOI 10.1002/0470015721
[10]   Gene selection: a Bayesian variable selection approach [J].
Lee, KE ;
Sha, NJ ;
Dougherty, ER ;
Vannucci, M ;
Mallick, BK .
BIOINFORMATICS, 2003, 19 (01) :90-97