Glowworm Swarm Based Informative Attribute Selection Using Support Vector Machines for Simultaneous Feature Selection and Classification

Cited by: 6
Authors
Gurav, Aniket [1]
Nair, Vinay [1]
Gupta, Utkarsh [1,2]
Valadi, Jayaraman [1,3]
Affiliations
[1] Ctr Dev Adv Comp, Evolutionary Comp & Image Proc Grp, Pune 411007, Maharashtra, India
[2] NITK, Dept Informat Technol, Mangalore 575025, India
[3] Shiv Nadar Univ, Ctr Informat, Dadri 203207, Uttar Pradesh, India
Source
SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, SEMCCO 2014 | 2015 / Vol. 8947
Keywords
CANCER; IDENTIFICATION; PREDICTION;
DOI
10.1007/978-3-319-20294-5_3
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we propose a hybrid filter-wrapper algorithm, GSO-Infogain, for simultaneous feature selection and classification, aimed at improving classification accuracy. GSO-Infogain employs the Glowworm Swarm Optimization (GSO) algorithm with a Support Vector Machine (SVM) as its internal learning algorithm and uses feature ranking based on information gain as a heuristic. The GSO algorithm randomly generates a population of worms, each of which is a candidate subset of features. The fitness of each candidate solution, which is evaluated using the SVM, is encoded in its luciferin value. Each worm probabilistically moves towards the worm with the highest luciferin value in its neighbourhood. In the process, the worms explore the feature space and eventually converge to the global optimum. We have evaluated the performance of the hybrid algorithm for feature selection on a set of cancer datasets. We obtain a classification accuracy in the range of 94-98% for these datasets, which is comparable to the best results from other classification algorithms. We further tested the robustness of GSO-Infogain by evaluating its performance on the CoEPrA training and test datasets. GSO-Infogain performs well in this experiment too, giving similar prediction accuracies on the training and test datasets, which indicates its robustness.
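The loop described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say how information gain enters the search, so here it is assumed to bias the initial feature subsets; the neighbourhood is assumed to be a Hamming-distance radius over binary feature masks; and the fitness is a pluggable function, where the paper uses SVM classification accuracy and the example uses a toy stand-in. All function names, parameter names, and default values (`gso_select`, `rho`, `gamma`, `radius`, and so on) are hypothetical.

```python
import math
import random
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(column, labels):
    """Information gain of one discrete feature column w.r.t. the labels."""
    n = len(labels)
    groups = {}
    for value, label in zip(column, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - conditional

def hamming(a, b):
    """Number of positions at which two feature masks differ."""
    return sum(x != y for x, y in zip(a, b))

def gso_select(gains, fitness, n_worms=20, n_iter=30,
               rho=0.4, gamma=0.6, radius=None, seed=0):
    """Binary glowworm-swarm search over feature masks (illustrative sketch).

    gains   : one information-gain score per feature (the filter heuristic)
    fitness : callable mapping a 0/1 mask to a score; assumed to stand in
              for the SVM accuracy used as the wrapper in the paper
    """
    rng = random.Random(seed)
    n_features = len(gains)
    if radius is None:
        radius = max(1, n_features // 2)
    max_gain = max(gains) or 1.0
    # Assumption: higher-gain features are more likely to start switched on.
    worms = [[1 if rng.random() < 0.2 + 0.6 * g / max_gain else 0 for g in gains]
             for _ in range(n_worms)]
    luciferin = [0.0] * n_worms
    best, best_fit = None, float("-inf")
    for _ in range(n_iter):
        # Luciferin update: decay the old value, then add the scaled fitness.
        for i, worm in enumerate(worms):
            f = fitness(worm)
            luciferin[i] = (1 - rho) * luciferin[i] + gamma * f
            if f > best_fit:
                best, best_fit = worm[:], f
        moved_worms = []
        for i, worm in enumerate(worms):
            # Brighter worms within the Hamming radius are candidate targets.
            nbrs = [j for j in range(n_worms)
                    if j != i and luciferin[j] > luciferin[i]
                    and hamming(worm, worms[j]) <= radius]
            moved = worm[:]
            if nbrs:
                # Probabilistic choice, weighted by luciferin difference.
                weights = [luciferin[j] - luciferin[i] for j in nbrs]
                target = worms[rng.choices(nbrs, weights=weights)[0]]
                diff = [k for k in range(n_features) if worm[k] != target[k]]
                if diff:
                    k = rng.choice(diff)  # step one bit towards the target
                    moved[k] = target[k]
            moved_worms.append(moved)
        worms = moved_worms
    return best, best_fit

# Toy usage: the fitness rewards features 0 and 1 and penalizes subset size.
toy_gains = [0.9, 0.8, 0.1, 0.05, 0.0, 0.2]

def toy_fitness(mask):
    return mask[0] + mask[1] - 0.1 * sum(mask)

best_mask, best_score = gso_select(toy_gains, toy_fitness)
```

In a real wrapper setting, `toy_fitness` would be replaced by a function that trains and validates an SVM on the columns selected by the mask, and `toy_gains` by `information_gain` scores computed on discretized features. Note that this sketch selects a target among all brighter neighbours in proportion to the luciferin difference, as is usual in GSO, rather than always moving towards the single brightest neighbour.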
Pages: 27-37
Number of pages: 11