Cancer classification using a novel gene selection approach by means of shuffling based on data clustering with optimization

被引:39
作者
Elyasigomari, V. [1 ]
Mirjafari, M. S. [1 ]
Screen, H. R. C. [1 ]
Shaheed, M. H. [1 ]
机构
[1] Univ London, Sch Engn & Mat Sci, London E1 4NS, England
关键词
Cancer classification; Gene selection; Clustering; Evolutionary algorithms; Cuckoo optimization algorithm; COA-GA; SUPPORT VECTOR MACHINE; PARTICLE SWARM; MOLECULAR CLASSIFICATION; CLASS PREDICTION; MICROARRAY; DISCOVERY;
D O I
10.1016/j.asoc.2015.06.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This research presents an innovative method for cancer identification and type classification using microarray data. The method is based on gene selection with shuffling in association with optimization based unconventional data clustering. A new hybrid optimization algorithm, COA-GA, is developed by synergizing recently invented Cuckoo Optimization Algorithm (COA) with a more traditional genetic algorithm (GA) for data clustering to select the most dominant genes using shuffling. For gene classification, Support Vector Machine (SVM) and Multilayer Perceptron (MLP) artificial neural networks are used. Literature suggests that data clustering using traditional approaches such as K-means, C-means and Hierarchical do not have any impact on classification accuracy. This is also confirmed in this investigation. However, results show that optimization based clustering with shuffling increase the classification accuracy significantly. The proposed algorithm (COA-GA) not only outperforms COA, GA and Particle Swarm optimization (PSO) in achieving better classification performance but also reaches a better global minimum with only few iterations. Higher accuracy is observed to have achieved with SVM classifier compared to MLP in all datasets used. Crown Copyright (C) 2015 Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:43 / 51
页数:9
相关论文
共 44 条
[1]  
Aghaei A, 2013, INT J COMPUT APPL, V61, P22
[2]   Microarray data analysis: from disarray to consolidation and consensus [J].
Allison, DB ;
Cui, XQ ;
Page, GP ;
Sabripour, M .
NATURE REVIEWS GENETICS, 2006, 7 (01) :55-65
[3]   Selection bias in gene extraction on the basis of microarray gene-expression data [J].
Ambroise, C ;
McLachlan, GJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) :6562-6566
[4]  
[Anonymous], 2001, SWARM INTELL-US
[5]  
[Anonymous], FEATURE WEIGHTING CL
[6]   Gene expression profile classification: A review [J].
Asyali, Musa H. ;
Colak, Dilek ;
Demirkaya, Omer ;
Inan, Mehmet S. .
CURRENT BIOINFORMATICS, 2006, 1 (01) :55-73
[7]  
BALDI P, 2002, DNA MICROARRAYS GENE
[8]   Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments [J].
Breitling, R ;
Armengaud, P ;
Amtmann, A ;
Herzyk, P .
FEBS LETTERS, 2004, 573 (1-3) :83-92
[9]  
Chen S.-H., 2002, GENETIC ALGORITHMS G
[10]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482