Breast Cancer Diagnosis using a Hybrid Genetic Algorithm for Feature Selection based on Mutual Information

被引:16
作者
Alzubaidi, Abeer [1 ]
Cosma, Georgina [1 ]
Brown, David [1 ]
Pockley, A. Graham [2 ]
机构
[1] Nottingham Trent Univ, Sch Sci & Technol, Nottingham, England
[2] Nottingham Trent Univ, Sch Sci & Technol, John van Geest Canc Res Ctr, Nottingham, England
来源
2016 9TH INTERNATIONAL CONFERENCE ON INTERACTIVE TECHNOLOGIES AND GAMES (ITAG) | 2016年
关键词
Genetic Algorithm; Feature Selection; Cancer Diagnosis; Mutual Information; Predictive Modelling; SYSTEM;
D O I
10.1109/iTAG.2016.18
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Feature Selection is the process of selecting a subset of relevant features (i.e. predictors) for use in the construction of predictive models. This paper proposes a hybrid feature selection approach to breast cancer diagnosis which combines a Genetic Algorithm (GA) with Mutual Information (MI) for selecting the best combination of cancer predictors, with maximal discriminative capability. The selected features are then input into a classifier to predict whether a patient has breast cancer. Using a publicly available breast cancer dataset, experiments were performed to evaluate the performance of the Genetic Algorithm based on the Mutual Information approach with two different machine learning classifiers, namely the k-Nearest Neighbor (K-NN), and Support vector machine (SVM), each tuned using different distance measures and kernel functions, respectively. The results revealed that the proposed hybrid approach is highly accurate for predicting breast cancer, and it is very promising for predicting other cancers using clinical data.
引用
收藏
页码:70 / 76
页数:7
相关论文
共 27 条
[1]   A New Expert System for Diagnosis of Lung Cancer: GDA-LS_SVM [J].
Avci, Engin .
JOURNAL OF MEDICAL SYSTEMS, 2012, 36 (03) :2005-2009
[2]   USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING [J].
BATTITI, R .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04) :537-550
[3]   Model Comparison for Breast Cancer Prognosis Based on Clinical Data [J].
Boughorbel, Sabri ;
Al-Ali, Rashid ;
Elkum, Naser .
PLOS ONE, 2016, 11 (01)
[4]  
Brown G, 2012, J MACH LEARN RES, V13, P27
[5]   Prediction of Pathological Stage in Patients with Prostate Cancer: A Neuro-Fuzzy Model [J].
Cosma, Georgina ;
Acampora, Giovanni ;
Brown, David ;
Rees, Robert C. ;
Khan, Masood ;
Pockley, A. Graham .
PLOS ONE, 2016, 11 (06)
[6]   A Hybrid Automatic System for the Diagnosis of Lung Cancer Based on Genetic Algorithm and Fuzzy Extreme Learning Machines [J].
Daliri, Mohammad Reza .
JOURNAL OF MEDICAL SYSTEMS, 2012, 36 (02) :1001-1005
[7]  
El Akadi A, 2008, INT J COMPUT SCI NET, V8, P116
[8]   Normalized Mutual Information Feature Selection [J].
Estevez, Pablo. A. ;
Tesmer, Michel ;
Perez, Claudio A. ;
Zurada, Jacek A. .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (02) :189-201
[9]   The Effect of Artificial Neural Network Model Combined with Six Tumor Markers in Auxiliary Diagnosis of Lung Cancer [J].
Feng, Feifei ;
Wu, Yiming ;
Wu, Yongjun ;
Nie, Guangjin ;
Ni, Ran .
JOURNAL OF MEDICAL SYSTEMS, 2012, 36 (05) :2973-2980
[10]  
FERRI FJ, 1994, MACH INTELL PATT REC, V16, P403