Using support vector machines with a novel hybrid feature selection method for diagnosis of erythemato-squamous diseases

Cited by: 99
Authors
Xie, Juanying [1, 2]
Wang, Chunxia [2, 3]
Affiliations
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[2] Shaanxi Normal Univ, Sch Comp Sci, Xian 710062, Peoples R China
[3] Gansu Inst Mech & Elect, Tianshui 741001, Peoples R China
Keywords
Support vector machines (SVM); Feature selection; Sequential forward search (SFS); Erythemato-squamous diseases; DECISION TREE CLASSIFIER; SIMILARITY CLASSIFIER; MEDICAL DATA; PREDICTION; FUZZY;
DOI
10.1016/j.eswa.2010.10.050
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
In this paper, we develop a diagnosis model based on support vector machines (SVM) with a novel hybrid feature selection method to diagnose erythemato-squamous diseases. The proposed hybrid feature selection method, named improved F-score and Sequential Forward Search (IFSFS), combines the advantages of filter and wrapper methods to select the optimal feature subset from the original feature set. In IFSFS, we extend the original F-score, which measures the discrimination between two sets of real numbers, so that it measures the discrimination among more than two sets of real numbers. The improved F-score and Sequential Forward Search (SFS) are combined to find the optimal feature subset during feature selection: the improved F-score serves as the evaluation criterion of the filter method, and SFS serves as the evaluation system of the wrapper method. The best parameters of the SVM kernel function are found by grid search. Experiments were conducted on different training-test partitions of the erythemato-squamous diseases dataset taken from the UCI (University of California, Irvine) machine learning database. The experimental results show that the proposed SVM-based model with IFSFS achieves 98.61% classification accuracy with a subset of 21 features. From these results, we conclude that our method is very promising compared with previously reported results. © 2010 Elsevier Ltd. All rights reserved.
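The filter-plus-wrapper idea described in the abstract can be sketched roughly as follows. The abstract does not state the improved F-score formula, so the version here is an assumption: a common multi-class generalization that divides the sum of squared deviations of the class means from the overall mean by the sum of the within-class sample variances. The function names (`multiclass_f_score`, `rank_features`, `forward_select`) and the `evaluate` callback are hypothetical and are not taken from the paper.

```python
from statistics import mean, variance


def multiclass_f_score(values, labels):
    """Score one feature: between-class scatter over within-class variance.

    Assumed formula (not given in the abstract). Each class needs at
    least two samples because the sample variance uses 1 / (n_j - 1).
    """
    overall_mean = mean(values)
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(label, []).append(value)
    between = sum((mean(g) - overall_mean) ** 2 for g in groups.values())
    within = sum(variance(g) for g in groups.values())
    # A feature with zero within-class variance is treated as maximally
    # discriminative in this sketch.
    return between / within if within > 0 else float("inf")


def rank_features(X, y):
    """Filter step: return feature indices sorted by descending score."""
    n_features = len(X[0])
    scores = [
        multiclass_f_score([row[i] for row in X], y) for i in range(n_features)
    ]
    return sorted(range(n_features), key=lambda i: scores[i], reverse=True)


def forward_select(ranked, evaluate):
    """Wrapper step: add features in rank order, keep the best subset.

    `evaluate` is a caller-supplied function mapping a list of feature
    indices to a score, for example held-out classifier accuracy.
    """
    best_subset, best_score = [], float("-inf")
    current = []
    for index in ranked:
        current.append(index)
        score = evaluate(current)
        if score > best_score:
            best_subset, best_score = list(current), score
    return best_subset, best_score


# Toy data: feature 0 separates the three classes, feature 1 does not.
X = [[1.0, 5.0], [1.2, 1.0], [3.0, 4.0], [3.2, 2.0], [5.0, 5.5], [5.2, 0.5]]
y = [0, 0, 1, 1, 2, 2]
ranked = rank_features(X, y)
```

In a fuller experiment, `evaluate` would plausibly train an SVM on the candidate subset and return its accuracy on held-out data, with the kernel parameters chosen by grid search (for instance with scikit-learn's `GridSearchCV` wrapped around `SVC`). That wiring is left out here to keep the sketch dependency-free.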
Pages: 5809-5815
Page count: 7