A novel intelligent classification model for breast cancer diagnosis

被引:88
作者
Liu, Na [1 ,2 ]
Qi, Er-Shi [1 ]
Xu, Man [3 ]
Gao, Bo [4 ]
Liu, Gui-Qiu [5 ]
机构
[1] Tianjin Univ, Coll Management & Econ, Tianjin 300072, Peoples R China
[2] Shihezi Univ, Sch Mech & Elect Engn, Shihezi 832000, Peoples R China
[3] Nankai Univ, Sch Business, Tianjin 300071, Peoples R China
[4] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Anhui, Peoples R China
[5] Tianjin Med Univ, Key Lab Artificial Cell, Dept Pathol, Cent Hosp 3, Tianjin 300170, Peoples R China
基金
中国国家自然科学基金;
关键词
Support vector machine; Breast cancer diagnosis; Information gain; Cost-sensitive learning; Genetic algorithm; SUPPORT VECTOR MACHINE; FEATURE-SELECTION; GENETIC ALGORITHM; INFORMATION GAIN; ENSEMBLE; HYBRID; OPTIMIZATION;
D O I
10.1016/j.ipm.2018.10.014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Breast cancer is one of the leading causes of death among women worldwide. Accurate and early detection of breast cancer can ensure long-term surviving for the patients. However, traditional classification algorithms usually aim only to maximize the classification accuracy, failing to take into consideration the misclassification costs between different categories. Furthermore, the costs associated with missing a cancer case (false negative) are dearly much higher than those of mislabeling a benign one (false positive). To overcome this drawback and further improving the classification accuracy of the breast cancer diagnosis, in this work, a novel breast cancer intelligent diagnosis approach has been proposed, which employed information gain directed simulated annealing genetic algorithm wrapper (IGSAGAW) for feature selection, in this process, we performs the ranking of features according to IG algorithm, and extracting the top m optimal feature utilized the cost sensitive support vector machine (CSSVM) learning algorithm. Our proposed feature selection approach which can not only help to reduce the complexity of SAGASW algorithm and effectively extracting the optimal feature subset to a certain extent, but it can also obtain the maximum classification accuracy and minimum misclassification cost. The efficacy of our proposed approach is tested on Wisconsin Original Breast Cancer (WBC) and Wisconsin Diagnostic Breast Cancer (WDBC) breast cancer data sets, and the results demonstrate that our proposed hybrid algorithm outperforms other comparison methods. The main objective of this study was to apply our research in real clinical diagnostic system and thereby assist clinical physicians in making correct and effective decisions in the future, Moreover our proposed method could also be applied to other illness diagnosis.
引用
收藏
页码:609 / 623
页数:15
相关论文
共 43 条
[1]   Global optimization of case-based reasoning for breast cytology diagnosis [J].
Ahn, Hyunchul ;
Kim, Kyoung-jae .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (01) :724-734
[2]   Support vector machines combined with feature selection for breast cancer diagnosis [J].
Akay, Mehmet Fatih .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :3240-3247
[3]   Applying support vector machines to imbalanced datasets [J].
Akbani, R ;
Kwek, S ;
Japkowicz, N .
MACHINE LEARNING: ECML 2004, PROCEEDINGS, 2004, 3201 :39-50
[4]  
American Cancer Society, 2016, CANC FACTS FIG
[5]   Breast cancer diagnosis using Genetically Optimized Neural Network model [J].
Bhardwaj, Arpit ;
Tiwari, Aruna .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (10) :4611-4620
[6]  
Chang C C C, 2011, LIBSVM: a library for support vector machines
[7]   A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis [J].
Chen, Hui-Ling ;
Yang, Bo ;
Liu, Jie ;
Liu, Da-You .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (07) :9014-9022
[8]  
Cristianini N., 2000, INTRO SUPPORT VECTOR, DOI DOI 10.1017/CBO9780511801389
[9]   Energy-efficient scheduling for a flexible flow shop using an improved genetic-simulated annealing algorithm [J].
Dai, Min ;
Tang, Dunbing ;
Giret, Adriana ;
Salido, Miguel A. ;
Li, W. D. .
ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2013, 29 (05) :418-429
[10]   A novel hybrid genetic algorithm with granular information for feature selection and optimization [J].
Dong, Hongbin ;
Li, Tao ;
Ding, Rui ;
Sun, Jing .
APPLIED SOFT COMPUTING, 2018, 65 :33-46