A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis

Citations: 252
Authors
Chen, Hui-Ling [1, 2]
Yang, Bo [1, 2]
Liu, Jie [1, 2]
Liu, Da-You [1, 2]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[2] Jilin Univ, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Breast cancer diagnosis; Rough set theory; Support vector machines; Feature selection; SYSTEM; RULES;
DOI
10.1016/j.eswa.2011.01.120
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Breast cancer is becoming a leading cause of death among women worldwide, while it is confirmed that early detection and accurate diagnosis of the disease can ensure long survival of patients. Expert systems and machine learning techniques are gaining popularity in this field because of their effective classification and high diagnostic capability. In this paper, a rough set (RS) based support vector machine classifier (RS_SVM) is proposed for breast cancer diagnosis. In the proposed method, an RS reduction algorithm is employed as a feature selection tool to remove redundant features and thereby further improve the diagnostic accuracy of the SVM. The effectiveness of RS_SVM is examined on the Wisconsin Breast Cancer Dataset (WBCD) using classification accuracy, sensitivity, specificity, the confusion matrix, and receiver operating characteristic (ROC) curves. Experimental results demonstrate that the proposed RS_SVM not only achieves very high classification accuracy but also detects a combination of five informative features, which can give physicians an important clue for breast cancer diagnosis. (C) 2011 Elsevier Ltd. All rights reserved.
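The abstract does not say which RS reduction algorithm is used, so the following is only a minimal sketch of the general idea, assuming a greedy QuickReduct-style search over a discretized decision table. The function names (`partition`, `dependency`, `quick_reduct`, `diagnostic_metrics`) and the toy rows are hypothetical and are not taken from the paper or from WBCD; the SVM training step, which would run on the selected columns only, is omitted. The last function shows how accuracy, sensitivity, and specificity follow from the confusion matrix counts.

```python
from collections import defaultdict

def partition(rows, attrs):
    """Group row indices into equivalence classes by their values on attrs."""
    blocks = defaultdict(list)
    for i, row in enumerate(rows):
        blocks[tuple(row[a] for a in attrs)].append(i)
    return list(blocks.values())

def dependency(rows, labels, attrs):
    """Rough-set dependency degree: share of rows in the positive region,
    i.e. rows whose equivalence class carries a single decision label."""
    consistent = 0
    for block in partition(rows, attrs):
        if len({labels[i] for i in block}) == 1:
            consistent += len(block)
    return consistent / len(rows)

def quick_reduct(rows, labels):
    """Greedy search (an assumption, not the paper's stated algorithm):
    keep adding the attribute that raises the dependency degree the most
    until it matches the degree obtained with all attributes."""
    all_attrs = list(range(len(rows[0])))
    target = dependency(rows, labels, all_attrs)
    reduct = []
    while dependency(rows, labels, reduct) < target:
        candidates = [a for a in all_attrs if a not in reduct]
        best = max(candidates,
                   key=lambda a: dependency(rows, labels, reduct + [a]))
        reduct.append(best)
    return sorted(reduct)

def diagnostic_metrics(y_true, y_pred, positive="malignant"):
    """Accuracy, sensitivity, and specificity from confusion matrix counts."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),  # true positive rate
        "specificity": tn / (tn + fp),  # true negative rate
    }

# Invented toy decision table: four discretized attributes per case.
rows = [
    (1, 0, 0, 5),
    (1, 1, 1, 5),
    (2, 1, 0, 5),
    (2, 0, 1, 5),
    (3, 1, 0, 5),
    (3, 0, 0, 5),
]
labels = ["benign", "malignant", "malignant",
          "malignant", "benign", "benign"]

selected = quick_reduct(rows, labels)
print("selected attribute indices:", selected)

# Invented predictions, only to exercise the metric definitions.
y_true = ["malignant"] * 4 + ["benign"] * 6
y_pred = ["malignant"] * 3 + ["benign"] * 6 + ["malignant"]
print(diagnostic_metrics(y_true, y_pred))
```

In this sketch the constant fourth attribute is never selected because it cannot change the dependency degree, which is the sense in which a reduct removes redundant features before classification.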
Pages: 9014-9022
Page count: 9