WCBA: Weighted classification based on association rules algorithm for breast cancer disease

Cited by: 67
Authors
Alwidian, Jaber [1]
Hammo, Bassam H. [1]
Obeid, Nadim [1]
Affiliations
[1] Univ Jordan, King Abdullah II Sch Informat Technol, Amman, Jordan
Keywords
Data mining; Association classification; Association rules; Apriori; Breast cancer;
DOI
10.1016/j.asoc.2017.11.013
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Breast cancer is the second most frequent human neoplasm and accounts for one quarter of all cancers in females. Among cancer types, it is considered the main cause of death in women in most countries. An efficient classifier that helps physicians predict this chronic disease accurately is in high demand. Many scholars have tackled this problem with Association Classification (AC) techniques, which enhance the classification process by applying association rules. However, most AC algorithms suffer from the estimated measures used in the rule evaluation process and from the prioritization techniques used at the attribute level, which could play a critical role in the rule generation process. In this article we attempt to solve this problem through an efficient weighted classification based on association rules algorithm, named WCBA. We also present a new pruning and prediction technique based on statistical measures to generate more accurate association rules and thereby enhance the accuracy of AC classifiers. As a case study, we used WCBA to classify breast cancer instances with the help of subject matter experts from King Hussein Cancer Center (KHCC) in Amman, Jordan. We compare WCBA with five well-known AC algorithms (CBA, CMAR, MCAR, FACA and ECBA) on two breast cancer datasets from the UCI machine learning data repository. Experimental results show that WCBA, in most cases, outperformed the other AC algorithms for this case study. In addition, WCBA generates more accurate rules that contain the most efficient attributes for predicting breast cancer. The WCBA algorithm aims to predict breast cancer in a patient. It serves all breast cancer patients by reducing the fear of recurrence and by supporting the necessary measures to prevent the progression of the disease. The algorithm can be generalized to other domains with the help of subject matter experts. (C) 2017 Elsevier B.V. All rights reserved.
Pages: 536-549
Page count: 14