Breast Cancer Prediction and Detection Using Data Mining Classification Algorithms: A Comparative Study

被引:18
作者
Kaya Keles, Mumine [1 ]
机构
[1] Adana Sci & Technol Univ, Dept Comp Engn, Balcali Mahallesi,Catalan Caddesi 201-1, TR-01250 Saricam Adana, Turkey
来源
TEHNICKI VJESNIK-TECHNICAL GAZETTE | 2019年 / 26卷 / 01期
关键词
breast cancer; classification; data mining; detection and prediction of tumor; supervised machine learning algorithms; CLASSIFIERS;
D O I
10.17559/TV-20180417102943
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Today, cancer has become a common disease that can afflict the life of one of every three people. Breast cancer is also one of the cancer types for which early diagnosis and detection is especially important. The earlier breast cancer is detected, the higher the chances of the patient being treated. Therefore, many early detection or prediction methods are being investigated and used in the fight against breast cancer. In this paper, the aim was to predict and detect breast cancer early with non-invasive and painless methods that use data mining algorithms. All the data mining classification algorithms in Weka were run and compared against a data set obtained from the measurements of an antenna consisting of frequency bandwidth, dielectric constant of the antenna's substrate, electric field and tumor information for breast cancer detection and prediction. Results indicate that Bagging, IBk, Random Committee, Random Forest, and SimpleCART algorithms were the most successful algorithms, with over 90% accuracy in detection. This comparative study of several classification algorithms for breast cancer diagnosis using a data set from the measurements of an antenna with a 10-fold cross-validation method provided a perspective into the data mining methods' ability of relative prediction. From data obtained in this study it can be said that if a patient has a breast cancer tumor, detection of the tumor is possible.
引用
收藏
页码:149 / 155
页数:7
相关论文
共 35 条
[1]   AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION [J].
ALTMAN, NS .
AMERICAN STATISTICIAN, 1992, 46 (03) :175-185
[2]  
Amutha R., 2015, PARIPEX INDIAN J RES, V4, P6
[3]  
[Anonymous], THESIS
[4]  
[Anonymous], 2012, International Journal of Computer Science, Engineering and Information Technology (IJCSEIT)
[5]  
Avsar Aydin E., 2014, J CUKUROVA U FACULTY, V29, P27
[6]  
Avsar Aydin E., 2014, URS TURK 2014 7 SCI
[7]   Breast cancer detection using K-nearest neighbors data mining method obtained from the bow-tie antenna dataset [J].
Aydin, Emine Avsar ;
Keles, Mumine Kaya .
INTERNATIONAL JOURNAL OF RF AND MICROWAVE COMPUTER-AIDED ENGINEERING, 2017, 27 (06)
[8]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32