Comparative evaluation of automated machine learning techniques for breast cancer diagnosis

被引:8
作者
Rashed, Amr E. Eldin [1 ,4 ]
Elmorsy, Ashraf M. [2 ]
Atwa, Ahmed E. Mansour [3 ]
机构
[1] Taif Univ, Coll Comp & Informat Technol, Dept Comp Engn, POB 11099, Taif 21944, Saudi Arabia
[2] Horus Univ, Fac Engn, Dept Commun & Elect, New Damietta, Egypt
[3] Mustaqbal Univ, Coll Engn & Comp Sci, Buraydah, Al Qassim, Saudi Arabia
[4] Taif Univ, Coll Comp & Informat Technol, Dept Comp Engn, POB 11099, Taif, Saudi Arabia
关键词
Breast cancer diagnosis; Machine learning; AutoML techniques; Lazy Predict; MATLAB Classification Learner; FEATURE-SELECTION; PREDICTION; CLASSIFICATION; ALGORITHMS; PATTERNS;
D O I
10.1016/j.bspc.2023.105016
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Breast cancer is the second leading cause of death among women worldwide. Early detection is crucial for a high possibility of recovery, but current diagnostic procedures rely on visual inspection by doctors, which is timeconsuming, requires consultation, and lacks reliable automatic detection systems. Therefore, an automatic diagnosis system based on machine learning (ML) models is highly required to improve the accuracy of detection and prediction. In this study, an efficient classification method based on ML techniques was proposed to help doctors diagnose and distinguish between malignant and benign tumors relevant to breast cancer prediction. Seven AutoML techniques, including Orange, Lazy Predict, TPOT, MLJAR, MATLAB classification learner, and AutoKeras, were examined and applied to eight different datasets to evaluate their performance in terms of classification accuracy. The experimental results demonstrate a significant improvement using state-of-the-art techniques, with Lazy Predict and MATLAB Classification Learner outperforming all other ML techniques for binary classification tasks. The contribution of this work is the identification of the best ML model among multiple models inside each AutoML technique applied to different breast cancer datasets. All datasets and codes used in this study are freely available from https://github.com/amrrashed/breast-cancer-datasets-and-codes.
引用
收藏
页数:18
相关论文
共 82 条
[1]  
Aalaei S, 2016, IRAN J BASIC MED SCI, V19, P476
[2]   Predicting Breast Cancer Leveraging Supervised Machine Learning Techniques [J].
Aamir, Sanam ;
Rahim, Aqsa ;
Aamir, Zain ;
Abbasi, Saadullah Farooq ;
Khan, Muhammad Shahbaz ;
Alhaisoni, Majed ;
Khan, Muhammad Attique ;
Khan, Khyber ;
Ahmad, Jawad .
COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022
[3]   CWV-BANN-SVM ensemble learning classifier for an accurate diagnosis of breast cancer [J].
Abdar, Moloud ;
Makarenkov, Vladimir .
MEASUREMENT, 2019, 146 :557-570
[4]   On Breast Cancer Detection: An Application of Machine Learning Algorithms on the Wisconsin Diagnostic Dataset [J].
Agarap, Abien Fred M. .
2ND INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING (ICMLSC 2018), 2015, :5-9
[5]   A GA-based feature selection and parameter optimization of an ANN in diagnosing breast cancer [J].
Ahmad, Fadzil ;
Isa, Nor Ashidi Mat ;
Hussain, Zakaria ;
Osman, Muhammad Khusairi ;
Sulaiman, Siti Noraini .
PATTERN ANALYSIS AND APPLICATIONS, 2015, 18 (04) :861-870
[6]   Support vector machines combined with feature selection for breast cancer diagnosis [J].
Akay, Mehmet Fatih .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :3240-3247
[7]   Breast cancer diagnosis using GA feature selection and Rotation Forest [J].
Alickovic, Emina ;
Subasi, Abdulhamit .
NEURAL COMPUTING & APPLICATIONS, 2017, 28 (04) :753-763
[8]  
Alrashidi H., 2023, BC 2 NETRF BREAST CA
[9]   A Two Stream Fusion Assisted Deep Learning Framework for Stomach Diseases Classification [J].
Amin, Muhammad Shahid ;
Shah, Jamal Hussain ;
Yasmin, Mussarat ;
Ansari, Ghulam Jillani ;
Khan, Muhamamd Attique ;
Tariq, Usman ;
Kim, Ye Jin ;
Chang, Byoungchol .
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02) :4423-4439
[10]  
analyticsindiamag, 10 POP AUTOML TOOLS