Comparative evaluation of automated machine learning techniques for breast cancer diagnosis

被引:8
作者
Rashed, Amr E. Eldin [1 ,4 ]
Elmorsy, Ashraf M. [2 ]
Atwa, Ahmed E. Mansour [3 ]
机构
[1] Taif Univ, Coll Comp & Informat Technol, Dept Comp Engn, POB 11099, Taif 21944, Saudi Arabia
[2] Horus Univ, Fac Engn, Dept Commun & Elect, New Damietta, Egypt
[3] Mustaqbal Univ, Coll Engn & Comp Sci, Buraydah, Al Qassim, Saudi Arabia
[4] Taif Univ, Coll Comp & Informat Technol, Dept Comp Engn, POB 11099, Taif, Saudi Arabia
关键词
Breast cancer diagnosis; Machine learning; AutoML techniques; Lazy Predict; MATLAB Classification Learner; FEATURE-SELECTION; PREDICTION; CLASSIFICATION; ALGORITHMS; PATTERNS;
D O I
10.1016/j.bspc.2023.105016
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Breast cancer is the second leading cause of death among women worldwide. Early detection is crucial for a high possibility of recovery, but current diagnostic procedures rely on visual inspection by doctors, which is timeconsuming, requires consultation, and lacks reliable automatic detection systems. Therefore, an automatic diagnosis system based on machine learning (ML) models is highly required to improve the accuracy of detection and prediction. In this study, an efficient classification method based on ML techniques was proposed to help doctors diagnose and distinguish between malignant and benign tumors relevant to breast cancer prediction. Seven AutoML techniques, including Orange, Lazy Predict, TPOT, MLJAR, MATLAB classification learner, and AutoKeras, were examined and applied to eight different datasets to evaluate their performance in terms of classification accuracy. The experimental results demonstrate a significant improvement using state-of-the-art techniques, with Lazy Predict and MATLAB Classification Learner outperforming all other ML techniques for binary classification tasks. The contribution of this work is the identification of the best ML model among multiple models inside each AutoML technique applied to different breast cancer datasets. All datasets and codes used in this study are freely available from https://github.com/amrrashed/breast-cancer-datasets-and-codes.
引用
收藏
页数:18
相关论文
共 82 条
[51]  
machinelearningmastery, US AUT CLASS REGR
[52]  
Malik S., 2023, IMPROVED SKIN LESION, P1
[53]  
MathWorks and MATLAB, 2020, TRAIN MOD CLASS DAT
[54]   An Optimized Framework for Breast Cancer Classification Using Machine Learning [J].
Michael, Epimack ;
Ma, He ;
Li, Hong ;
Qi, Shouliang .
BIOMED RESEARCH INTERNATIONAL, 2022, 2022
[55]  
mljar, AUTOML COMP
[56]   A Comprehensive Survey on Deep-Learning-Based Breast Cancer Diagnosis [J].
Mridha, Muhammad Firoz ;
Hamid, Md. Abdul ;
Monowar, Muhammad Mostafa ;
Keya, Ashfia Jannat ;
Ohi, Abu Quwsar ;
Islam, Md. Rashedul ;
Kim, Jong-Myon .
CANCERS, 2021, 13 (23)
[57]   Artificial Intelligence and Machine Learning in Cancer Research: A Systematic and Thematic Analysis of the Top 100 Cited Articles Indexed in Scopus Database [J].
Musa, Ibrahim H. ;
Afolabi, Lukman O. ;
Zamit, Ibrahim ;
Musa, Taha H. ;
Musa, Hassan H. ;
Tassang, Andrew ;
Akintunde, Tosin Y. ;
Li, Wei .
CANCER CONTROL, 2022, 29
[58]  
Naji MA., 2021, Procedia Comput Sci, V191, P487, DOI DOI 10.1016/J.PROCS.2021.07.062
[59]  
ncbi, GEO DATASET BROWS
[60]  
Nguyen T., 2015, CLASSIFICATION HEALT