Machine Learning System for the Effective Diagnosis and Survival Prediction of Breast Cancer Patients

被引:4
作者
Gago, Arturo [1 ]
Aguirre, Jean Marko [1 ]
Wong, Lenis [1 ]
机构
[1] Univ Peruana Ciencias Aplicadas, Program Informat Syst Engn, Lima, Peru
关键词
breast cancer; diagnosis; treatment; machine learning (ML); random forest (RF); naive bayes (NB); CLASSIFICATION;
D O I
10.3991/ijoe.v20i02.42883
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Breast cancer is one of the most significant global health challenges. Effective diagnosis and prognosis prediction are crucial for improving patient outcomes in the case of this disease. As machine learning (ML) has significantly improved prediction models in many disciplines, the goal of this study is to develop a ML system for medical specialists that can accurately predict tumor diagnosis and patient survival for breast cancer patients. For the training of diagnosis and survival prediction, five algorithmic models-decision tree (DT), random forest (RF), naive bayes (NB), support vector machines (SVMs), and gradient boosting-were trained with 569 records from the Breast Cancer Wisconsin dataset and 1,980 records from the Breast Cancer Gene Expression Profiles dataset. The results showed that the NB model exhibited better performance for tumor diagnosis, achieving an accuracy of 95.0%, while RF presented the best results for patient survival, with an accuracy of 76.0%. A survey of medical experts' experience with the resulting system showed high scores in reliability, performance, satisfaction, usability, and efficiency, confirming that ML systems have the potential to improve breast cancer patient outcomes.
引用
收藏
页码:95 / 113
页数:19
相关论文
共 32 条
[1]   A new nested ensemble technique for automated diagnosis of breast cancer [J].
Abdar, Moloud ;
Zomorodi-Moghadam, Mariam ;
Zhou, Xujuan ;
Gururajan, Raj ;
Tao, Xiaohui ;
Barua, Prabal D. ;
Gururajan, Rashmi .
PATTERN RECOGNITION LETTERS, 2020, 132 :123-131
[2]   Predicting the recurrence of breast cancer using machine learning algorithms [J].
Alzu'bi, Amal ;
Najadat, Hassan ;
Doulat, Wesam ;
Al-Shari, Osama ;
Zhou, Leming .
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (09) :13787-13800
[3]  
AWS, Set up a Jupyter notebook server-deep learning AMI
[4]  
Cancer.Net, Cancer de mama: Estadisticas | Cancer.Net
[5]  
cbioportal, Breast cancer dataset (METABRIC, nature 2012 & nat commun 2016) in CBioPortal increasing the resolution on breast cancer-the METABRIC study
[6]  
El Massari H, 2022, INT J ADV COMPUT SC, V13, P108
[7]   Predicting breast cancer survivability based on machine learning and features selection algorithms: a comparative study [J].
El Rahman, Sahar A. .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (08) :8585-8623
[8]   Estimating the global cancer incidence and mortality in 2018: GLOBOCAN sources and methods [J].
Ferlay, J. ;
Colombet, M. ;
Soerjomataram, I. ;
Mathers, C. ;
Parkin, D. M. ;
Pineros, M. ;
Znaor, A. ;
Bray, F. .
INTERNATIONAL JOURNAL OF CANCER, 2019, 144 (08) :1941-1953
[9]  
Gandhi Rohith., 2018, Towards Data Science
[10]  
Geron A., 2019, Hands-on machine learning with scikit-learn and TensorFlow: concepts, tools, and techniques to build intelligent systems, V2nd ed.