Machine Learning-Based Decision Support System for Early Detection of Breast Cancer

被引:3
作者
Li, Mochen [1 ]
Nanda, Gaurav [1 ]
Chhajedss, Santosh [2 ]
Sundararajan, Raji [1 ]
机构
[1] Purdue Univ, Sch Engn Technol, Grant St, W Lafayette, IN 47907 USA
[2] METs Inst Pharm, Bhujbal Knowledge City, Nasik, Maharashtra, India
关键词
Breast cancer; Data analysis; Machine learning; Feature selection; Decision support system; PREDICTION;
D O I
10.5530/ijper.54.3s.171
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Background: Breast cancer is one of the leading causes of death of women in the United States and also one of the most malignant cancer among women worldwide. Early, more accurate detection of breast cancer enables extended longevity at a reduced cost. Towards this, analyzing the available big data using tools, such as Machine learning-based decision support systems can improve the speed and accuracy of early detection of breast cancer. In this paper, we examined the prediction performance of various state-of-the-art machine learning models and a decision support system based on these models that provided the predicted category along with a prediction confidence measure. Methods: The various machine learning (ML) algorithms applied include Decision Tree, Naive Bayes, k-Nearest Neighbors (kNN) and Support Vector Machine (SVM). We also analyzed the effect of multiple feature selection approaches on the prediction performance. We used the Breast Cancer Wisconsin Dataset from Wisconsin Prognostic Breast Cancer (WPBC) with 569 digitized images of a fine needle aspirate (FNA) of breast mass and 10 realvalued feature information. The performance of the ML model was evaluated using the ten-fold cross-validation approach and also on a prediction set comprising of 20% data with the models trained on remaining 80% data. Sensitivity and Specificity were used as the primary measures of performance. Results: Among all five machine learning methods, SVM had the best performance. Except for the kNN algorithm, the performance of the other three algorithms, Logistic Regressions, Naive Bayes and Decision Trees, were also quite close to SVM. The prediction performance of the decision support system was better than any individual ML model where the prediction confidence was "High" or "Medium". Conclusion: We found that feature selection improved the performance and computation cost for all ML models. By building the ML-based decision support system with the optimal feature subset, the prediction performance for breast cancer can be improved to 96% which means it can provide powerful assistance to doctors and patinets. On the other hand, as the size of the data set increases, the processing of data with a lot of features can increase the computation cost as well as the possibility of classification errors.
引用
收藏
页码:S705 / S715
页数:11
相关论文
共 50 条
[21]   Machine learning-based models for the prediction of breast cancer recurrence risk [J].
Duo Zuo ;
Lexin Yang ;
Yu Jin ;
Huan Qi ;
Yahui Liu ;
Li Ren .
BMC Medical Informatics and Decision Making, 23
[22]   Machine learning-based models for the prediction of breast cancer recurrence risk [J].
Zuo, Duo ;
Yang, Lexin ;
Jin, Yu ;
Qi, Huan ;
Liu, Yahui ;
Ren, Li .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)
[23]   Machine Learning based Intelligent System for Breast Cancer Prediction (MLISBCP) [J].
Das, Akhil Kumar ;
Biswas, Saroj Kr. ;
Mandal, Ardhendu ;
Bhattacharya, Arijit ;
Sanyal, Saptarsi .
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 242
[24]   Machine learning-based clinical decision support systems for pregnancy care: A systematic review [J].
Du, Yuhan ;
McNestry, Catherine ;
Wei, Lan ;
Antoniadi, Anna Markella ;
McAuliffe, Fionnuala M. ;
Mooney, Catherine .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 173
[25]   Machine learning-based medical decision support system for diagnosing HFpEF and HFrEF using PPG [J].
Kavas, Pinar Ozen ;
Bozkurt, Mehmet Recep ;
Kocayigit, Ibrahim ;
Bilgin, Cahit .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
[26]   Application of a Machine Learning-Based Decision Support Tool to Improve an Injury Surveillance System Workflow [J].
Catchpoole, Jesani ;
Nanda, Gaurav ;
Vallmuur, Kirsten ;
Nand, Goshad ;
Lehto, Mark .
APPLIED CLINICAL INFORMATICS, 2022, 13 (03) :700-710
[27]   An intelligent decision support system for production planning based on machine learning [J].
Gonzalez Rodriguez, German ;
Gonzalez-Cava, Jose M. ;
Mendez Perez, Juan Albino .
JOURNAL OF INTELLIGENT MANUFACTURING, 2020, 31 (05) :1257-1273
[28]   Machine learning-based clinical decision support using laboratory data [J].
Cubukcu, Hikmet Can ;
Topcu, Deniz Ilhan ;
Yenice, Sedef .
CLINICAL CHEMISTRY AND LABORATORY MEDICINE, 2024, 62 (05) :793-823
[29]   A machine learning-based decision support framework for energy storage selection [J].
Li, Lanyu ;
Zhou, Tianxun ;
Li, Jiali ;
Wang, Xiaonan .
CHEMICAL ENGINEERING RESEARCH & DESIGN, 2022, 181 :412-422
[30]   An intelligent decision support system for production planning based on machine learning [J].
Germán González Rodríguez ;
Jose M. Gonzalez-Cava ;
Juan Albino Méndez Pérez .
Journal of Intelligent Manufacturing, 2020, 31 :1257-1273