XAI-driven CatBoost multi-layer perceptron neural network for analyzing breast cancer

被引:9
作者
Srinivasu, P. Naga [1 ,2 ]
Lakshmi, G. Jaya [3 ]
Gudipalli, Abhishek [4 ]
Narahari, Sujatha Canavoy [5 ]
Shafi, Jana [6 ]
Wozniak, Marcin [7 ]
Ijaz, Muhammad Fazal [8 ]
机构
[1] Univ Fed Ceara, Dept Teleinformat Engn, BR-60455970 Fortaleza, Brazil
[2] Amrita Vishwa Vidyapeetham, Amrita Sch Comp, Amaravati 522503, Andhra Pradesh, India
[3] V R Siddhartha Engn Coll, Dept Informat Technol, Vijayawada 520007, India
[4] Vellore Inst Technol, Sch Elect Engn, Vellore 632014, India
[5] Sreenidhi Inst Sci & Technol, Dept Elect & Commun Engn, Hyderabad 501301, India
[6] Prince Sattam Bin Abdulaziz Univ, Coll Engn Wadi Alddawasir, Dept Comp Engn & Informat, Wadi Al Dawasir 11991, Saudi Arabia
[7] Silesian Tech Univ, Fac Appl Math, PL-44100 Gliwice, Poland
[8] Melbourne Inst Technol, Sch IT & Engn, Melbourne, Vic 3000, Australia
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
Explainable artificial intelligence; SHAP; ANOVA; Breast cancer; CatBoost; Multi-layer perceptron; CLASSIFICATION;
D O I
10.1038/s41598-024-79620-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Early diagnosis of breast cancer is exceptionally important in signifying the treatment results, of women's health. The present study outlines a novel approach for analyzing breast cancer data by using the CatBoost classification model with a multi-layer perceptron neural network (CatBoost+MLP). Explainable artificial intelligence techniques are used to cohere with the proposed CatBoost with the MLP model. The proposed model aims to enhance the interpretability of predictions in breast cancer diagnosis by leveraging the benefits of CatBoost classification technique in feature identification and also contributing towards the interpretability of the decision model. The proposed CatBoost+MLP has been evaluated using the Shapley additive explanations values to analyze the feature significance in decision-making. Initially, the feature engineering is done using the analysis of variance technique to identify the significant features. The MLP model alone and the CatBoost+MLP model are being analyzed using divergent performance metrics, and the results obtained are compared with contemporary breast cancer identification techniques.
引用
收藏
页数:19
相关论文
共 42 条
[1]   A Software Framework for Predicting the Maize Yield Using Modified Multi-Layer Perceptron [J].
Ahmed, Shakeel .
SUSTAINABILITY, 2023, 15 (04)
[2]   Enhancing Breast Cancer Detection and Classification Using Advanced Multi-Model Features and Ensemble Machine Learning Techniques [J].
Al Reshan, Mana Saleh ;
Amin, Samina ;
Zeb, Muhammad Ali ;
Sulaiman, Adel ;
Alshahrani, Hani ;
Azar, Ahmad Taher ;
Shaikh, Asadullah .
LIFE-BASEL, 2023, 13 (10)
[3]  
[Anonymous], Breast cancer digital repository
[4]  
Bazazeh D, 2016, INT C ELECT DEVICE S
[5]   Diagnostic validation study of rapid urinary tract infection diagnosis kit at peripheral health facilities of West Bengal, India [J].
Chakraborty, Debjit ;
Debnath, Falguni ;
Majumdar, Agniva ;
Chakrabarti, Atreyi ;
Sharma, Monica ;
Walia, Kamini ;
Deb, Alok Kumar ;
Dutta, Shanta .
SCIENTIFIC REPORTS, 2024, 14 (01)
[6]   An Improved CatBoost-Based Classification Model for Ecological Suitability of Blueberries [J].
Chang, Wenfeng ;
Wang, Xiao ;
Yang, Jing ;
Qin, Tao .
SENSORS, 2023, 23 (04)
[7]  
Chen Hua, 2023, Comput Intell Neurosci, V2023, P6530719, DOI 10.1155/2023/6530719
[8]   Deep Convolutional Neural Networks for breast cancer screening [J].
Chougrad, Hiba ;
Zouaki, Hamid ;
Alheyane, Omar .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2018, 157 :19-30
[9]  
DDSM, Digital Database for Screening Mammography
[10]   Prediction of Breast Cancer, Comparative Review of Machine Learning Techniques, and Their Analysis [J].
Fatima, Noreen ;
Liu, Li ;
Hong, Sha ;
Ahmed, Haroon .
IEEE ACCESS, 2020, 8 :150360-150376