Cervical cancer risk prediction with robust ensemble and explainable black boxes method

被引:13
作者
Curia, Francesco [1 ]
机构
[1] Sapienza Univ Rome, Dept Stat Sci, Piazzale Aldo Moro 5, I-00185 Rome, Italy
关键词
Cervical cancer; Ensemble; Interpretable AI; Risk prediction;
D O I
10.1007/s12553-021-00554-6
中图分类号
R-058 [];
学科分类号
摘要
Clinical decision support systems (CDSS) that make use of algorithms based on intelligent systems, such as machine learning or deep learning, they suffer from the fact that often the methods used are hard to interpret and difficult to understand on how some decisions are made; the opacity of some methods, sometimes voluntary due to problems such as data privacy or the techniques used to protect intellectual property, makes these systems very complicated. Besides this series of problems, the results obtained also suffer from the poor possibility of being interpreted; in the clinical context therefore it is required that the methods used are as accurate as possible, transparent techniques and explainable results. In this work the problem of the development of cervical cancer is treated, a disease that mainly affects the female population. In order to introduce advanced machine learning techniques in a clinical decision support system that can be transparent and explainable, a robust, accurate ensemble method is presented, in terms of error and sensitivity linked to the classification of possible development of the aforementioned pathology and advanced techniques are also presented of explainability and interpretability (Explanaible Machine Learning) applied to the context of CDSS such as Lime and Shapley. The results obtained, as well as being interesting, are understandable and can be implemented in the treatment of this type of problem.
引用
收藏
页码:875 / 885
页数:11
相关论文
共 21 条
[1]  
Ali, 2020, 14 INT C RES CHALL I
[2]  
Apley D. W., 2016, ARXIV
[3]   Machine learning explainability via microaggregation and shallow decision trees [J].
Blanco-Justicia, Alberto ;
Domingo-Ferrer, Josep ;
Martinez, Sergio ;
Sanchez, David .
KNOWLEDGE-BASED SYSTEMS, 2020, 194
[4]  
Cramer H., 1946, Mathematical methods of statistics.
[5]   Randomized controlled trial of a patient decision aid for colorectal cancer screening [J].
Dolan, JG ;
Frisina, S .
MEDICAL DECISION MAKING, 2002, 22 (02) :125-139
[6]   Greedy function approximation: A gradient boosting machine [J].
Friedman, JH .
ANNALS OF STATISTICS, 2001, 29 (05) :1189-1232
[7]   PREDICTIVE LEARNING VIA RULE ENSEMBLES [J].
Frieman, Jerome H. ;
Popescu, Bogdan E. .
ANNALS OF APPLIED STATISTICS, 2008, 2 (03) :916-954
[8]   Cervical Cancer Identification with Synthetic Minority Oversampling Technique and PCA Analysis using Random Forest Classifier [J].
Geetha, R. ;
Sivasubramanian, S. ;
Kaliappan, M. ;
Vimal, S. ;
Annamalai, Suresh .
JOURNAL OF MEDICAL SYSTEMS, 2019, 43 (09)
[9]  
Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
[10]  
Koh PW, 2017, PR MACH LEARN RES, V70