Wireless Capsule Endoscopy Image Classification: An Explainable AI Approach

Cited by: 12

Authors:

Varam, Dara ^{[1]}

Mitra, Rohan ^{[1]}

Mkadmi, Meriam ^{[1]}

Riyas, Radi Aman ^{[1]}

Abuhani, Diaa Addeen ^{[1]}

Dhou, Salam ^{[1]}

Alzaatreh, Ayman ^{[2]}

Affiliations:

[1] Amer Univ Sharjah, Dept Comp Sci & Engn, Sharjah, U Arab Emirates

[2] Amer Univ Sharjah, Dept Math & Stat, Sharjah, U Arab Emirates

Source:

IEEE ACCESS | 2023 / Vol. 11

Keywords:

Solid modeling; Analytical models; Endoscopes; Classification algorithms; Feature extraction; Wireless communication; Gastrointestinal tract; Deep learning; Artificial intelligence; Machine learning; explainable AI; gastrointestinal diseases; machine learning; vision transformer; wireless capsule endoscopy;

DOI:

10.1109/ACCESS.2023.3319068

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Deep Learning (DL) has contributed significantly to the advances made in the fields of Medical Imaging and Computer-Aided Diagnosis (CAD). Although a variety of DL models exist for image classification in the medical domain, more analysis needs to be conducted on their decision-making processes. For this reason, several novel Explainable AI (XAI) techniques have been proposed in recent years to better understand DL models. Currently, medical professionals rely on visual inspection to diagnose potential diseases in endoscopic imaging in the preliminary stages. However, we believe that the use of automated systems can enhance the efficiency of such diagnoses. The aim of this study is to increase the reliability of model predictions within the field of endoscopic imaging by implementing several transfer learning models on a balanced subset of Kvasir-capsule, a Wireless Capsule Endoscopy imaging dataset. This subset includes the top 9 classes of the dataset for training and testing. The Vision Transformer model obtained an F1-score of 97% ± 1%, and other models such as MobileNetV3Large and ResNet152V2 also achieved F1-scores of over 90%. These are currently the highest-reported metrics on this data, improving upon prior studies done on the same dataset. The heatmaps of several XAI techniques, including GradCAM, GradCAM++, LayerCAM, LIME, and SHAP, are presented in image form and evaluated according to their highlighted regions of importance. This is done in an effort to better understand the decisions of the top-performing DL models and to look beyond their black-box nature.
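The abstract mentions a balanced subset of the top 9 classes and reports F1-scores, but this record does not say how the balancing was done or which F1 average was used. The following is only a minimal sketch under two assumptions that are not taken from the paper: balancing by random undersampling to the size of the smallest selected class, and macro-averaged F1. The function names `balanced_top_k_subset` and `macro_f1` are hypothetical.

```python
import random
from collections import Counter, defaultdict


def balanced_top_k_subset(labels, k=9, seed=0):
    """Return sorted indices of a class-balanced subset of the k most frequent classes.

    Assumption (not from the paper): balance by undersampling every selected
    class down to the size of the smallest selected class.
    """
    counts = Counter(labels)
    top_classes = [cls for cls, _ in counts.most_common(k)]
    per_class = min(counts[cls] for cls in top_classes)

    # Group sample indices by class, keeping only the selected classes.
    indices_by_class = defaultdict(list)
    for index, label in enumerate(labels):
        if label in top_classes:
            indices_by_class[label].append(index)

    rng = random.Random(seed)  # fixed seed so the subset is reproducible
    chosen = []
    for cls in top_classes:
        chosen.extend(rng.sample(indices_by_class[cls], per_class))
    return sorted(chosen)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 = 2*TP / (2*TP + FP + FN)."""
    classes = sorted(set(y_true) | set(y_pred))
    per_class_f1 = []
    for cls in classes:
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == cls and p == cls)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != cls and p == cls)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == cls and p != cls)
        denom = 2 * tp + fp + fn
        per_class_f1.append(2 * tp / denom if denom else 0.0)
    return sum(per_class_f1) / len(per_class_f1)
```

A spread such as the reported ± 1% would typically come from repeating training and evaluation over several runs or folds and summarizing the per-run scores; how the authors obtained it is not stated in this record.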

Citation

Pages: 105262-105280

Page count: 19

References (61 in total)

[1] A survey of visual analytics for Explainable Artificial Intelligence methods [J].

Alicioglu, Gulsum ;

Sun, Bo .

COMPUTERS & GRAPHICS-UK, 2022, 102 :502-520

[2]

Amirthalingam M., 2023, 2023 7th International Conference on Intelligent Computing and Control Systems (ICICCS), P110, DOI 10.1109/ICICCS56967.2023.10142708

[3]

Amirthalingam M., 2023, 2023 Third International Conference on Artificial Intelligence and Smart Energy (ICAIS), P851, DOI 10.1109/ICAIS56108.2023.10073766

[4]

[Anonymous], 2023, VOLUME, V11

[5] GASTRO-CADx: a three stages framework for diagnosing gastrointestinal diseases [J].

Attallah, Omneya ;

Sharkas, Maha .

PEERJ COMPUTER SCIENCE, 2021, :1-36

[6] Comparative Validation of Polyp Detection Methods in Video Colonoscopy: Results From the MICCAI 2015 Endoscopic Vision Challenge [J].

Bernal, Jorge ;

Tajbakhsh, Nima ;

Sanchez, Francisco Javier ;

Matuszewski, Bogdan J. ;

Chen, Hao ;

Yu, Lequan ;

Angermann, Quentin ;

Romain, Olivier ;

Rustad, Bjorn ;

Balasingham, Ilangko ;

Pogorelov, Konstantin ;

Choi, Sungbin ;

Debard, Quentin ;

Maier-Hein, Lena ;

Speidel, Stefanie ;

Stoyanov, Danail ;

Brandao, Patrick ;

Cordova, Henry ;

Sanchez-Montes, Cristina ;

Gurudu, Suryakanth R. ;

Fernandez-Esparrach, Gloria ;

Dray, Xavier ;

Liang, Jianming ;

Histace, Aymeric .

IEEE TRANSACTIONS ON MEDICAL IMAGING, 2017, 36 (06) :1231-1249

[7] HyperKvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy [J].

Borgli, Hanna ;

Thambawita, Vajira ;

Smedsrud, Pia H. ;

Hicks, Steven ;

Jha, Debesh ;

Eskeland, Sigrun L. ;

Randel, Kristin Ranheim ;

Pogorelov, Konstantin ;

Lux, Mathias ;

Nguyen, Duc Tien Dang ;

Johansen, Dag ;

Griwodz, Carsten ;

Stensland, Hakon K. ;

Garcia-Ceja, Enrique ;

Schmidt, Peter T. ;

Hammer, Hugo L. ;

Riegler, Michael A. ;

Halvorsen, Pal ;

de Lange, Thomas .

SCIENTIFIC DATA, 2020, 7 (01)

[8]

Caroppo Andrea, 2023, Procedia Computer Science, P1136, DOI 10.1016/j.procs.2023.01.394

[9] Grad-CAM++: Generalized Gradient-based Visual Explanations for Deep Convolutional Networks [J].

Chattopadhay, Aditya ;

Sarkar, Anirban ;

Howlader, Prantik ;

Balasubramanian, Vineeth N. .

2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :839-847

[10]

Dey R. K., 2023, P 17 INT C UB INF MA, P1
