Vision transformer distillation for enhanced gastrointestinal abnormality recognition in wireless capsule endoscopy images

被引:0
|
作者
Oukdach, Yassine [1 ]
Garbaz, Anass [1 ]
Kerkaou, Zakaria [1 ]
El Ansari, Mohamed [2 ]
Koutti, Lahcen [1 ]
Papachrysos, Nikolaos [3 ,4 ]
El Ouafdi, Ahmed Fouad [1 ]
de Lange, Thomas [3 ,4 ]
Distante, Cosimo [5 ]
机构
[1] Ibn Zohr Univ, Fac Sci, Dept Comp Sci, LabSIV, Agadir, Morocco
[2] Moulay Ismail Univ, Fac Sci, Dept Comp Sci, Informat & Applicat Lab, Meknes, Morocco
[3] Univ Gothenburg, Sahlgrenska Acad, Dept Mol & Clin Med, Gothenburg, Sweden
[4] Sahlgrens Univ Hosp, Med Dept, Molndal, Sweden
[5] CNR, Inst Appl Sci & Intelligent Syst Eduardo Caianiell, Lecce, Italy
关键词
wireless capsule endoscopy; vision transformer; convolutional neural network; attention mechanism; knowledge distillation; gastrointestinal abnormality detection; CANCER STATISTICS; SYSTEM; COLON;
D O I
10.1117/1.JMI.12.1.014505
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Purpose: Wireless capsule endoscopy (WCE) is a non-invasive technology used for diagnosing gastrointestinal abnormalities. A single examination generates similar to 55,000 images, making manual review both time-consuming and costly for doctors. Therefore, the development of computer vision-assisted systems is highly desirable to aid in the diagnostic process. Approach: We presents a deep learning approach leveraging knowledge distillation (KD) from a convolutional neural network (CNN) teacher model to a vision transformer (ViT) student model for gastrointestinal abnormality recognition. The CNN teacher model utilizes attention mechanisms and depth-wise separable convolutions to extract features from WCE images, supervising the ViT in learning these representations. Results: The proposed method achieves accuracy of 97% and 96% on the Kvasir and KID datasets, respectively, demonstrating its effectiveness in distinguishing normal from abnormal regions and bleeding from non-bleeding cases. The proposed approach offers computational efficiency and generalization to unseen datasets, outperforming several state-of-the-art methods. Conclusions: We proposed a deep learning approach utilizing CNNs and a ViT with KD to effectively classify gastrointestinal diseases in WCE images. It demonstrates promising performance on public datasets, distinguishing normal from abnormal regions and bleeding from non-bleeding cases while offering optimal computational efficiency compared with existing methods, making it suitable for GI disease applications.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Dilated CNN for abnormality detection in wireless capsule endoscopy images
    Goel, Nidhi
    Kaur, Samarjeet
    Gunjan, Deepak
    Mahapatra, S. J.
    SOFT COMPUTING, 2022, 26 (03) : 1231 - 1247
  • [2] Dilated CNN for abnormality detection in wireless capsule endoscopy images
    Nidhi Goel
    Samarjeet Kaur
    Deepak Gunjan
    S. J. Mahapatra
    Soft Computing, 2022, 26 : 1231 - 1247
  • [3] Investigating the significance of color space for abnormality detection in wireless capsule endoscopy images
    Goel, Nidhi
    Kaur, Samarjeet
    Gunjan, Deepak
    Mahapatra, S. J.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 75
  • [4] Detection of abnormality in wireless capsule endoscopy images using fractal features
    Jain, Samir
    Seal, Ayan
    Ojha, Aparajita
    Krejcar, Ondrej
    Bures, Jan
    Tacheci, Ilja
    Yazidi, Anis
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 127
  • [5] Automatic Polyp recognition of small bowel in Wireless Capsule Endoscopy images
    Xu, Lanmeng
    Fan, Shanhui
    Fan, Yihong
    Li, Lihua
    MEDICAL IMAGING 2018: IMAGING INFORMATICS FOR HEALTHCARE, RESEARCH, AND APPLICATIONS, 2018, 10579
  • [6] Gastrointestinal Tract Disease Classification Using Residual-Inception Transformer With Wireless Capsule Endoscopy Images Segmentation
    Ozbay, Erdal
    IEEE Access, 2024, 12 : 197988 - 197998
  • [7] ViTCA-Net: a framework for disease detection in video capsule endoscopy images using a vision transformer and convolutional neural network with a specific attention mechanism
    Oukdach, Yassine
    Kerkaou, Zakaria
    El Ansari, Mohamed
    Koutti, Lahcen
    El Ouafdi, Ahmed Fouad
    De Lange, Thomas
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (23) : 63635 - 63654
  • [8] Enhanced CNN-Based Gaze Estimation on Wireless Capsule Endoscopy Images
    Gatoula, Panagiota
    Dimas, George
    Iakovidis, Dimitris K.
    Koulaouzidis, Anastasios
    2021 IEEE 34TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2021, : 189 - 195
  • [9] Bleeding Classification of Enhanced Wireless Capsule Endoscopy Images using Deep Convolutional Neural Network
    Shahril, Rosdiana
    Saito, Atsushi
    Shimizu, Akinobu
    Baharun, Sabariah
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2020, 36 (01) : 91 - 108
  • [10] Hookworm Detection in Wireless Capsule Endoscopy Images With Deep Learning
    He, Jun-Yan
    Wu, Xiao
    Jiang, Yu-Gang
    Peng, Qiang
    Jain, Ramesh
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (05) : 2379 - 2392