Vision transformer distillation for enhanced gastrointestinal abnormality recognition in wireless capsule endoscopy images

被引:0
作者
Oukdach, Yassine [1 ]
Garbaz, Anass [1 ]
Kerkaou, Zakaria [1 ]
El Ansari, Mohamed [2 ]
Koutti, Lahcen [1 ]
Papachrysos, Nikolaos [3 ,4 ]
El Ouafdi, Ahmed Fouad [1 ]
de Lange, Thomas [3 ,4 ]
Distante, Cosimo [5 ]
机构
[1] Ibn Zohr Univ, Fac Sci, Dept Comp Sci, LabSIV, Agadir, Morocco
[2] Moulay Ismail Univ, Fac Sci, Dept Comp Sci, Informat & Applicat Lab, Meknes, Morocco
[3] Univ Gothenburg, Sahlgrenska Acad, Dept Mol & Clin Med, Gothenburg, Sweden
[4] Sahlgrens Univ Hosp, Med Dept, Molndal, Sweden
[5] CNR, Inst Appl Sci & Intelligent Syst Eduardo Caianiell, Lecce, Italy
关键词
wireless capsule endoscopy; vision transformer; convolutional neural network; attention mechanism; knowledge distillation; gastrointestinal abnormality detection; CANCER STATISTICS; SYSTEM; COLON;
D O I
10.1117/1.JMI.12.1.014505
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Purpose: Wireless capsule endoscopy (WCE) is a non-invasive technology used for diagnosing gastrointestinal abnormalities. A single examination generates similar to 55,000 images, making manual review both time-consuming and costly for doctors. Therefore, the development of computer vision-assisted systems is highly desirable to aid in the diagnostic process. Approach: We presents a deep learning approach leveraging knowledge distillation (KD) from a convolutional neural network (CNN) teacher model to a vision transformer (ViT) student model for gastrointestinal abnormality recognition. The CNN teacher model utilizes attention mechanisms and depth-wise separable convolutions to extract features from WCE images, supervising the ViT in learning these representations. Results: The proposed method achieves accuracy of 97% and 96% on the Kvasir and KID datasets, respectively, demonstrating its effectiveness in distinguishing normal from abnormal regions and bleeding from non-bleeding cases. The proposed approach offers computational efficiency and generalization to unseen datasets, outperforming several state-of-the-art methods. Conclusions: We proposed a deep learning approach utilizing CNNs and a ViT with KD to effectively classify gastrointestinal diseases in WCE images. It demonstrates promising performance on public datasets, distinguishing normal from abnormal regions and bleeding from non-bleeding cases while offering optimal computational efficiency compared with existing methods, making it suitable for GI disease applications.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Computer-assisted bleeding detection in wireless capsule endoscopy images
    Figueiredo, Isabel N.
    Kumar, Sunil
    Leal, Carlos
    Figueiredo, Pedro N.
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2013, 1 (04) : 198 - 210
  • [32] Abnormalities detection in wireless capsule endoscopy images using EM algorithm
    Amiri, Zahra
    Hassanpour, Hamid
    Beghdadi, Azeddine
    VISUAL COMPUTER, 2023, 39 (07) : 2999 - 3010
  • [33] Se-Resnet: A Novel Method for Gastrointestinal (GI) Diseases Classification from Wireless Capsule Endoscopy (WCE) Images
    Padmavathi, Panguluri
    Harikiran, Jonnadula
    TRAITEMENT DU SIGNAL, 2023, 40 (04) : 1341 - 1353
  • [34] Principal Curvature Based Polyp Detection in Wireless Capsule Endoscopy Images
    Vani, V.
    Prashanth, K. V. Mahendra
    2017 INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN ELECTRONICS AND COMMUNICATION TECHNOLOGY (ICRAECT), 2017, : 5 - 10
  • [35] Computer vision-based solutions to overcome the limitations of wireless capsule endoscopy
    Horovistiz A.
    Oliveira M.
    Araújo H.
    Journal of Medical Engineering and Technology, 2023, 47 (04) : 242 - 261
  • [36] Abnormalities detection from wireless capsule endoscopy images based on embedding learning with triplet loss
    Charfi, Said
    El Ansari, Mohamed
    Koutti, Lahcen
    Ellahyani, Ayoub
    Eljaafari, Ilyas
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (29) : 73079 - 73100
  • [37] Computer-aided detection of small intestinal ulcer and erosion in wireless capsule endoscopy images
    Fan, Shanhui
    Xu, Lanmeng
    Fan, Yihong
    Wei, Kaihua
    Li, Lihua
    PHYSICS IN MEDICINE AND BIOLOGY, 2018, 63 (16)
  • [38] An enhanced speech emotion recognition using vision transformer
    Akinpelu, Samson
    Viriri, Serestina
    Adegun, Adekanmi
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [39] RAt-CapsNet: A Deep Learning Network Utilizing Attention and Regional Information for Abnormality Detection in Wireless Capsule Endoscopy
    Alam, Md Jahin
    Bin Rashid, Rifat
    Fattah, Shaikh Anowarul
    Saquib, Mohammad
    IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE, 2022, 10
  • [40] Deep CNN and geometric features-based gastrointestinal tract diseases detection and classification from wireless capsule endoscopy images
    Sharif, Muhammad
    Khan, Muhammad Attique
    Rashid, Muhammad
    Yasmin, Mussarat
    Afza, Farhat
    Tanik, Urcun John
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2021, 33 (04) : 577 - 599