Vision transformer distillation for enhanced gastrointestinal abnormality recognition in wireless capsule endoscopy images

被引:0
作者
Oukdach, Yassine [1 ]
Garbaz, Anass [1 ]
Kerkaou, Zakaria [1 ]
El Ansari, Mohamed [2 ]
Koutti, Lahcen [1 ]
Papachrysos, Nikolaos [3 ,4 ]
El Ouafdi, Ahmed Fouad [1 ]
de Lange, Thomas [3 ,4 ]
Distante, Cosimo [5 ]
机构
[1] Ibn Zohr Univ, Fac Sci, Dept Comp Sci, LabSIV, Agadir, Morocco
[2] Moulay Ismail Univ, Fac Sci, Dept Comp Sci, Informat & Applicat Lab, Meknes, Morocco
[3] Univ Gothenburg, Sahlgrenska Acad, Dept Mol & Clin Med, Gothenburg, Sweden
[4] Sahlgrens Univ Hosp, Med Dept, Molndal, Sweden
[5] CNR, Inst Appl Sci & Intelligent Syst Eduardo Caianiell, Lecce, Italy
关键词
wireless capsule endoscopy; vision transformer; convolutional neural network; attention mechanism; knowledge distillation; gastrointestinal abnormality detection; CANCER STATISTICS; SYSTEM; COLON;
D O I
10.1117/1.JMI.12.1.014505
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Purpose: Wireless capsule endoscopy (WCE) is a non-invasive technology used for diagnosing gastrointestinal abnormalities. A single examination generates similar to 55,000 images, making manual review both time-consuming and costly for doctors. Therefore, the development of computer vision-assisted systems is highly desirable to aid in the diagnostic process. Approach: We presents a deep learning approach leveraging knowledge distillation (KD) from a convolutional neural network (CNN) teacher model to a vision transformer (ViT) student model for gastrointestinal abnormality recognition. The CNN teacher model utilizes attention mechanisms and depth-wise separable convolutions to extract features from WCE images, supervising the ViT in learning these representations. Results: The proposed method achieves accuracy of 97% and 96% on the Kvasir and KID datasets, respectively, demonstrating its effectiveness in distinguishing normal from abnormal regions and bleeding from non-bleeding cases. The proposed approach offers computational efficiency and generalization to unseen datasets, outperforming several state-of-the-art methods. Conclusions: We proposed a deep learning approach utilizing CNNs and a ViT with KD to effectively classify gastrointestinal diseases in WCE images. It demonstrates promising performance on public datasets, distinguishing normal from abnormal regions and bleeding from non-bleeding cases while offering optimal computational efficiency compared with existing methods, making it suitable for GI disease applications.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Wireless Capsule Endoscopy Bleeding Images Classification Using CNN Based Model
    Rustam, Furqan
    Siddique, Muhammad Abubakar
    Siddiqui, Hafeez Ur Rehman
    Ullah, Saleem
    Mehmood, Arif
    Ashraf, Imran
    Choi, Gyu Sang
    IEEE ACCESS, 2021, 9 : 33675 - 33688
  • [42] Lymphangiectasia Detection in Wireless Capsule Endoscopy images Using Fisher Transform Method
    Alizadeh, Mahdi
    Eskandari, Hoda
    Sharzehi, Kaveh
    2015 41ST ANNUAL NORTHEAST BIOMEDICAL ENGINEERING CONFERENCE (NEBEC), 2015,
  • [43] Detection of Lymphangiectasia Disease from Wireless Capsule Endoscopy Images with Adaptive Threshold
    Cui, Lei
    Hu, Chao
    Zou, Yuexian
    Song, Shuang
    He, Qing
    Meng, Max Q. -H.
    2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 3088 - 3093
  • [44] Lesion Detection in Wireless Capsule Endoscopy Images Using Texture and Color Features
    Jia, Zhiwei
    Liu, Yong
    Zhang, Liming
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2018, 8 (07) : 1397 - 1401
  • [45] Computer-Aided System for Polyp Detection in Wireless Capsule Endoscopy Images
    El Ansari, Mohamed
    Charfi, Said
    2017 INTERNATIONAL CONFERENCE ON WIRELESS NETWORKS AND MOBILE COMMUNICATIONS (WINCOM), 2017, : 407 - 412
  • [46] Bag of Visual Words Approach for Bleeding Detection in Wireless Capsule Endoscopy Images
    Joshi, Indu
    Kumar, Sunil
    Figueiredo, Isabel N.
    IMAGE ANALYSIS AND RECOGNITION (ICIAR 2016), 2016, 9730 : 575 - 582
  • [47] Automated bleeding detection in wireless capsule endoscopy images based on sparse coding
    Abhinav Patel
    Kumi Rani
    Sunil Kumar
    Isabel N. Figueiredo
    Pedro N. Figueiredo
    Multimedia Tools and Applications, 2021, 80 : 30353 - 30366
  • [48] CONVOLUTIONAL NEURAL NETWORKS FOR INTESTINAL HEMORRHAGE DETECTION IN WIRELESS CAPSULE ENDOSCOPY IMAGES
    Li, Panpeng
    Li, Ziyun
    Gao, Fei
    Wan, Li
    Yu, Jun
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 1518 - 1523
  • [49] Effective deep learning based segmentation and classification in wireless capsule endoscopy images
    Panguluri Padmavathi
    Jonnadula Harikiran
    J. Vijaya
    Multimedia Tools and Applications, 2023, 82 : 47109 - 47133
  • [50] Stomach, intestine and colon tissue discriminators for wireless capsule endoscopy images.
    Berens, J
    Mackiewicz, M
    Bell, D
    MEDICAL IMAGING 2005: IMAGE PROCESSING, PT 1-3, 2005, 5747 : 283 - 290