Consistency Loss for Improved Colonoscopy Landmark Detection with Vision Transformers

Cited by: 0
Authors
Tamhane, Aniruddha [1]
Dobkin, Daniel [1]
Shtalrid, Ore [1]
Bouhnik, Moshe [1]
Posner, Erez [1]
Mida, Tse'ela [1]
Affiliations
[1] Intuitive Surgical Inc, 1020 Kifer Rd, Sunnyvale, CA 94086 USA
Source
MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2023, PT II | 2024 / Vol. 14349
Keywords
Colonoscopy; Vision Transformer; Landmark Detection; Self-supervised learning; Consistency loss; Data sampling; COLON
DOI
10.1007/978-3-031-45676-3_13
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Colonoscopy is a procedure used to examine the colon and rectum for colorectal cancer or other abnormalities, including polyps or diverticula. Apart from the actual diagnosis, manually processing the snapshots taken during the colonoscopy procedure (for medical record keeping) consumes a large amount of the clinician's time. This can be automated through post-procedural machine-learning-based algorithms that classify anatomical landmarks in the colon. In this work, we have developed a pipeline for training vision transformers to identify anatomical landmarks, including the appendiceal orifice, the ileocecal valve/cecum landmark, and rectum retroflection. To increase the accuracy of the model, we utilize a hybrid approach that combines algorithm-level and data-level techniques. We introduce a consistency loss to enhance model immunity to label inconsistencies, as well as a semantic non-landmark sampling technique aimed at increasing focus on colonic findings. For training and testing our pipeline, we have annotated 307 colonoscopy videos and 2363 snapshots with the assistance of several medical experts for enhanced reliability. The algorithm identifies landmarks with an accuracy of 92% on the test dataset.
Pages: 124-133
Page count: 10