Exploring the Effect of Dataset Diversity in Self-supervised Learning for Surgical Computer Vision

被引:0
|
作者
Jaspers, Tim J. M. [1 ]
de Jonker, Ronald L. P. D. [2 ]
Al Khalil, Yasmina [2 ]
Zeelenberg, Tijn [1 ]
Kusters, Carolus H. J. [1 ]
Li, Yiping [2 ]
van Jaarsveld, Romy C. [3 ]
Bakker, Franciscus H. A. [4 ,5 ]
Ruurda, Jelle P. [3 ]
Brinkman, Willem M. [4 ]
De With, Peter H. N. [1 ]
van der Sommen, Fons [1 ]
机构
[1] Eindhoven Univ Technol, Dept Elect Engn Video Coding & Architectures, Eindhoven, Netherlands
[2] Eindhoven Univ Technol, Dept Biomed Engn, Med Image Anal, Eindhoven, Netherlands
[3] Univ Med Ctr Utrecht, Dept Surg, Utrecht, Netherlands
[4] Univ Med Ctr Utrecht, Dept Oncol Urol, Utrecht, Netherlands
[5] Catharina Hosp, Dept Urol, Eindhoven, Netherlands
来源
DATA ENGINEERING IN MEDICAL IMAGING, DEMI 2024 | 2025年 / 15265卷
关键词
Self-supervised learning; Surgical computer vision; Transfer learning; Data diversity; RECOGNITION;
D O I
10.1007/978-3-031-73748-0_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the past decade, computer vision applications in minimally invasive surgery have rapidly increased. Despite this growth, the impact of surgical computer vision remains limited compared to other medical fields like pathology and radiology, primarily due to the scarcity of representative annotated data. Whereas transfer learning from large annotated datasets such as ImageNet has been conventionally the norm to achieve high-performing models, recent advancements in self-supervised learning (SSL) have demonstrated superior performance. In medical image analysis, in-domain SSL pretraining has already been shown to outperform ImageNet-based initialization. Although unlabeled data in the field of surgical computer vision is abundant, the diversity within this data is limited. This study investigates the role of dataset diversity in SSL for surgical computer vision, comparing procedure-specific datasets against a more heterogeneous general surgical dataset across three different downstream surgical applications. The obtained results show that using solely procedure-specific data can lead to substantial improvements of 13.8%, 9.5%, and 36.8% compared to ImageNet pretraining. However, extending this data with more heterogeneous surgical data further increases performance by an additional 5.0%, 5.2%, and 2.5%, suggesting that increasing diversity within SSL data is beneficial for model performance. The code and pretrained model weights are made publicly available at https://github.com/TimJaspers0801/SurgeNet.
引用
收藏
页码:43 / 53
页数:11
相关论文
共 50 条
  • [1] Dissecting self-supervised learning methods for surgical computer vision
    Ramesh, Sanat
    Srivastav, Vinkle
    Alapatt, Deepak
    Yu, Tong
    Murali, Aditya
    Sestini, Luca
    Nwoye, Chinedu Innocent
    Hamoud, Idris
    Sharma, Saurav
    Fleurentin, Antoine
    Exarchakis, Georgios
    Karargyris, Alexandros
    Padoy, Nicolas
    MEDICAL IMAGE ANALYSIS, 2023, 88
  • [2] On Pretraining Data Diversity for Self-Supervised Learning
    Hammoud, Hasan Abed Al Kader
    Das, Tuhin
    Pizzati, Fabio
    Torre, Philip H. S.
    Bibi, Adel
    Ghanem, Bernard
    COMPUTER VISION - ECCV 2024, PT LVI, 2025, 15114 : 54 - 71
  • [3] Domain adaptation and self-supervised learning for surgical margin detection
    Alice M. L. Santilli
    Amoon Jamzad
    Alireza Sedghi
    Martin Kaufmann
    Kathryn Logan
    Julie Wallis
    Kevin Y. M Ren
    Natasja Janssen
    Shaila Merchant
    Jay Engel
    Doug McKay
    Sonal Varma
    Ami Wang
    Gabor Fichtinger
    John F. Rudan
    Parvin Mousavi
    International Journal of Computer Assisted Radiology and Surgery, 2021, 16 : 861 - 869
  • [4] Domain adaptation and self-supervised learning for surgical margin detection
    Santilli, Alice M. L.
    Jamzad, Amoon
    Sedghi, Alireza
    Kaufmann, Martin
    Logan, Kathryn
    Wallis, Julie
    Ren, Kevin Y. M.
    Janssen, Natasja
    Merchant, Shaila
    Engel, Jay
    McKay, Doug
    Varma, Sonal
    Wang, Ami
    Fichtinger, Gabor
    Rudan, John F.
    Mousavi, Parvin
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2021, 16 (05) : 861 - 869
  • [5] Exploring Self-Supervised Vision Transformers for Gait Recognition in the Wild
    Cosma, Adrian
    Catruna, Andy
    Radoi, Emilian
    SENSORS, 2023, 23 (05)
  • [6] Liveness Detection in Computer Vision: Transformer-Based Self-Supervised Learning for Face Anti-Spoofing
    Keresh, Arman
    Shamoi, Pakizar
    IEEE ACCESS, 2024, 12 : 185673 - 185685
  • [7] Exploring Relations in Untrimmed Videos for Self-Supervised Learning
    Luo, Dezhao
    Zhou, Yu
    Fang, Bo
    Zhou, Yucan
    Wu, Dayan
    Wang, Weiping
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (01)
  • [8] Self-supervised representation learning for surgical activity recognition
    Paysan, Daniel
    Haug, Luis
    Bajka, Michael
    Oelhafen, Markus
    Buhmann, Joachim M.
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2021, 16 (11) : 2037 - 2044
  • [9] Self-supervised representation learning for surgical activity recognition
    Daniel Paysan
    Luis Haug
    Michael Bajka
    Markus Oelhafen
    Joachim M. Buhmann
    International Journal of Computer Assisted Radiology and Surgery, 2021, 16 : 2037 - 2044
  • [10] Self-supervised anomaly detection in computer vision and beyond: A survey and outlook
    Hojjati, Hadi
    Ho, Thi Kieu Khanh
    Armanfard, Narges
    NEURAL NETWORKS, 2024, 172