Exploring the Effect of Dataset Diversity in Self-supervised Learning for Surgical Computer Vision

Cited by: 0

Authors:

Jaspers, Tim J. M. ^{[1]}

de Jonker, Ronald L. P. D. ^{[2]}

Al Khalil, Yasmina ^{[2]}

Zeelenberg, Tijn ^{[1]}

Kusters, Carolus H. J. ^{[1]}

Li, Yiping ^{[2]}

van Jaarsveld, Romy C. ^{[3]}

Bakker, Franciscus H. A. ^{[4,5]}

Ruurda, Jelle P. ^{[3]}

Brinkman, Willem M. ^{[4]}

De With, Peter H. N. ^{[1]}

van der Sommen, Fons ^{[1]}

Affiliations:

[1] Eindhoven Univ Technol, Dept Elect Engn Video Coding & Architectures, Eindhoven, Netherlands

[2] Eindhoven Univ Technol, Dept Biomed Engn, Med Image Anal, Eindhoven, Netherlands

[3] Univ Med Ctr Utrecht, Dept Surg, Utrecht, Netherlands

[4] Univ Med Ctr Utrecht, Dept Oncol Urol, Utrecht, Netherlands

[5] Catharina Hosp, Dept Urol, Eindhoven, Netherlands

Source:

DATA ENGINEERING IN MEDICAL IMAGING, DEMI 2024 | 2025 / Vol. 15265

Keywords:

Self-supervised learning; Surgical computer vision; Transfer learning; Data diversity; RECOGNITION;

DOI:

10.1007/978-3-031-73748-0_5

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Over the past decade, computer vision applications in minimally invasive surgery have rapidly increased. Despite this growth, the impact of surgical computer vision remains limited compared to other medical fields like pathology and radiology, primarily due to the scarcity of representative annotated data. Whereas transfer learning from large annotated datasets such as ImageNet has conventionally been the norm for achieving high-performing models, recent advancements in self-supervised learning (SSL) have demonstrated superior performance. In medical image analysis, in-domain SSL pretraining has already been shown to outperform ImageNet-based initialization. Although unlabeled data in the field of surgical computer vision is abundant, the diversity within this data is limited. This study investigates the role of dataset diversity in SSL for surgical computer vision, comparing procedure-specific datasets against a more heterogeneous general surgical dataset across three different downstream surgical applications. The obtained results show that using solely procedure-specific data can lead to substantial improvements of 13.8%, 9.5%, and 36.8% compared to ImageNet pretraining. However, extending this data with more heterogeneous surgical data further increases performance by an additional 5.0%, 5.2%, and 2.5%, suggesting that increasing diversity within SSL data is beneficial for model performance. The code and pretrained model weights are made publicly available at https://github.com/TimJaspers0801/SurgeNet.


Pages: 43-53

Page count: 11

