Robotic Instrument Segmentation With Image-to-Image Translation

被引：21

作者：

Colleoni, Emanuele ^{[1
]}

Stoyanov, Danail ^{[1
]}

机构：

[1] Univ Coll London UCL, Wellcome EPSRC Ctr Intervent & Surg Sci WEISS, London W1W 7TS, England

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2021年 / 6卷 / 02期

基金：

欧盟地平线“2020”; 英国工程与自然科学研究理事会;

关键词：

Image segmentation; Gallium nitride; Feature extraction; Instruments; Robots; Generative adversarial networks; Data models; Medical robots and systems; deep learning methods; image-to-image translation; surgical robot simulators; surgical tool segmentation;

D O I：

10.1109/LRA.2021.3056354

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

The semantic segmentation of robotic surgery video and the delineation of robotic instruments are important for enabling automation. Despite major recent progresses, the majority of the latest deep learning models for instrument detection and segmentation rely on large datasets with ground truth labels. While demonstrating the capability, reliance on large labelled data is a problem for practical applications because systems would need to be re-trained on domain variations such as procedure type or instrument sets. In this letter, we propose to alleviate this problem by training deep learning models on datasets that are synthesised using image-to-image translation techniques and we investigate different methods to perform this process optimally. Experimentally, we demonstrate that the same deep network architecture for robotic instrument segmentation can be trained on both real data and on our proposed synthetic data without affecting the quality of the output models' performance. We show this for several recent approaches and provide experimental support on publicly available datasets, which highlight the potential value of this approach.

引用

页码：935 / 942

页数：8

共 50 条

[21] Unsupervised Image-to-Image Translation: A Review
Hoyez, Henri
Schockaert, Cedric
Rambach, Jason
Mirbach, Bruno
Stricker, Didier
SENSORS, 2022, 22 (21)
[22] Implicit pairs for boosting unpaired image-to-image translation
Ginger, Yiftach
Danon, Dov
Averbuch-Elor, Hadar
Cohen-Or, Daniel
VISUAL INFORMATICS, 2020, 4 (04): : 50 - 58
[23] Multimodal Unsupervised Image-to-Image Translation
Huang, Xun
Liu, Ming-Yu
Belongie, Serge
Kautz, Jan
COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 179 - 196
[24] CoPrGAN: Image-to-Image Translation via Content Preservation
Yu, Xiaoming
Zhou, Gan
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT III, 2022, 13531 : 37 - 49
[25] Guided Image Weathering using Image-to-Image Translation
Chen, Yu
Shen, I-Chao
Chen, Bing-Yu
PROCEEDINGS OF SIGGRAPH ASIA 2021 TECHNICAL COMMUNICATIONS, 2021,
[26] A novel framework for image-to-image translation and image compression
Yang, Fei
Wang, Yaxing
Herranz, Luis
Cheng, Yongmei
Mozerov, Mikhail G.
NEUROCOMPUTING, 2022, 508 : 58 - 70
[27] SSIS-Seg: Simulation-Supervised Image Synthesis for Surgical Instrument Segmentation
Colleoni, Emanuele
Psychogyios, Dimitris
Van Amsterdam, Beatrice
Vasconcelos, Francisco
Stoyanov, Danail
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (11) : 3074 - 3086
[28] Unsupervised Image-to-Image Translation with Self-Attention Networks
Kang, Taewon
Lee, Kwang Hee
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 102 - 108
[29] Spatial-Intensity Transforms for Medical Image-to-Image Translation
Wang, Clinton J.
Rost, Natalia S.
Golland, Polina
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (11) : 3362 - 3373
[30] Guided Image-to-Image Translation by Discriminator-Generator Communication
Cao, Yuanjiang
Yao, Lina
Pan, Le
Sheng, Quan Z.
Chang, Xiaojun
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1528 - 1538

← 1 2 3 4 5 →