Robotic Instrument Segmentation With Image-to-Image Translation

被引:21
|
作者
Colleoni, Emanuele [1 ]
Stoyanov, Danail [1 ]
机构
[1] Univ Coll London UCL, Wellcome EPSRC Ctr Intervent & Surg Sci WEISS, London W1W 7TS, England
基金
欧盟地平线“2020”; 英国工程与自然科学研究理事会;
关键词
Image segmentation; Gallium nitride; Feature extraction; Instruments; Robots; Generative adversarial networks; Data models; Medical robots and systems; deep learning methods; image-to-image translation; surgical robot simulators; surgical tool segmentation;
D O I
10.1109/LRA.2021.3056354
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The semantic segmentation of robotic surgery video and the delineation of robotic instruments are important for enabling automation. Despite major recent progresses, the majority of the latest deep learning models for instrument detection and segmentation rely on large datasets with ground truth labels. While demonstrating the capability, reliance on large labelled data is a problem for practical applications because systems would need to be re-trained on domain variations such as procedure type or instrument sets. In this letter, we propose to alleviate this problem by training deep learning models on datasets that are synthesised using image-to-image translation techniques and we investigate different methods to perform this process optimally. Experimentally, we demonstrate that the same deep network architecture for robotic instrument segmentation can be trained on both real data and on our proposed synthetic data without affecting the quality of the output models' performance. We show this for several recent approaches and provide experimental support on publicly available datasets, which highlight the potential value of this approach.
引用
收藏
页码:935 / 942
页数:8
相关论文
共 50 条
  • [21] Unsupervised Image-to-Image Translation: A Review
    Hoyez, Henri
    Schockaert, Cedric
    Rambach, Jason
    Mirbach, Bruno
    Stricker, Didier
    SENSORS, 2022, 22 (21)
  • [22] Implicit pairs for boosting unpaired image-to-image translation
    Ginger, Yiftach
    Danon, Dov
    Averbuch-Elor, Hadar
    Cohen-Or, Daniel
    VISUAL INFORMATICS, 2020, 4 (04): : 50 - 58
  • [23] Multimodal Unsupervised Image-to-Image Translation
    Huang, Xun
    Liu, Ming-Yu
    Belongie, Serge
    Kautz, Jan
    COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 179 - 196
  • [24] CoPrGAN: Image-to-Image Translation via Content Preservation
    Yu, Xiaoming
    Zhou, Gan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT III, 2022, 13531 : 37 - 49
  • [25] Guided Image Weathering using Image-to-Image Translation
    Chen, Yu
    Shen, I-Chao
    Chen, Bing-Yu
    PROCEEDINGS OF SIGGRAPH ASIA 2021 TECHNICAL COMMUNICATIONS, 2021,
  • [26] A novel framework for image-to-image translation and image compression
    Yang, Fei
    Wang, Yaxing
    Herranz, Luis
    Cheng, Yongmei
    Mozerov, Mikhail G.
    NEUROCOMPUTING, 2022, 508 : 58 - 70
  • [27] SSIS-Seg: Simulation-Supervised Image Synthesis for Surgical Instrument Segmentation
    Colleoni, Emanuele
    Psychogyios, Dimitris
    Van Amsterdam, Beatrice
    Vasconcelos, Francisco
    Stoyanov, Danail
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (11) : 3074 - 3086
  • [28] Unsupervised Image-to-Image Translation with Self-Attention Networks
    Kang, Taewon
    Lee, Kwang Hee
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 102 - 108
  • [29] Spatial-Intensity Transforms for Medical Image-to-Image Translation
    Wang, Clinton J.
    Rost, Natalia S.
    Golland, Polina
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (11) : 3362 - 3373
  • [30] Guided Image-to-Image Translation by Discriminator-Generator Communication
    Cao, Yuanjiang
    Yao, Lina
    Pan, Le
    Sheng, Quan Z.
    Chang, Xiaojun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1528 - 1538