Deep Learning Based Robotic Tool Detection and Articulation Estimation With Spatio-Temporal Layers

被引:69
作者
Colleoni, Emanuele [1 ]
Moccia, Sara [2 ,3 ]
Du, Xiaofei [4 ]
De Momi, Elena [1 ]
Stoyanov, Danail [4 ]
机构
[1] Politecn Milan, Dept Elect Informat & Bioengn, I-20133 Milan, Italy
[2] Univ Politecn Marche, Dept Informat Engn, I-60121 Ancona, Italy
[3] Ist Italian Tencol, Dept Adv Robot, I-3016163 Genoa, Italy
[4] UCL, Wellcome EPSRC Ctr Intervent & Surg Sci, UCL Robot Inst, London WC1E 6BT, England
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2019年 / 4卷 / 03期
基金
英国工程与自然科学研究理事会;
关键词
Surgical-tool detection; medical robotics; computer assisted interventions; minimally invasive surgery; surgical vision;
D O I
10.1109/LRA.2019.2917163
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Surgical-tool joint detection from laparoscopic images is an important but challenging task in computer-assisted minimally invasive surgery. Illumination levels, variations in background and the different number of tools in the field of view, all pose difficulties to algorithm and model training. Yet, such challenges could be potentially tackled by exploiting the temporal information in laparoscopic videos to avoid per frame handling of the problem. In this letter, we propose a novel encoder-decoder architecture for surgical instrument joint detection and localization that uses three-dimensional convolutional layers to exploit spatio-temporal features from laparoscopic videos. When tested on benchmark and custom-built datasets, a median Dice similarity coefficient of 85.1% with an interquartile range of 4.6% highlights performance better than the state of the art based on single-frame processing. Alongside novelty of the network architecture, the idea for inclusion of temporal information appears to be particularly useful when processing images with unseen backgrounds during the training phase, which indicates that spatio-temporal features for joint detection help to generalize the solution.
引用
收藏
页码:2714 / 2721
页数:8
相关论文
共 27 条
  • [1] 3-D Pose Estimation of Articulated Instruments in Robotic Minimally Invasive Surgery
    Allan, M.
    Ourselin, S.
    Hawkes, D. J.
    Kelly, J. D.
    Stoyanov, D.
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2018, 37 (05) : 1204 - 1213
  • [2] Allan M., 2019, 190206426 ARXIV
  • [3] [Anonymous], J MACH LEARN RES
  • [4] [Anonymous], 2015, ARXIV PREPRINT ARXIV
  • [5] [Anonymous], 2008, MED ROBOTICS
  • [6] [Anonymous], 2017, P IEEE C COMP VIS PA
  • [7] [Anonymous], ADADELTA: An Adaptive Learning Rate Method
  • [8] [Anonymous], 2016, P ADV NEUR INF PROC
  • [9] Azimian H, 2010, P IEEE RAS-EMBS INT, P650, DOI 10.1109/BIOROB.2010.5627985
  • [10] Bodenstedt S., 2018, ARXIV180502475