Real-time tracking of surgical instruments based on spatio-temporal context and deep learning

被引:24
|
作者
Zhao, Zijian [1 ]
Chen, Zhaorui [1 ]
Voros, Sandrine [2 ]
Cheng, Xiaolin [3 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, 17923 Jingshi Rd, Jinan 250061, Shandong, Peoples R China
[2] Univ Grenoble Alpes, CNRS, INSERM, TIMC IMAG, Grenoble, France
[3] Shandong Univ, Qilu Hosp, Lab Laparoscop Tech & Engn, Jinan, Shandong, Peoples R China
基金
高等学校博士学科点专项科研基金; 中国国家自然科学基金;
关键词
Tool tracking; convolutional neural network; spatial transformer network; spatio-temporal context; CONVOLUTIONAL NEURAL-NETWORKS; 3D TRACKING;
D O I
10.1080/24699322.2018.1560097
中图分类号
R61 [外科手术学];
学科分类号
摘要
Real-time tool tracking in minimally invasive-surgery (MIS) has numerous applications for computer-assisted interventions (CAIs). Visual tracking approaches are a promising solution to real-time surgical tool tracking, however, many approaches may fail to complete tracking when the tracker suffers from issues such as motion blur, adverse lighting, specular reflections, shadows, and occlusions. We propose an automatic real-time method for two-dimensional tool detection and tracking based on a spatial transformer network (STN) and spatio-temporal context (STC). Our method exploits both the ability of a convolutional neural network (CNN) with an in-house trained STN and STC to accurately locate the tool at high speed. Then we compared our method experimentally with other four general of CAIs' visual tracking methods using eight existing online and in-house datasets, covering both in vivo abdominal, cardiac and retinal clinical cases in which different surgical instruments were employed. The experiments demonstrate that our method achieved great performance with respect to the accuracy and the speed. It can track a surgical tool without labels in real time in the most challenging of cases, with an accuracy that is equal to and sometimes surpasses most state-of-the-art tracking algorithms. Further improvements to our method will focus on conditions of occlusion and multi-instruments.
引用
收藏
页码:20 / 29
页数:10
相关论文
共 50 条
  • [1] Surgical Instruments Tracking Based on Deep Learning with Lines Detection and Spatio-Temporal Context
    Chen, Zhaorui
    Zhao, Zijian
    Cheng, Xiaolin
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 2711 - 2714
  • [2] Real-time robust tracking with part-based and spatio-temporal context
    Wei, Yanxia
    Jiang, Zhen
    Xiao, Junfeng
    Xu, Xinli
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2021, 65 (02) : 97 - 109
  • [3] LEARNING SPATIO-TEMPORAL CONVOLUTIONAL NETWORK FOR REAL-TIME OBJECT TRACKING
    Chen, Hanzao
    Xing, Xiaofen
    Xu, Xiangmin
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2153 - 2157
  • [4] Real Time Eye Tracking using Kalman Extended Spatio-Temporal Context Learning
    Munir, Farzeen
    Minhas, Fayyaz ul Amir Afsar
    Jalil, Abdul
    Jeon, Moongu
    SECOND INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2017, 10443
  • [5] A Novel Real-Time Tracking Algorithm Using Spatio-Temporal Context and Color Histogram
    Wu, Yong
    Cai, Zemin
    Lai, Jianhuang
    Yan, Jingwen
    COMPUTER VISION, PT I, 2017, 771 : 128 - 144
  • [6] AO-AutoTrack: Anti-occlusion Real-Time UAV Tracking Based on Spatio-temporal Context
    Chu, Hongyu
    Liao, Kuisheng
    Shao, Yanhua
    Zhang, Xiaoqiang
    Mei, Yanying
    Wu, Yadong
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 256 - 267
  • [7] Federated Learning Based Spatio-Temporal Framework for Real-Time Traffic Prediction
    Kaur, Gaganbir
    Grewal, Surender K.
    Jain, Aarti
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 136 (02) : 849 - 865
  • [8] An effective Object Tracking Based on Spatio-Temporal Context Learning and Hog
    Wang, Zhenhai
    Xu, Bo
    2015 11TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2015, : 661 - 664
  • [9] Adaptive spatio-temporal context learning for visual tracking
    Zhang, Yaqin
    Wang, Liejun
    Qin, Jiwei
    IMAGING SCIENCE JOURNAL, 2019, 67 (03): : 136 - 147
  • [10] Deep learning of spatio-temporal information for visual tracking
    Gwangmin Choe
    Ilmyong Son
    Chunhwa Choe
    Hyoson So
    Hyokchol Kim
    Gyongnam Choe
    Multimedia Tools and Applications, 2022, 81 : 17283 - 17302