Robotic Scene Segmentation with Memory Network for Runtime Surgical Context Inference

被引:0
|
作者
Li, Zongyu [1 ]
Reyes, Ian [2 ,3 ]
Alemzadeh, Homa [1 ]
机构
[1] Univ Virginia, Dept Elect & Comp Engn, Charlottesville, VA 22903 USA
[2] Univ Virginia, Dept Comp Sci, Charlottesville, VA 22903 USA
[3] IBM Corp, New York, NY USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/IROS55552.2023.10342013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Surgical context inference has recently garnered significant attention in robot-assisted surgery as it can facilitate workflow analysis, skill assessment, and error detection. However, runtime context inference is challenging since it requires timely and accurate detection of the interactions among the tools and objects in the surgical scene based on the segmentation of video data. On the other hand, existing state-of-the-art video segmentation methods are often biased against infrequent classes and fail to provide temporal consistency for segmented masks. This can negatively impact the context inference and accurate detection of critical states. In this study, we propose a solution to these challenges using a Space-Time Correspondence Network (STCN). STCN is a memory network that performs binary segmentation and minimizes the effects of class imbalance. The use of a memory bank in STCN allows for the utilization of past image and segmentation information, thereby ensuring consistency of the masks. Our experiments using the publicly-available JIGSAWS dataset demonstrate that STCN achieves superior segmentation performance for objects that are difficult to segment, such as needle and thread, and improves context inference compared to the state-of-the-art. We also demonstrate that segmentation and context inference can be performed at runtime without compromising performance.
引用
收藏
页码:6601 / 6607
页数:7
相关论文
共 50 条
  • [1] StreoScenNet: Surgical Stereo Robotic Scene segmentation
    Mohammed, Ahmed
    Yildirim, Sule
    Farup, Ivar
    Pedersen, Marius
    Hovde, Oistein
    MEDICAL IMAGING 2019: IMAGE-GUIDED PROCEDURES, ROBOTIC INTERVENTIONS, AND MODELING, 2019, 10951
  • [2] LSKANet: Long Strip Kernel Attention Network for Robotic Surgical Scene Segmentation
    Liu, Min
    Han, Yubin
    Wang, Jiazheng
    Wang, Can
    Wang, Yaonan
    Meijering, Erik
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (04) : 1308 - 1322
  • [3] Branch Aggregation Attention Network for Robotic Surgical Instrument Segmentation
    Shen, Wenting
    Wang, Yaonan
    Liu, Min
    Wang, Jiazheng
    Ding, Renjie
    Zhang, Zhe
    Meijering, Erik
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (11) : 3408 - 3419
  • [4] INFERENCE WITH MODEL UNCERTAINTY ON INDOOR SCENE FOR SEMANTIC SEGMENTATION
    Isobe, Shuya
    Arai, Shuichi
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1170 - 1174
  • [5] Scene segmentation of the chaotic oscillator network
    Zhao, L
    Macau, EEN
    Omar, N
    INTERNATIONAL JOURNAL OF BIFURCATION AND CHAOS, 2000, 10 (07): : 1697 - 1708
  • [6] Dual Attention Network for Scene Segmentation
    Fu, Jun
    Liu, Jing
    Tian, Haijie
    Li, Yong
    Bao, Yongjun
    Fang, Zhiwei
    Lu, Hanqing
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
  • [7] CFANet: Context fusing attentional network for preoperative CT image segmentation in robotic surgery
    Lin, Yao
    Wang, Jiazheng
    Liu, Qinghao
    Zhang, Kang
    Liu, Min
    Wang, Yaonan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 171
  • [8] Two forms of scene memory guide visual search: Memory for scene context and memory for the binding of target object to scene location
    Hollingworth, Andrew
    VISUAL COGNITION, 2009, 17 (1-2) : 273 - 291
  • [9] Adaptive Context Network for Scene Parsing
    Fu, Jun
    Liu, Jing
    Wang, Yuhang
    Li, Yong
    Bao, Yongjun
    Tang, Jinhui
    Lu, Hanqing
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6747 - 6756
  • [10] Efficient runtime adaptability to support context-awareness in a robotic framework
    Garcia, Miguel
    Ortin, Francisco
    JOURNAL OF AMBIENT INTELLIGENCE AND SMART ENVIRONMENTS, 2014, 6 (01) : 107 - 109