Robotic Scene Segmentation with Memory Network for Runtime Surgical Context Inference

被引:0
|
作者
Li, Zongyu [1 ]
Reyes, Ian [2 ,3 ]
Alemzadeh, Homa [1 ]
机构
[1] Univ Virginia, Dept Elect & Comp Engn, Charlottesville, VA 22903 USA
[2] Univ Virginia, Dept Comp Sci, Charlottesville, VA 22903 USA
[3] IBM Corp, New York, NY USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/IROS55552.2023.10342013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Surgical context inference has recently garnered significant attention in robot-assisted surgery as it can facilitate workflow analysis, skill assessment, and error detection. However, runtime context inference is challenging since it requires timely and accurate detection of the interactions among the tools and objects in the surgical scene based on the segmentation of video data. On the other hand, existing state-of-the-art video segmentation methods are often biased against infrequent classes and fail to provide temporal consistency for segmented masks. This can negatively impact the context inference and accurate detection of critical states. In this study, we propose a solution to these challenges using a Space-Time Correspondence Network (STCN). STCN is a memory network that performs binary segmentation and minimizes the effects of class imbalance. The use of a memory bank in STCN allows for the utilization of past image and segmentation information, thereby ensuring consistency of the masks. Our experiments using the publicly-available JIGSAWS dataset demonstrate that STCN achieves superior segmentation performance for objects that are difficult to segment, such as needle and thread, and improves context inference compared to the state-of-the-art. We also demonstrate that segmentation and context inference can be performed at runtime without compromising performance.
引用
收藏
页码:6601 / 6607
页数:7
相关论文
共 50 条
  • [31] Scene Graph Inference via Multi-Scale Context Modeling
    Xu, Ning
    Liu, An-An
    Wong, Yongkang
    Nie, Weizhi
    Su, Yuting
    Kankanhalli, Mohan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (03) : 1031 - 1041
  • [32] ACENet: Auxiliary Context-Information Enhancement Network for RGB-D Indoor Scene Semantic Segmentation
    Zhou, Wujie
    Xu, Gao
    Qiang, Fangfang
    Yu, Lu
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (02): : 1125 - 1129
  • [33] MFCANet: A road scene segmentation network based on Multi-Scale feature fusion and context information aggregation
    Wang, Yunfeng
    Zhou, Yi
    Wu, Hao
    Liu, Xiyu
    Zhai, Xiaodi
    Sun, Kuizhi
    Tian, Chengliang
    Zhao, Haixia
    Li, Tao
    Jia, Wenguang
    Zhang, Yan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
  • [34] Boundary semantic interactive aggregation network for scene segmentation
    Zhang, Fan
    Lv, Qijun
    Pan, Binrong
    Wang, Yun
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 272
  • [35] Scene segmentation in video sequences by an RPCL neural network
    Chiarantoni, E
    Di Lecce, V
    Guerriero, A
    IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, 1998, : 1877 - 1882
  • [36] A New Feature Pyramid Network For Road Scene Segmentation
    Zhan, Wujing
    Chen, Jiaxing
    Fan, Lei
    Ou, Xinqi
    Chen, Long
    2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 3487 - 3492
  • [37] Visual scene segmentation based on neural oscillator network
    Jin, Z. (jinzhuo1978@hotmail.com), 1600, Editorial Board of Jilin University (43):
  • [38] Chaotic synchronization in general network topology for scene segmentation
    Zhao, Liang
    Cupertino, Thiago H.
    Bertini, Joao R., Jr.
    NEUROCOMPUTING, 2008, 71 (16-18) : 3360 - 3366
  • [39] A network of dynamically coupled chaotic maps for scene segmentation
    Zhao, L
    Macau, EEN
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2001, 12 (06): : 1375 - 1385
  • [40] A Hierarchical Feature Extraction Network for Fast Scene Segmentation
    Miao, Liu
    Zhang, Yi
    SENSORS, 2021, 21 (22)