Towards Surgical Context Inference and Translation to Gestures

被引:0
|
作者
Hutchinso, Kay [1 ]
Li, Zongyu [1 ]
Reyes, Ian [2 ,3 ]
Alemzadeh, Homa [1 ]
机构
[1] Univ Virginia, Dept Elect & Comp Engn, Charlottesville, VA 22903 USA
[2] Univ Virginia, Dept Comp Sci, Charlottesville, VA 22903 USA
[3] IBM Corp, Armonk, NY USA
基金
美国国家科学基金会;
关键词
TRAJECTORY SEGMENTATION; RECOGNITION;
D O I
10.1109/ICRA48891.2023.10160383
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Manual labeling of gestures in robot-assisted surgery is labor intensive, prone to errors, and requires expertise or training. We propose a method for automated and explainable generation of gesture transcripts that leverages the abundance of data for image segmentation. Surgical context is detected using segmentation masks by examining the distances and intersections between the tools and objects. Next, context labels are translated into gesture transcripts using knowledge-based Finite State Machine (FSM) and data-driven Long Short Term Memory (LSTM) models. We evaluate the performance of each stage of our method by comparing the results with the ground truth segmentation masks, the consensus context labels, and the gesture labels in the JIGSAWS dataset. Our results show that our segmentation models achieve state-of-the-art performance in recognizing needle and thread in Suturing and we can automatically detect important surgical states with high agreement with crowd-sourced labels (e.g., contact between graspers and objects in Suturing). We also find that the FSM models are more robust to poor segmentation and labeling performance than LSTMs. Our proposed method can significantly shorten the gesture labeling process (similar to 2.8 times).
引用
收藏
页码:6802 / 6809
页数:8
相关论文
共 50 条
  • [21] Visual attention towards gestures in conversation
    Holmqvist, K
    Gullberg, M
    CURRENT OCULOMOTOR RESEARCH: PHYSIOLOGICAL AND PSYCHOLOGICAL ASPECTS, 1999, : 305 - 307
  • [22] Translation and context
    Baker, M
    JOURNAL OF PRAGMATICS, 2006, 38 (03) : 317 - 320
  • [23] Context and Translation
    Wang Jiajia
    Zhang Yonggang
    校园英语, 2017, (15) : 241 - 241
  • [24] Translation and Context
    熊茂蓉
    赵君
    环球人文地理, 2014, (12) : 282 - 282
  • [25] Towards Formality-Aware Neural Machine Translation by Leveraging Context Information
    Kim, Dohee
    Baek, Yujin
    Yang, Soyoung
    Choo, Jaegul
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 7384 - 7392
  • [26] Terminological research and machine translation : towards an optimal use of Reverso Context software
    Zemni, Bahia
    Bedjaoui, Wafa
    Elsaadany, Marwa
    TEXTO LIVRE-LINGUAGEM E TECNOLOGIA, 2021, 14 (01):
  • [27] Type Inference in Context
    Gundry, Adam
    McBride, Conor
    McKinna, James
    MSFP 2010: PROCEEDINGS OF THE 2010 ACM SIGPLAN WORKSHOP ON MATHEMATICALLY STRUCTURED FUNCTIONAL PROGRAMMING, 2010, : 43 - 54
  • [28] Towards Semantic Smart Cities: A Study on the Conceptualization and Implementation of Semantic Context Inference Systems
    Lee, Jieun
    Song, Jaeseung
    SENSORS, 2023, 23 (23)
  • [29] The Effect of the Visual Context in the Recognition of Symbolic Gestures
    Villarreal, Mirta F.
    Fridman, Esteban A.
    Leiguarda, Ramon C.
    PLOS ONE, 2012, 7 (02):
  • [30] Extracting static hand gestures in dynamic context
    Burger, Thomas
    Benoit, Alexandre
    Caplier, Andalice
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 2081 - +