Towards Surgical Context Inference and Translation to Gestures

Authors
Hutchinson, Kay [1]
Li, Zongyu [1]
Reyes, Ian [2,3]
Alemzadeh, Homa [1]
Affiliations
[1] Univ Virginia, Dept Elect & Comp Engn, Charlottesville, VA 22903 USA
[2] Univ Virginia, Dept Comp Sci, Charlottesville, VA 22903 USA
[3] IBM Corp, Armonk, NY USA
Funding
National Science Foundation (USA)
Keywords
TRAJECTORY SEGMENTATION; RECOGNITION;
DOI
10.1109/ICRA48891.2023.10160383
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Manual labeling of gestures in robot-assisted surgery is labor-intensive, error-prone, and requires expertise or training. We propose a method for automated and explainable generation of gesture transcripts that leverages the abundance of data for image segmentation. Surgical context is detected from segmentation masks by examining the distances and intersections between the tools and objects. Next, context labels are translated into gesture transcripts using knowledge-based Finite State Machine (FSM) and data-driven Long Short-Term Memory (LSTM) models. We evaluate the performance of each stage of our method by comparing the results with the ground truth segmentation masks, the consensus context labels, and the gesture labels in the JIGSAWS dataset. Our results show that our segmentation models achieve state-of-the-art performance in recognizing needle and thread in Suturing, and that we can automatically detect important surgical states with high agreement with crowd-sourced labels (e.g., contact between graspers and objects in Suturing). We also find that the FSM models are more robust to poor segmentation and labeling performance than LSTMs. Our proposed method can significantly shorten the gesture labeling process (by a factor of approximately 2.8).
Pages: 6802-6809
Number of pages: 8
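
The abstract states that surgical context is detected from segmentation masks by examining distances and intersections between tools and objects. The following is a minimal sketch of that idea, not the authors' implementation: it assumes boolean NumPy masks, uses a brute-force pixel distance, and the function names and the `max_gap` threshold are hypothetical choices made for illustration only.

```python
import numpy as np


def masks_intersect(mask_a, mask_b):
    """Return True if two boolean masks share at least one pixel."""
    return bool(np.logical_and(mask_a, mask_b).any())


def min_pixel_distance(mask_a, mask_b):
    """Smallest Euclidean distance (in pixels) between the two masks.

    Brute force over all pixel pairs, so only suitable for small masks.
    Returns infinity if either mask is empty.
    """
    pts_a = np.argwhere(mask_a)  # (Na, 2) array of row/column indices
    pts_b = np.argwhere(mask_b)  # (Nb, 2) array of row/column indices
    if len(pts_a) == 0 or len(pts_b) == 0:
        return float("inf")
    diffs = pts_a[:, None, :] - pts_b[None, :, :]  # (Na, Nb, 2)
    return float(np.sqrt((diffs ** 2).sum(axis=-1)).min())


def in_contact(tool_mask, object_mask, max_gap=2.0):
    """Hypothetical contact rule: masks overlap or lie within max_gap pixels.

    The threshold is an assumption for this sketch, not a value from the paper.
    """
    if masks_intersect(tool_mask, object_mask):
        return True
    return min_pixel_distance(tool_mask, object_mask) <= max_gap


# Toy 8x8 masks standing in for segmentation output.
grasper = np.zeros((8, 8), dtype=bool)
grasper[2:4, 1:3] = True
needle = np.zeros((8, 8), dtype=bool)
needle[2:4, 4:6] = True
thread = np.zeros((8, 8), dtype=bool)
thread[7, 7] = True

grasper_touches_needle = in_contact(grasper, needle)
grasper_touches_thread = in_contact(grasper, thread)
```

In this toy setup the grasper and needle masks do not overlap but sit two pixels apart, so the distance test decides the contact label; a per-frame sequence of such labels is the kind of context signal the abstract says is then translated into gestures.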