Learning intraoperative organ manipulation with context-based reinforcement learning

被引:4
|
作者
D'Ettorre, Claudia [1 ]
Zirino, Silvia [1 ,2 ]
Dei, Neri Niccolo [3 ]
Stilli, Agostino [1 ]
De Momi, Elena [2 ]
Stoyanov, Danail [1 ]
机构
[1] UCL, Wellcome EPSRC Ctr Int & Surg Sci WEISS, London, England
[2] Politecn Milan, Dept Elect Informat & Bioengn NearLab, Milan, Italy
[3] Scuola Super Sant Anna, BioRobot Inst, Pisa, Italy
基金
英国工程与自然科学研究理事会;
关键词
Computer-assisted intervention; Robotic surgery; Reinforcement learning; Surgical automation;
D O I
10.1007/s11548-022-02630-2
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Purpose Automation of sub-tasks during robotic surgery is challenging due to the high variability of the surgical scenes intra- and inter-patients. For example, the pick and place task can be executed different times during the same operation and for distinct purposes. Hence, designing automation solutions that can generalise a skill over different contexts becomes hard. All the experiments are conducted using the Pneumatic Attachable Flexible (PAF) rail, a novel surgical tool designed for robotic-assisted intraoperative organ manipulation. Methods We build upon previous open-source surgical Reinforcement Learning (RL) training environment to develop a new RL framework for manipulation skills, rlman. In rlman, contextual RL agents are trained to solve different aspects of the pick and place task using the PAF rail system. rlman is implemented to support both low- and high-dimensional state information to solve surgical sub-tasks in a simulation environment. Results We use rlman to train state of the art RL agents to solve four different surgical sub-tasks involving manipulation skills using the PAF rail. We compare the results with state-of-the-art benchmarks found in the literature. We evaluate the ability of the agent to be able to generalise over different aspects of the targeted surgical environment. Conclusion We have shown that the rlman framework can support the training of different RL algorithms for solving surgical sub-task, analysing the importance of context information for generalisation capabilities. We are aiming to deploy the trained policy on the real da Vinci using the dVRK and show that the generalisation of the trained policy can be transferred to the real world.
引用
收藏
页码:1419 / 1427
页数:9
相关论文
共 50 条
  • [1] Learning intraoperative organ manipulation with context-based reinforcement learning
    Claudia D’Ettorre
    Silvia Zirino
    Neri Niccolò Dei
    Agostino Stilli
    Elena De Momi
    Danail Stoyanov
    International Journal of Computer Assisted Radiology and Surgery, 2022, 17 : 1419 - 1427
  • [2] Multi-Task Reinforcement Learning with Context-based Representations
    Sodhani, Shagun
    Zhang, Amy
    Pineau, Joelle
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [3] Semantic Locality and Context-based Prefetching Using Reinforcement Learning
    Peled, Leeor
    Mannor, Shie
    Weiser, Uri
    Etsion, Yoav
    2015 ACM/IEEE 42ND ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2015, : 285 - 297
  • [4] Context-Based Meta-Reinforcement Learning With Bayesian Nonparametric Models
    Bing, Zhenshan
    Yun, Yuqi
    Huang, Kai
    Knoll, Alois
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (10) : 6948 - 6965
  • [5] Learning in context: enhancing machine learning with context-based reasoning
    Gary Stein
    Avelino J. Gonzalez
    Applied Intelligence, 2014, 41 : 709 - 724
  • [6] Learning in context: enhancing machine learning with context-based reasoning
    Stein, Gary
    Gonzalez, Avelino J.
    APPLIED INTELLIGENCE, 2014, 41 (03) : 709 - 724
  • [7] A context-based meta-reinforcement learning approach to efficient hyperparameter optimization
    Liu, Xiyuan
    Wu, Jia
    Chen, Senpeng
    NEUROCOMPUTING, 2022, 478 : 89 - 103
  • [8] Context-based statistical relational learning
    Tian, Yonghong
    AI COMMUNICATIONS, 2006, 19 (03) : 291 - 293
  • [9] A Context-Based Adaptation In Mobile Learning
    Alila, F. Soualah
    Mendes, F.
    Nicolle, C.
    BULLETIN OF THE TECHNICAL COMMITTEE ON LEARNING TECHNOLOGY, 2013, 15 (04): : 14 - 18
  • [10] Context-based learning for autonomous vehicles
    Peixoto, Maria J. P.
    Azim, Akramul
    2020 IEEE 23RD INTERNATIONAL SYMPOSIUM ON REAL-TIME DISTRIBUTED COMPUTING (ISORC 2020), 2020, : 150 - 151