Deep Reinforcement Learning for Guidewire Navigation in Coronary Artery Phantom

被引:20
作者
Kweon, Jihoon [1 ]
Kim, Kyunghwan [2 ]
Lee, Chaehyuk [2 ]
Kwon, Hwi [2 ]
Park, Jinwoo [2 ]
Song, Kyoseok [2 ]
Kim, Young In [3 ]
Park, Jeeone [3 ]
Back, Inwook [4 ]
Roh, Jae-Hyung [5 ]
Moon, Youngjin [1 ]
Choi, Jaesoon [6 ]
Kim, Young-Hak [4 ]
机构
[1] Univ Ulsan, Coll Med, Asan Med Ctr, Dept Convergence Med, Seoul 05505, South Korea
[2] Medipixel Inc, Seoul 04037, South Korea
[3] Univ Ulsan, Coll Med, Asan Med Inst Convergence Sci & Technol, Dept Med Sci,Asan Med Ctr, Seoul 05505, South Korea
[4] Univ Ulsan, Coll Med, Asan Med Ctr, Div Cardiol,Dept Internal Med, Seoul 05505, South Korea
[5] Chungnam Natl Univ, Sch Med, Sejong Hosp, Dept Cardiol Internal Med, Daejeon 30099, South Korea
[6] Univ Ulsan, Coll Med, Asan Med Ctr, Dept Biomed Engn, Seoul 05505, South Korea
关键词
Training; Robots; Navigation; Phantoms; Arteries; Lesions; Reinforcement learning; Coronary intervention; guidewire navigation; reinforcement learning; INTERVENTION; FEASIBILITY; SAFETY;
D O I
10.1109/ACCESS.2021.3135277
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In percutaneous intervention for treatment of coronary plaques, guidewire navigation is a primary procedure for stent delivery. Steering a flexible guidewire within coronary arteries requires considerable training, and the non-linearity between the control operation and the movement of the guidewire makes precise manipulation difficult. Here, we introduce a deep reinforcement learning (RL) framework for autonomous guidewire navigation in a robot-assisted coronary intervention. Using Rainbow, a segment-wise learning approach is applied to determine how best to accelerate training using human demonstrations, transfer learning, and weight initialization. 'State' for RL is customized as a focus window near the guidewire tip, and subgoals are placed to mitigate a sparse reward problem. The RL agent improves performance, eventually enabling the guidewire to reach all valid targets in 'stable' phase. For the last 300 out of 1000 episodes, the success rates of the guidewire navigation to the distal-main and side targets were 98% and 99% in 2D and 3D phantoms, respectively. Our framework opens a new direction in the automation of robot-assisted intervention, providing guidance on RL in physical spaces involving mechanical fatigue.
引用
收藏
页码:166409 / 166422
页数:14
相关论文
共 50 条
  • [31] Environment Exploration for Mapless Navigation based on Deep Reinforcement Learning
    Toan, Nguyen Duc
    Gon-Woo, Kim
    2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 17 - 20
  • [32] Autonomous Visual Navigation using Deep Reinforcement Learning: An Overview
    Ejaz, Muhammad Mudassir
    Tang, Tong Boon
    Lu, Cheng-Kai
    2019 17TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2019, : 294 - 299
  • [33] Robotic Navigation with Human Brain Signals and Deep Reinforcement Learning
    Lin, Chaohao
    Hasan, S. M. Shafiul
    Bai, Ou
    2021 4TH INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION ENGINEERING (RCAE 2021), 2021, : 278 - 283
  • [34] Deep-Sarsa: A reinforcement learning algorithm for autonomous navigation
    Andrecut, M
    Ali, MK
    INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2001, 12 (10): : 1513 - 1523
  • [35] Mapless Navigation for Autonomous Robots: A Deep Reinforcement Learning Approach
    Zhang, Pengpeng
    Wei, Changyun
    Cai, Boliang
    Ouyang, Yongping
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3141 - 3146
  • [36] Oracle-Guided Deep Reinforcement Learning for Large-Scale Multi-UAVs Flocking and Navigation
    Wang, Wen
    Wang, Liang
    Wu, Junfeng
    Tao, Xianping
    Wu, Haijun
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (10) : 10280 - 10292
  • [37] Inertial Navigation Compensation with Reinforcement Learning
    Bozeman, Eric
    Minhdao Nguyen
    Alam, Mohammad
    Onners, Jeffrey
    2022 9TH IEEE INTERNATIONAL SYMPOSIUM ON INERTIAL SENSORS AND SYSTEMS (IEEE INERTIAL 2022), 2022,
  • [38] Reinforcement Learning in Navigation and Cooperative Mapping
    Cruz, Jose Aleixo
    Cardoso, Henrique Lopes
    Reis, Luis Paulo
    Sousa, Armando
    2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2020), 2020, : 200 - 205
  • [39] A topological reinforcement learning agent for navigation
    Braga, APS
    Araújo, AFR
    NEURAL COMPUTING & APPLICATIONS, 2003, 12 (3-4) : 220 - 236
  • [40] A topological reinforcement learning agent for navigation
    Arthur P. S. Braga
    Aluízio F. R. Araújo
    Neural Computing & Applications, 2003, 12 : 220 - 236