Deep Reinforcement Learning for Guidewire Navigation in Coronary Artery Phantom

被引:20
作者
Kweon, Jihoon [1 ]
Kim, Kyunghwan [2 ]
Lee, Chaehyuk [2 ]
Kwon, Hwi [2 ]
Park, Jinwoo [2 ]
Song, Kyoseok [2 ]
Kim, Young In [3 ]
Park, Jeeone [3 ]
Back, Inwook [4 ]
Roh, Jae-Hyung [5 ]
Moon, Youngjin [1 ]
Choi, Jaesoon [6 ]
Kim, Young-Hak [4 ]
机构
[1] Univ Ulsan, Coll Med, Asan Med Ctr, Dept Convergence Med, Seoul 05505, South Korea
[2] Medipixel Inc, Seoul 04037, South Korea
[3] Univ Ulsan, Coll Med, Asan Med Inst Convergence Sci & Technol, Dept Med Sci,Asan Med Ctr, Seoul 05505, South Korea
[4] Univ Ulsan, Coll Med, Asan Med Ctr, Div Cardiol,Dept Internal Med, Seoul 05505, South Korea
[5] Chungnam Natl Univ, Sch Med, Sejong Hosp, Dept Cardiol Internal Med, Daejeon 30099, South Korea
[6] Univ Ulsan, Coll Med, Asan Med Ctr, Dept Biomed Engn, Seoul 05505, South Korea
关键词
Training; Robots; Navigation; Phantoms; Arteries; Lesions; Reinforcement learning; Coronary intervention; guidewire navigation; reinforcement learning; INTERVENTION; FEASIBILITY; SAFETY;
D O I
10.1109/ACCESS.2021.3135277
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In percutaneous intervention for treatment of coronary plaques, guidewire navigation is a primary procedure for stent delivery. Steering a flexible guidewire within coronary arteries requires considerable training, and the non-linearity between the control operation and the movement of the guidewire makes precise manipulation difficult. Here, we introduce a deep reinforcement learning (RL) framework for autonomous guidewire navigation in a robot-assisted coronary intervention. Using Rainbow, a segment-wise learning approach is applied to determine how best to accelerate training using human demonstrations, transfer learning, and weight initialization. 'State' for RL is customized as a focus window near the guidewire tip, and subgoals are placed to mitigate a sparse reward problem. The RL agent improves performance, eventually enabling the guidewire to reach all valid targets in 'stable' phase. For the last 300 out of 1000 episodes, the success rates of the guidewire navigation to the distal-main and side targets were 98% and 99% in 2D and 3D phantoms, respectively. Our framework opens a new direction in the automation of robot-assisted intervention, providing guidance on RL in physical spaces involving mechanical fatigue.
引用
收藏
页码:166409 / 166422
页数:14
相关论文
共 50 条
  • [41] Deep reinforcement learning for map-less goal-driven robot navigation
    Dobrevski, Matej
    Skocaj, Danijel
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2021, 18 (01):
  • [42] Transformable Gaussian Reward Function for Socially Aware Navigation Using Deep Reinforcement Learning
    Kim, Jinyeob
    Kang, Sumin
    Yang, Sungwoo
    Kim, Beomjoon
    Yura, Jargalbaatar
    Kim, Donghan
    SENSORS, 2024, 24 (14)
  • [43] Real-Time Object Navigation With Deep Neural Networks and Hierarchical Reinforcement Learning
    Staroverov, Aleksey
    Yudin, Dmitry A.
    Belkin, Ilya
    Adeshkin, Vasily
    Solomentsev, Yaroslav K.
    Panov, Aleksandr I.
    IEEE ACCESS, 2020, 8 : 195608 - 195621
  • [44] A topological reinforcement learning agent for navigation
    Arthur P. S. Braga
    Aluízio F. R. Araújo
    Neural Computing & Applications, 2003, 12 : 220 - 236
  • [45] Incremental Learning for Autonomous Navigation of Mobile Robots based on Deep Reinforcement Learning
    Manh Luong
    Cuong Pham
    Journal of Intelligent & Robotic Systems, 2021, 101
  • [46] Incremental Learning for Autonomous Navigation of Mobile Robots based on Deep Reinforcement Learning
    Manh Luong
    Cuong Pham
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 101 (01)
  • [47] Lifelong Federated Reinforcement Learning: A Learning Architecture for Navigation in Cloud Robotic Systems
    Liu, Boyi
    Wang, Lujia
    Liu, Ming
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04) : 4555 - 4562
  • [48] Guidewire navigation in coronary artery stenoses using a novel magnetic navigation system:: First clinical experience
    Tsuchida, K
    García-García, HM
    van der Giessen, WJ
    McFadden, EP
    van der Ent, M
    Sianos, G
    Meulenbrug, H
    Ong, ATL
    Serruys, PW
    CATHETERIZATION AND CARDIOVASCULAR INTERVENTIONS, 2006, 67 (03) : 356 - 363
  • [49] Robot Mapless Navigation in VUCA Environments via Deep Reinforcement Learning
    Xue, Bingxin
    Zhou, Fengyu
    Wang, Chaoqun
    Gao, Ming
    Yin, Lei
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2025, 72 (01) : 639 - 649
  • [50] Effective Deep Reinforcement Learning Setups for Multiple Goals on Visual Navigation
    Takeshi Horita, Luiz Ricardo
    Wolf, Denis Fernando
    Grassi Junior, Valdir
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,