Deep Reinforcement Learning for Guidewire Navigation in Coronary Artery Phantom

被引：20

作者：

Kweon, Jihoon ^{[1
]}

Kim, Kyunghwan ^{[2
]}

Lee, Chaehyuk ^{[2
]}

Kwon, Hwi ^{[2
]}

Park, Jinwoo ^{[2
]}

Song, Kyoseok ^{[2
]}

Kim, Young In ^{[3
]}

Park, Jeeone ^{[3
]}

Back, Inwook ^{[4
]}

Roh, Jae-Hyung ^{[5
]}

Moon, Youngjin ^{[1
]}

Choi, Jaesoon ^{[6
]}

Kim, Young-Hak ^{[4
]}

机构：

[1] Univ Ulsan, Coll Med, Asan Med Ctr, Dept Convergence Med, Seoul 05505, South Korea

[2] Medipixel Inc, Seoul 04037, South Korea

[3] Univ Ulsan, Coll Med, Asan Med Inst Convergence Sci & Technol, Dept Med Sci,Asan Med Ctr, Seoul 05505, South Korea

[4] Univ Ulsan, Coll Med, Asan Med Ctr, Div Cardiol,Dept Internal Med, Seoul 05505, South Korea

[5] Chungnam Natl Univ, Sch Med, Sejong Hosp, Dept Cardiol Internal Med, Daejeon 30099, South Korea

[6] Univ Ulsan, Coll Med, Asan Med Ctr, Dept Biomed Engn, Seoul 05505, South Korea

来源：

IEEE ACCESS | 2021年 / 9卷

关键词：

Training; Robots; Navigation; Phantoms; Arteries; Lesions; Reinforcement learning; Coronary intervention; guidewire navigation; reinforcement learning; INTERVENTION; FEASIBILITY; SAFETY;

D O I：

10.1109/ACCESS.2021.3135277

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In percutaneous intervention for treatment of coronary plaques, guidewire navigation is a primary procedure for stent delivery. Steering a flexible guidewire within coronary arteries requires considerable training, and the non-linearity between the control operation and the movement of the guidewire makes precise manipulation difficult. Here, we introduce a deep reinforcement learning (RL) framework for autonomous guidewire navigation in a robot-assisted coronary intervention. Using Rainbow, a segment-wise learning approach is applied to determine how best to accelerate training using human demonstrations, transfer learning, and weight initialization. 'State' for RL is customized as a focus window near the guidewire tip, and subgoals are placed to mitigate a sparse reward problem. The RL agent improves performance, eventually enabling the guidewire to reach all valid targets in 'stable' phase. For the last 300 out of 1000 episodes, the success rates of the guidewire navigation to the distal-main and side targets were 98% and 99% in 2D and 3D phantoms, respectively. Our framework opens a new direction in the automation of robot-assisted intervention, providing guidance on RL in physical spaces involving mechanical fatigue.

引用

页码：166409 / 166422

页数：14

共 50 条

[31] Environment Exploration for Mapless Navigation based on Deep Reinforcement Learning
Toan, Nguyen Duc
Gon-Woo, Kim
2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 17 - 20
[32] Autonomous Visual Navigation using Deep Reinforcement Learning: An Overview
Ejaz, Muhammad Mudassir
Tang, Tong Boon
Lu, Cheng-Kai
2019 17TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2019, : 294 - 299
[33] Robotic Navigation with Human Brain Signals and Deep Reinforcement Learning
Lin, Chaohao
Hasan, S. M. Shafiul
Bai, Ou
2021 4TH INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION ENGINEERING (RCAE 2021), 2021, : 278 - 283
[34] Deep-Sarsa: A reinforcement learning algorithm for autonomous navigation
Andrecut, M
Ali, MK
INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2001, 12 (10): : 1513 - 1523
[35] Mapless Navigation for Autonomous Robots: A Deep Reinforcement Learning Approach
Zhang, Pengpeng
Wei, Changyun
Cai, Boliang
Ouyang, Yongping
2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3141 - 3146
[36] Oracle-Guided Deep Reinforcement Learning for Large-Scale Multi-UAVs Flocking and Navigation
Wang, Wen
Wang, Liang
Wu, Junfeng
Tao, Xianping
Wu, Haijun
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (10) : 10280 - 10292
[37] Inertial Navigation Compensation with Reinforcement Learning
Bozeman, Eric
Minhdao Nguyen
Alam, Mohammad
Onners, Jeffrey
2022 9TH IEEE INTERNATIONAL SYMPOSIUM ON INERTIAL SENSORS AND SYSTEMS (IEEE INERTIAL 2022), 2022,
[38] Reinforcement Learning in Navigation and Cooperative Mapping
Cruz, Jose Aleixo
Cardoso, Henrique Lopes
Reis, Luis Paulo
Sousa, Armando
2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2020), 2020, : 200 - 205
[39] A topological reinforcement learning agent for navigation
Braga, APS
Araújo, AFR
NEURAL COMPUTING & APPLICATIONS, 2003, 12 (3-4) : 220 - 236
[40] A topological reinforcement learning agent for navigation
Arthur P. S. Braga
Aluízio F. R. Araújo
Neural Computing & Applications, 2003, 12 : 220 - 236

← 1 2 3 4 5 →