Path Planning for Multi-Arm Manipulators Using Soft Actor-Critic Algorithm with Position Prediction of Moving Obstacles via LSTM

被引:14
作者
Park, Kwan-Woo [1 ]
Kim, MyeongSeop [1 ,2 ]
Kim, Jung-Su [1 ]
Park, Jae-Han [2 ]
机构
[1] Seoul Natl Univ Sci & Technol, Res Ctr Elect & Informat Technol, Dept Elect & Informat Engn, Seoul 01811, South Korea
[2] Korea Inst Ind Technol KITECH, Appl Robot R&D Dept, Ansan 15588, South Korea
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 19期
基金
新加坡国家研究基金会;
关键词
path planning; multi-arm manipulators; moving obstacles; reinforcement learning; soft actor-critic (SAC); hindsight experience replay (HER); collision avoidance; long short-term memory (LSTM);
D O I
10.3390/app12199837
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper presents a deep reinforcement learning-based path planning algorithm for the multi-arm robot manipulator when there are both fixed and moving obstacles in the workspace. Considering the problem properties such as high dimensionality and continuous action, the proposed algorithm employs the SAC (soft actor-critic). Moreover, in order to predict explicitly the future position of the moving obstacle, LSTM (long short-term memory) is used. The SAC-based path planning algorithm is developed using the LSTM. In order to show the performance of the proposed algorithm, simulation results using GAZEBO and experimental results using real manipulators are presented. The simulation and experiment results show that the success ratio of path generation for arbitrary starting and goal points converges to 100%. It is also confirmed that the LSTM successfully predicts the future position of the obstacle.
引用
收藏
页数:20
相关论文
共 52 条
[1]   Accurate photovoltaic power forecasting models using deep LSTM-RNN [J].
Abdel-Nasser, Mohamed ;
Mahmoud, Karar .
NEURAL COMPUTING & APPLICATIONS, 2019, 31 (07) :2727-2740
[2]  
Abdolmaleki A., 2018, INT C LEARN REPR
[3]  
Andrychowicz M., 2017, Proceedings of the Advances in Neural Information Processing Systems, V30, P1
[4]  
[Anonymous], 1997, J GRAPH TOOLS, DOI DOI 10.1080/10867651.1997.10487480
[5]  
Arents J., 2022, Artificial Intelligence for Digitising Industry, P205
[6]   Human-Robot Collaboration Trends and Safety Aspects: A Systematic Review [J].
Arents, Janis ;
Abolins, Valters ;
Judvaitis, Janis ;
Vismanis, Oskars ;
Oraby, Aly ;
Ozols, Kaspars .
JOURNAL OF SENSOR AND ACTUATOR NETWORKS, 2021, 10 (03)
[7]   ROBOT MOTION PLANNING - A DISTRIBUTED REPRESENTATION APPROACH [J].
BARRAQUAND, J ;
LATOMBE, JC .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 1991, 10 (06) :628-649
[8]  
Barth-Maron G., 2018, P INT C LEARN REPR V
[9]   Evaluation of automatic guided vehicle systems [J].
Berman, Sigal ;
Schechtman, Edna ;
Edan, Yael .
ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2009, 25 (03) :522-528
[10]   SIMULATED ANNEALING [J].
BERTSIMAS, D ;
TSITSIKLIS, J .
STATISTICAL SCIENCE, 1993, 8 (01) :10-15