Harmonic-Based Optimal Motion Planning in Constrained Workspaces Using Reinforcement Learning

被引:24
作者
Rousseas, Panagiotis [1 ]
Bechlioulis, Charalampos [1 ]
Kyriakopoulos, Kostas J. [1 ]
机构
[1] Natl Tech Univ Athens, Control Syst Lab, Sch Mech Engn, Athens 15780, Greece
关键词
Motion and path planning; optimization and optimal control; reinforcement learning; TIME OBSTACLE AVOIDANCE; NAVIGATION;
D O I
10.1109/LRA.2021.3060711
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this work, we propose a novel reinforcement learning algorithm to solve the optimal motion planning problem. Particular emphasis is given on the rigorous mathematical proof of safety, convergence as well as optimality w.r.t. to an integral quadratic cost function, while reinforcement learning is adopted to enable the cost function's approximation. Both offline and online solutions are proposed, and an implementation of the offline method is compared to a state-of-the-art RRT star approach. This novel approach inherits the strong traits from both artificial potential fields, i.e., reactivity, as well as sampling-based methods, i.e., optimality, and opens up new paths to the age-old problem of motion planning, by merging modern tools and philosophies from various corners of the field.
引用
收藏
页码:2005 / 2011
页数:7
相关论文
共 25 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]  
Bhattacharya S., 2018, ANN MATH ARTIF INTEL
[3]  
Bohlin R., 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065), P521, DOI 10.1109/ROBOT.2000.844107
[4]  
CANNY J, 1988, COMPLEXITY ROBOT MOT
[5]   Locally-optimal multi-robot navigation under delaying disturbances using homotopy constraints [J].
Gregoire, Jean ;
Cap, Michal ;
Frazzoli, Emilio .
AUTONOMOUS ROBOTS, 2018, 42 (04) :895-907
[6]   Sampling-based algorithms for optimal motion planning [J].
Karaman, Sertac ;
Frazzoli, Emilio .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2011, 30 (07) :846-894
[7]   Probabilistic roadmaps for path planning in high-dimensional configuration spaces [J].
Kavraki, LE ;
Svestka, P ;
Latombe, JC ;
Overmars, MH .
IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1996, 12 (04) :566-580
[9]   REAL-TIME OBSTACLE AVOIDANCE USING HARMONIC POTENTIAL FUNCTIONS [J].
KIM, JO ;
KHOSLA, PK .
IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1992, 8 (03) :338-349
[10]   Sampling-Based Methods for Motion Planning with Constraints [J].
Kingston, Zachary ;
Moll, Mark ;
Kavraki, Lydia E. .
ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 1, 2018, 1 :159-185