Reactive optimal motion planning for a class of holonomic planar agents using reinforcement learning with provable guarantees

被引:1
|
作者
Rousseas, Panagiotis [1 ]
Bechlioulis, Charalampos [2 ]
Kyriakopoulos, Kostas [3 ]
机构
[1] Natl Tech Univ Athens, Sch Mech Engn, Control Syst Lab, Athens, Greece
[2] Univ Patras, Dept Elect & Comp Engn, Div Syst & Control, Patras, Greece
[3] New York Univ, Ctr AI & Robot CAIR, Abu Dhabi, U Arab Emirates
来源
关键词
optimal motion planning; optimal control; reinforcement learning; nonlinear systems and control; path planning; ROBOT NAVIGATION; NONLINEAR-SYSTEMS; NETWORK;
D O I
10.3389/frobt.2023.1255696
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In control theory, reactive methods have been widely celebrated owing to their success in providing robust, provably convergent solutions to control problems. Even though such methods have long been formulated for motion planning, optimality has largely been left untreated through reactive means, with the community focusing on discrete/graph-based solutions. Although the latter exhibit certain advantages (completeness, complicated state-spaces), the recent rise in Reinforcement Learning (RL), provides novel ways to address the limitations of reactive methods. The goal of this paper is to treat the reactive optimal motion planning problem through an RL framework. A policy iteration RL scheme is formulated in a consistent manner with the control-theoretic results, thus utilizing the advantages of each approach in a complementary way; RL is employed to construct the optimal input without necessitating the solution of a hard, non-linear partial differential equation. Conversely, safety, convergence and policy improvement are guaranteed through control theoretic arguments. The proposed method is validated in simulated synthetic workspaces, and compared against reactive methods as well as a PRM and an RRT star approach. The proposed method outperforms or closely matches the latter methods, indicating the near global optimality of the former, while providing a solution for planning from anywhere within the workspace to the goal position.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Reactive motion planning of manipulator by distributed learning agents using reinforcement learning
    Naruse, Keitarou
    Kakazu, Yukinori
    Nippon Kikai Gakkai Ronbunshu, C Hen/Transactions of the Japan Society of Mechanical Engineers, Part C, 1995, 61 (581): : 131 - 137
  • [2] Optimal Robot Motion Planning in Constrained Workspaces Using Reinforcement Learning
    Rousseas, Panagiotis
    Bechlioulis, Charalampos P.
    Kyriakopoulos, Kostas J.
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 6917 - 6922
  • [3] Optimal Motion Planning in Unknown Workspaces Using Integral Reinforcement Learning
    Rousseas, Panagiotis
    Bechlioulis, Charalampos P.
    Kyriakopoulos, Kostas J.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 6926 - 6933
  • [4] Motion Planning of a Non-holonomic Vehicle in a Real Environment by Reinforcement Learning
    Gomez, M.
    Gayarre, L.
    Martinez-Marin, T.
    Sanchez, S.
    Meziat, D.
    BIO-INSPIRED SYSTEMS: COMPUTATIONAL AND AMBIENT INTELLIGENCE, PT 1, 2009, 5517 : 813 - +
  • [5] Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance Guarantees
    Panaganti, Kishan
    Kalathil, Dileep
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [6] Planning in Hierarchical Reinforcement Learning: Guarantees for Using Local Policies
    Zahavy, Tom
    Hasidim, Avinatan
    Kaplan, Haim
    Mansour, Yishay
    ALGORITHMIC LEARNING THEORY, VOL 117, 2020, 117 : 906 - 934
  • [7] Optimal motion planning by reinforcement learning in autonomous mobile vehicles
    Gomez, M.
    Gonzalez, R. V.
    Martinez-Marin, T.
    Meziat, D.
    Sanchez, S.
    ROBOTICA, 2012, 30 : 159 - 170
  • [8] Harmonic-Based Optimal Motion Planning in Constrained Workspaces Using Reinforcement Learning
    Rousseas, Panagiotis
    Bechlioulis, Charalampos
    Kyriakopoulos, Kostas J.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 2005 - 2011
  • [9] Motion planning algorithm for non-holonomic autonomous underwater vehicle in disturbance using reinforcement learning and teaching method
    Kawano, H
    Ura, T
    2002 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, 2002, : 4032 - 4038
  • [10] Motion Planning for Industrial Robots using Reinforcement Learning
    Meyes, Richard
    Tercan, Hasan
    Roggendorf, Simon
    Thiele, Thomas
    Buescher, Christian
    Obdenbusch, Markus
    Brecher, Christian
    Jeschke, Sabina
    Meisen, Tobias
    MANUFACTURING SYSTEMS 4.0, 2017, 63 : 107 - 112