Decentralized, Safe, Multiagent Motion Planning for Drones Under Uncertainty via Filtered Reinforcement Learning

被引:0
|
作者
Vinod, Abraham P. [1 ]
Safaoui, Sleiman [2 ]
Summers, Tyler H. [2 ]
Yoshikawa, Nobuyuki [3 ]
Di Cairano, Stefano [1 ]
机构
[1] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA
[2] Univ Texas Dallas, Control Optimizat & Networks Lab CONLab, Richardson, TX 75080 USA
[3] Mitsubishi Electr Corp, Chiyoda Ku, Tokyo 1008310, Japan
关键词
Safety; Planning; Vectors; Uncertainty; Trajectory; Stochastic processes; Dynamics; Collision avoidance; constrained control under uncertainty; decentralized model predictive control (MPC); multiagent systems; reinforcement learning (RL); safe learning-based control; MODEL PREDICTIVE CONTROL;
D O I
10.1109/TCST.2024.3433229
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a decentralized, multiagent motion planner that guarantees the probabilistic safety of a team subject to stochastic uncertainty in the agent model and environment. Our scalable approach generates safe motion plans in real-time using off-the-shelf, single-agent reinforcement learning (RL) rendered safe using distributionally robust, convex optimization and buffered Voronoi cells. We guarantee the recursive feasibility of the mean trajectories and mitigate the conservativeness using a temporal discounting of safety. We show in simulation that our approach generates safe and high-performant trajectories as compared to existing approaches, and further validate these observations in physical experiments using drones.
引用
收藏
页码:2492 / 2499
页数:8
相关论文
共 50 条
  • [21] Hierarchical Multicontact Motion Planning of Hexapod Robots With Incremental Reinforcement Learning
    Tang, Kaiqiang
    Fu, Huiqiao
    Deng, Guizhou
    Wang, Xinpeng
    Chen, Chunlin
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (04) : 1327 - 1341
  • [22] Inverse Reinforcement Learning for Decentralized Non-Cooperative Multiagent Systems
    Reddy, Tummalapalli Sudhamsh
    Gopikrishna, Vamsikrishna
    Zaruba, Gergely
    Huber, Manfred
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 1930 - 1935
  • [23] Using Implicit Behavior Cloning and Dynamic Movement Primitive to Facilitate Reinforcement Learning for Robot Motion Planning
    Zhang, Zengjie
    Hong, Jayden
    Enayati, Amir M. Soufi
    Najjaran, Homayoun
    IEEE TRANSACTIONS ON ROBOTICS, 2024, 40 : 4733 - 4749
  • [24] Multiairport Departure Scheduling via Multiagent Reinforcement Learning
    Cai, Kaiquan
    Li, Ziqi
    Guo, Tong
    Du, Wenbo
    IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2024, 16 (02) : 102 - 116
  • [25] Safe Reinforcement Learning for Active Distribution Networks Reconfiguration Considering Uncertainty
    Hao, Guokai
    Li, Yuanzheng
    Li, Yang
    Guang, Kuo
    Zeng, Zhigang
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2025, 61 (01) : 1757 - 1769
  • [26] Safe Planning and Control Under Uncertainty: A Model-Free Design With One-Step Backward Data
    Li, Cong
    Liu, Qingchen
    Qin, Jiahu
    Buss, Martin
    Hirche, Sandra
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (01) : 729 - 738
  • [27] A review of mobile robot motion planning methods: from classical motion planning workflows to reinforcement learning-based architectures
    Dong, Lu
    He, Zichen
    Song, Chunwei
    Sun, Changyin
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2023, 34 (02) : 439 - 459
  • [28] Water Age Control for Water Distribution Networks via Safe Reinforcement Learning
    Ledesma, Jorge Val
    Wisniewski, Rafal
    Kallesoe, Carsten S.
    Tsouvalas, Agisilaos
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2024, : 2332 - 2343
  • [29] Multiagent Reinforcement Learning for Community Energy Management to Mitigate Peak Rebounds Under Renewable Energy Uncertainty
    Lai, Bo-Chen
    Chiu, Wei-Yu
    Tsai, Yuan-Po
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (03): : 568 - 579
  • [30] Safely: Safe Stochastic Motion Planning Under Constrained Sensing via Duality
    Hibbard, Michael
    Vinod, Abraham P.
    Quattrociocchi, Jesse
    Topcu, Ufuk
    IEEE TRANSACTIONS ON ROBOTICS, 2023, 39 (05) : 3464 - 3478