Decentralized, Safe, Multiagent Motion Planning for Drones Under Uncertainty via Filtered Reinforcement Learning

被引:0
|
作者
Vinod, Abraham P. [1 ]
Safaoui, Sleiman [2 ]
Summers, Tyler H. [2 ]
Yoshikawa, Nobuyuki [3 ]
Di Cairano, Stefano [1 ]
机构
[1] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA
[2] Univ Texas Dallas, Control Optimizat & Networks Lab CONLab, Richardson, TX 75080 USA
[3] Mitsubishi Electr Corp, Chiyoda Ku, Tokyo 1008310, Japan
关键词
Safety; Planning; Vectors; Uncertainty; Trajectory; Stochastic processes; Dynamics; Collision avoidance; constrained control under uncertainty; decentralized model predictive control (MPC); multiagent systems; reinforcement learning (RL); safe learning-based control; MODEL PREDICTIVE CONTROL;
D O I
10.1109/TCST.2024.3433229
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a decentralized, multiagent motion planner that guarantees the probabilistic safety of a team subject to stochastic uncertainty in the agent model and environment. Our scalable approach generates safe motion plans in real-time using off-the-shelf, single-agent reinforcement learning (RL) rendered safe using distributionally robust, convex optimization and buffered Voronoi cells. We guarantee the recursive feasibility of the mean trajectories and mitigate the conservativeness using a temporal discounting of safety. We show in simulation that our approach generates safe and high-performant trajectories as compared to existing approaches, and further validate these observations in physical experiments using drones.
引用
收藏
页码:2492 / 2499
页数:8
相关论文
共 50 条
  • [1] Safe Multiagent Motion Planning Under Uncertainty for Drones Using Filtered Reinforcement Learning
    Safaoui, Sleiman
    Vinod, Abraham P.
    Chakrabarty, Ankush
    Quirynen, Rien
    Yoshikawa, Nobuyuki
    Di Cairano, Stefano
    IEEE TRANSACTIONS ON ROBOTICS, 2024, 40 (2529-2542) : 2529 - 2542
  • [2] Safe Reinforcement Learning With Stability Guarantee for Motion Planning of Autonomous Vehicles
    Zhang, Lixian
    Zhang, Ruixian
    Wu, Tong
    Weng, Rui
    Han, Minghao
    Zhao, Ye
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (12) : 5435 - 5444
  • [3] Decentralized Motion Planning for Multiagent Collaboration Under Coupled LTL Task Specifications
    Tian, Daiying
    Fang, Hao
    Yang, Qingkai
    Wei, Yue
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (06): : 3602 - 3611
  • [4] Safe and Efficient Path Planning Under Uncertainty via Deep Collision Probability Fields
    Herrmann, Felix
    Zach, Sebastian
    Banfi, Jacopo
    Peters, Jan
    Chalvatzaki, Georgia
    Tateo, Davide
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (11): : 9327 - 9334
  • [5] Online Mapping and Motion Planning Under Uncertainty for Safe Navigation in Unknown Environments
    Pairet, Eric
    Hernandez, Juan David
    Carreras, Marc
    Petillot, Yvan
    Lahijanian, Morteza
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2022, 19 (04) : 3356 - 3378
  • [6] Cooperative Motion Planning in Divided Environments via Congestion-Aware Deep Reinforcement Learning
    Du, Yuanyuan
    Zhang, Jianan
    Cheng, Xiang
    Cui, Shuguang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03): : 2295 - 2302
  • [7] Safe Planning and Control Under Uncertainty for Self-Driving
    Khaitan, Shivesh
    Lin, Qin
    Dolan, John M.
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (10) : 9826 - 9837
  • [8] Decentralized Reinforcement Learning Inspired by Multiagent Systems
    Adjodah, Dhaval
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 1729 - 1730
  • [9] Formation Tracking of Spatiotemporal Multiagent Systems: A Decentralized Reinforcement Learning Approach
    Liu, Tianrun
    Chen, Yang-Yang
    IEEE SYSTEMS MAN AND CYBERNETICS MAGAZINE, 2024, 10 (04): : 52 - 60
  • [10] Distributed safe reinforcement learning for multi-robot motion planning
    Lu, Yang
    Guo, Yaohua
    Zhao, Guoxiang
    Zhu, Minghui
    2021 29TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2021, : 1209 - 1214