Decentralized, Safe, Multiagent Motion Planning for Drones Under Uncertainty via Filtered Reinforcement Learning

Cited by: 0
Authors
Vinod, Abraham P. [1]
Safaoui, Sleiman [2]
Summers, Tyler H. [2]
Yoshikawa, Nobuyuki [3]
Di Cairano, Stefano [1]
Affiliations
[1] Mitsubishi Electric Research Laboratories, Cambridge, MA 02139 USA
[2] University of Texas at Dallas, Control, Optimization, and Networks Lab (CONLab), Richardson, TX 75080 USA
[3] Mitsubishi Electric Corporation, Chiyoda-ku, Tokyo 100-8310, Japan
Keywords
Safety; Planning; Vectors; Uncertainty; Trajectory; Stochastic processes; Dynamics; Collision avoidance; constrained control under uncertainty; decentralized model predictive control (MPC); multiagent systems; reinforcement learning (RL); safe learning-based control; model predictive control
DOI
10.1109/TCST.2024.3433229
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We propose a decentralized, multiagent motion planner that guarantees the probabilistic safety of a team subject to stochastic uncertainty in the agent model and environment. Our scalable approach generates safe motion plans in real time using off-the-shelf, single-agent reinforcement learning (RL), rendered safe by distributionally robust convex optimization and buffered Voronoi cells. We guarantee the recursive feasibility of the mean trajectories and mitigate conservativeness through temporal discounting of safety. We show in simulation that our approach generates safe, high-performing trajectories compared with existing approaches, and we further validate these observations in physical experiments using drones.
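The idea of filtering an RL proposal through a buffered Voronoi cell can be illustrated with a heavily simplified sketch. It is not the authors' planner: it ignores dynamics, stochastic uncertainty, and the distributionally robust formulation, and it replaces the convex program with Dykstra's alternating projections onto half-spaces. The function names, the buffer `r`, and the example positions are all assumptions made for illustration.

```python
import math

def bvc_halfspaces(p_i, neighbors, r):
    """Half-spaces (a, b), meaning a.p <= b, of agent i's buffered Voronoi cell.

    Each neighbor j contributes the bisecting hyperplane between p_i and p_j,
    pulled back toward agent i by the (assumed) safety buffer r.
    """
    cells = []
    for p_j in neighbors:
        d = [pj - pi for pi, pj in zip(p_i, p_j)]
        norm = math.sqrt(sum(x * x for x in d))
        a = [x / norm for x in d]                      # unit normal toward j
        mid = [(pi + pj) / 2 for pi, pj in zip(p_i, p_j)]
        b = sum(ak * mk for ak, mk in zip(a, mid)) - r
        cells.append((a, b))
    return cells

def project_halfspace(p, a, b):
    """Euclidean projection of p onto {x : a.x <= b}, with a a unit vector."""
    viol = sum(ak * pk for ak, pk in zip(a, p)) - b
    if viol <= 0:
        return list(p)
    return [pk - viol * ak for ak, pk in zip(a, p)]

def filter_waypoint(p_rl, p_i, neighbors, r, iters=200):
    """Closest point to the RL proposal p_rl inside the buffered Voronoi cell.

    Uses Dykstra's algorithm (a stand-in for a QP solver), which converges to
    the projection onto an intersection of convex sets.
    """
    cells = bvc_halfspaces(p_i, neighbors, r)
    x = list(p_rl)
    corr = [[0.0] * len(x) for _ in cells]             # Dykstra corrections
    for _ in range(iters):
        for k, (a, b) in enumerate(cells):
            y = [xk + ck for xk, ck in zip(x, corr[k])]
            x_new = project_halfspace(y, a, b)
            corr[k] = [yk - xn for yk, xn in zip(y, x_new)]
            x = x_new
    return x

# Hypothetical scenario: agent at the origin, two neighbors, buffer 0.25.
safe = filter_waypoint([1.5, 1.5], [0.0, 0.0], [[2.0, 0.0], [0.0, 2.0]], 0.25)
```

Because each agent needs only its neighbors' positions to build its own cell, a filter of this kind can run independently on every agent, which is the property that makes a decentralized scheme possible.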
Pages: 2492-2499
Page count: 8