Decentralized, Safe, Multiagent Motion Planning for Drones Under Uncertainty via Filtered Reinforcement Learning

Cited by: 0

Authors:

Vinod, Abraham P. [1]

Safaoui, Sleiman [2]

Summers, Tyler H. [2]

Yoshikawa, Nobuyuki [3]

Di Cairano, Stefano [1]

Affiliations:

[1] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA

[2] Univ Texas Dallas, Control Optimizat & Networks Lab CONLab, Richardson, TX 75080 USA

[3] Mitsubishi Electr Corp, Chiyoda Ku, Tokyo 1008310, Japan

Source:

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY | 2024 / Vol. 32 / No. 06

Keywords:

Safety; Planning; Vectors; Uncertainty; Trajectory; Stochastic processes; Dynamics; Collision avoidance; constrained control under uncertainty; decentralized model predictive control (MPC); multiagent systems; reinforcement learning (RL); safe learning-based control; MODEL PREDICTIVE CONTROL

DOI:

10.1109/TCST.2024.3433229

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

We propose a decentralized, multiagent motion planner that guarantees the probabilistic safety of a team subject to stochastic uncertainty in the agent model and environment. Our scalable approach generates safe motion plans in real-time using off-the-shelf, single-agent reinforcement learning (RL) rendered safe using distributionally robust, convex optimization and buffered Voronoi cells. We guarantee the recursive feasibility of the mean trajectories and mitigate the conservativeness using a temporal discounting of safety. We show in simulation that our approach generates safe and high-performant trajectories as compared to existing approaches, and further validate these observations in physical experiments using drones.


Pages: 2492 - 2499

Page count: 8

50 entries in total

[21] Hierarchical Multicontact Motion Planning of Hexapod Robots With Incremental Reinforcement Learning
Tang, Kaiqiang
Fu, Huiqiao
Deng, Guizhou
Wang, Xinpeng
Chen, Chunlin
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (04) : 1327 - 1341
[22] Inverse Reinforcement Learning for Decentralized Non-Cooperative Multiagent Systems
Reddy, Tummalapalli Sudhamsh
Gopikrishna, Vamsikrishna
Zaruba, Gergely
Huber, Manfred
PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 1930 - 1935
[23] Using Implicit Behavior Cloning and Dynamic Movement Primitive to Facilitate Reinforcement Learning for Robot Motion Planning
Zhang, Zengjie
Hong, Jayden
Enayati, Amir M. Soufi
Najjaran, Homayoun
IEEE TRANSACTIONS ON ROBOTICS, 2024, 40 : 4733 - 4749
[24] Multiairport Departure Scheduling via Multiagent Reinforcement Learning
Cai, Kaiquan
Li, Ziqi
Guo, Tong
Du, Wenbo
IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2024, 16 (02) : 102 - 116
[25] Safe Reinforcement Learning for Active Distribution Networks Reconfiguration Considering Uncertainty
Hao, Guokai
Li, Yuanzheng
Li, Yang
Guang, Kuo
Zeng, Zhigang
IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2025, 61 (01) : 1757 - 1769
[26] Safe Planning and Control Under Uncertainty: A Model-Free Design With One-Step Backward Data
Li, Cong
Liu, Qingchen
Qin, Jiahu
Buss, Martin
Hirche, Sandra
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (01) : 729 - 738
[27] A review of mobile robot motion planning methods: from classical motion planning workflows to reinforcement learning-based architectures
Dong, Lu
He, Zichen
Song, Chunwei
Sun, Changyin
JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2023, 34 (02) : 439 - 459
[28] Water Age Control for Water Distribution Networks via Safe Reinforcement Learning
Ledesma, Jorge Val
Wisniewski, Rafal
Kallesoe, Carsten S.
Tsouvalas, Agisilaos
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2024, : 2332 - 2343
[29] Multiagent Reinforcement Learning for Community Energy Management to Mitigate Peak Rebounds Under Renewable Energy Uncertainty
Lai, Bo-Chen
Chiu, Wei-Yu
Tsai, Yuan-Po
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (03) : 568 - 579
[30] Safely: Safe Stochastic Motion Planning Under Constrained Sensing via Duality
Hibbard, Michael
Vinod, Abraham P.
Quattrociocchi, Jesse
Topcu, Ufuk
IEEE TRANSACTIONS ON ROBOTICS, 2023, 39 (05) : 3464 - 3478
