Multi-Agent Probabilistic Ensembles With Trajectory Sampling for Connected Autonomous Vehicles

Cited by: 1

Authors:

Wen, Ruoqi [1]

Huang, Jiahao [1]

Li, Rongpeng [1]

Ding, Guoru [2]

Zhao, Zhifeng [1,3]

Affiliations:

[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310058, Peoples R China

[2] Army Engn Univ PLA, Coll Commun & Engn, Nanjing 210007, Peoples R China

[3] Zhejiang Lab, Hangzhou 311121, Peoples R China

Source:

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2024 / Vol. 73 / No. 11

Keywords:

Reinforcement learning; Decision making; Data models; Uncertainty; Probabilistic logic; Vehicle dynamics; Trajectory; Autonomous vehicle control; multi-agent model-based reinforcement learning; probabilistic ensembles with trajectory sampling; VALUE DECOMPOSITION; REINFORCEMENT; COMPLEXITY;

DOI:

10.1109/TVT.2024.3424191

Chinese Library Classification (CLC) number:

TM [Electrical engineering technology]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

Connected Autonomous Vehicles (CAVs) have attracted significant attention in recent years, and Reinforcement Learning (RL) has shown remarkable performance in improving the autonomy of vehicles. In that regard, Model-Based RL (MBRL) manifests itself in sample-efficient learning, but the asymptotic performance of MBRL might lag behind the state-of-the-art Model-Free RL (MFRL) algorithms. Furthermore, most studies for CAVs are limited to the decision-making of a single vehicle only, thus undermining performance due to the absence of communications. In this study, we try to address the decision-making problem of multiple CAVs with limited communications and propose a decentralized Multi-Agent Probabilistic Ensembles (PEs) with Trajectory Sampling (TS) algorithm, namely MA-PETS. In particular, to better capture the uncertainty of the unknown environment, MA-PETS leverages PE neural networks to learn from communicated samples among neighboring CAVs. Afterward, MA-PETS develops TS-based model-predictive control for decision-making. On this basis, we derive the multi-agent group regret bound affected by the number of agents within the communication range and mathematically validate that incorporating effective information exchange among agents into the multi-agent learning scheme contributes to reducing the group regret bound in the worst case. Finally, we empirically demonstrate the superiority of MA-PETS in terms of the sample efficiency comparable to MFRL.

Pages: 16076-16091

Page count: 16

50 items in total

[21] Multi-Agent Autonomous Battle Management Using Deep Neuroevolution
Soleyman, Sean
Hung, Fan
Khosla, Deepak
Chen, Yang
Fadaie, Joshua G.
Naderi, Navid
UNMANNED SYSTEMS TECHNOLOGY XXIII, 2021, 11758
[22] Multi-agent Decision-Making Framework Based on Value Decomposition for Connected Automated Vehicles at Highway On-Ramps
Wang, Jinzhu
Ma, Zhixiong
Zhu, Xichan
SAE INTERNATIONAL JOURNAL OF CONNECTED AND AUTOMATED VEHICLES, 2023, 6 (03):
[23] Quantum Multi-Agent Reinforcement Learning for Autonomous Mobility Cooperation
Park, Soohyun
Kim, Jae Pyoung
Park, Chanyoung
Jung, Soyi
Kim, Joongheon
IEEE COMMUNICATIONS MAGAZINE, 2024, 62 (06) : 106 - 112
[24] Robust experience replay sampling for multi-agent reinforcement learning
Nicholaus, Isack Thomas
Kang, Dae-Ki
PATTERN RECOGNITION LETTERS, 2022, 155 : 135 - 142
[25] A Collaborative Control Scheme for Smart Vehicles Based on Multi-Agent Deep Reinforcement Learning
Shi, Liyan
Chen, Hairui
IEEE ACCESS, 2023, 11 : 96221 - 96234
[26] Factorization of Multi-Agent Sampling-Based Motion Planning
Zanardi, Alessandro
Zullo, Pietro
Censi, Andrea
Frazzoli, Emilio
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 8108 - 8115
[27] Multi-Agent Deep Reinforcement Learning for Joint Decoupled User Association and Trajectory Design in Full-Duplex Multi-UAV Networks
Dai, Chen
Zhu, Kun
Hossain, Ekram
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (10) : 6056 - 6070
[28] An Agent-Based Modelling Framework for Driving Policy Learning in Connected and Autonomous Vehicles
De Silva, Varuna
Wang, Xiongzhao
Aladagli, Deniz
Kondoz, Ahmet
Ekmekcioglu, Erhan
INTELLIGENT SYSTEMS AND APPLICATIONS, INTELLISYS, VOL 2, 2019, 869 : 113 - 125
[29] Fast and Accurate Multi-Agent Trajectory Prediction for Crowded Unknown Scenes
Tao, Xiuye
Li, Huiping
Liang, Bin
Shi, Yang
Xu, Demin
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024,
[30] Autonomous and cooperative control of UAV cluster with multi-agent reinforcement learning
Xu, D.
Chen, G.
AERONAUTICAL JOURNAL, 2022, 126 (1300) : 932 - 951