Multi-Agent Probabilistic Ensembles With Trajectory Sampling for Connected Autonomous Vehicles

被引:1
作者
Wen, Ruoqi [1 ]
Huang, Jiahao [1 ]
Li, Rongpeng [1 ]
Ding, Guoru [2 ]
Zhao, Zhifeng [1 ,3 ]
机构
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310058, Peoples R China
[2] Army Engn Univ PLA, Coll Commun & Engn, Nanjing 210007, Peoples R China
[3] Zhejiang Lab, Hangzhou 311121, Peoples R China
关键词
Reinforcement learning; Decision making; Data models; Uncertainty; Probabilistic logic; Vehicle dynamics; Trajectory; Autonomous vehicle control; multi-agent model-based reinforcement learning; probabilistic ensembles with trajectory sampling; VALUE DECOMPOSITION; REINFORCEMENT; COMPLEXITY;
D O I
10.1109/TVT.2024.3424191
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Connected Autonomous Vehicles (CAVs) have attracted significant attention in recent years and Reinforcement Learning (RL) has shown remarkable performance in improving the autonomy of vehicles. In that regard, Model-Based RL (MBRL) manifests itself in sample-efficient learning, but the asymptotic performance of MBRL might lag behind the state-of-the-art Model-Free RL (MFRL) algorithms. Furthermore, most studies for CAVs are limited to the decision-making of a single vehicle only, thus underscoring the performance due to the absence of communications. In this study, we try to address the decision-making problem of multiple CAVs with limited communications and propose a decentralized Multi-Agent Probabilistic Ensembles (PEs) with Trajectory Sampling (TS) algorithm namely MA-PETS. In particular, to better capture the uncertainty of the unknown environment, MA-PETS leverages PE neural networks to learn from communicated samples among neighboring CAVs. Afterward, MA-PETS capably develops TS-based model-predictive control for decision-making. On this basis, we derive the multi-agent group regret bound affected by the number of agents within the communication range and mathematically validate that incorporating effective information exchange among agents into the multi-agent learning scheme contributes to reducing the group regret bound in the worst case. Finally, we empirically demonstrate the superiority of MA-PETS in terms of the sample efficiency comparable to MFRL.
引用
收藏
页码:16076 / 16091
页数:16
相关论文
共 50 条
  • [21] Multi-Agent Autonomous Battle Management Using Deep Neuroevolution
    Soleyman, Sean
    Hung, Fan
    Khosla, Deepak
    Chen, Yang
    Fadaie, Joshua G.
    Naderi, Navid
    UNMANNED SYSTEMS TECHNOLOGY XXIII, 2021, 11758
  • [22] Multi-agent Decision-Making Framework Based on Value Decomposition for Connected Automated Vehicles at Highway On-Ramps
    Wang, Jinzhu
    Ma, Zhixiong
    Zhu, Xichan
    SAE INTERNATIONAL JOURNAL OF CONNECTED AND AUTOMATED VEHICLES, 2023, 6 (03):
  • [23] Quantum Multi-Agent Reinforcement Learning for Autonomous Mobility Cooperation
    Park, Soohyun
    Kim, Jae Pyoung
    Park, Chanyoung
    Jung, Soyi
    Kim, Joongheon
    IEEE COMMUNICATIONS MAGAZINE, 2024, 62 (06) : 106 - 112
  • [24] Robust experience replay sampling for multi-agent reinforcement learning
    Nicholaus, Isack Thomas
    Kang, Dae-Ki
    PATTERN RECOGNITION LETTERS, 2022, 155 : 135 - 142
  • [25] A Collaborative Control Scheme for Smart Vehicles Based on Multi-Agent Deep Reinforcement Learning
    Shi, Liyan
    Chen, Hairui
    IEEE ACCESS, 2023, 11 : 96221 - 96234
  • [26] Factorization of Multi-Agent Sampling-Based Motion Planning
    Zanardi, Alessandro
    Zullo, Pietro
    Censi, Andrea
    Frazzoli, Emilio
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 8108 - 8115
  • [27] Multi-Agent Deep Reinforcement Learning for Joint Decoupled User Association and Trajectory Design in Full-Duplex Multi-UAV Networks
    Dai, Chen
    Zhu, Kun
    Hossain, Ekram
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (10) : 6056 - 6070
  • [28] An Agent-Based Modelling Framework for Driving Policy Learning in Connected and Autonomous Vehicles
    De Silva, Varuna
    Wang, Xiongzhao
    Aladagli, Deniz
    Kondoz, Ahmet
    Ekmekcioglu, Erhan
    INTELLIGENT SYSTEMS AND APPLICATIONS, INTELLISYS, VOL 2, 2019, 869 : 113 - 125
  • [29] Fast and Accurate Multi-Agent Trajectory Prediction for Crowded Unknown Scenes
    Tao, Xiuye
    Li, Huiping
    Liang, Bin
    Shi, Yang
    Xu, Demin
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024,
  • [30] Autonomous and cooperative control of UAV cluster with multi-agent reinforcement learning
    Xu, D.
    Chen, G.
    AERONAUTICAL JOURNAL, 2022, 126 (1300) : 932 - 951