Multi-Agent Probabilistic Ensembles With Trajectory Sampling for Connected Autonomous Vehicles

被引:1
作者
Wen, Ruoqi [1 ]
Huang, Jiahao [1 ]
Li, Rongpeng [1 ]
Ding, Guoru [2 ]
Zhao, Zhifeng [1 ,3 ]
机构
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310058, Peoples R China
[2] Army Engn Univ PLA, Coll Commun & Engn, Nanjing 210007, Peoples R China
[3] Zhejiang Lab, Hangzhou 311121, Peoples R China
关键词
Reinforcement learning; Decision making; Data models; Uncertainty; Probabilistic logic; Vehicle dynamics; Trajectory; Autonomous vehicle control; multi-agent model-based reinforcement learning; probabilistic ensembles with trajectory sampling; VALUE DECOMPOSITION; REINFORCEMENT; COMPLEXITY;
D O I
10.1109/TVT.2024.3424191
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Connected Autonomous Vehicles (CAVs) have attracted significant attention in recent years and Reinforcement Learning (RL) has shown remarkable performance in improving the autonomy of vehicles. In that regard, Model-Based RL (MBRL) manifests itself in sample-efficient learning, but the asymptotic performance of MBRL might lag behind the state-of-the-art Model-Free RL (MFRL) algorithms. Furthermore, most studies for CAVs are limited to the decision-making of a single vehicle only, thus underscoring the performance due to the absence of communications. In this study, we try to address the decision-making problem of multiple CAVs with limited communications and propose a decentralized Multi-Agent Probabilistic Ensembles (PEs) with Trajectory Sampling (TS) algorithm namely MA-PETS. In particular, to better capture the uncertainty of the unknown environment, MA-PETS leverages PE neural networks to learn from communicated samples among neighboring CAVs. Afterward, MA-PETS capably develops TS-based model-predictive control for decision-making. On this basis, we derive the multi-agent group regret bound affected by the number of agents within the communication range and mathematically validate that incorporating effective information exchange among agents into the multi-agent learning scheme contributes to reducing the group regret bound in the worst case. Finally, we empirically demonstrate the superiority of MA-PETS in terms of the sample efficiency comparable to MFRL.
引用
收藏
页码:16076 / 16091
页数:16
相关论文
共 50 条
  • [1] Path Planning Strategy Based on Principal Component Federation for Multi-Agent in Connected Vehicles
    Li, Ruonan
    Qin, Yang
    Liu, Jie
    Zang, Lu
    Li, Jinlong
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024,
  • [2] Multi-Agent Constrained Policy Optimization for Conflict-Free Management of Connected Autonomous Vehicles at Unsignalized Intersections
    Zhao, Rui
    Li, Yun
    Gao, Fei
    Gao, Zhenhai
    Zhang, Tianyao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (06) : 5374 - 5388
  • [3] Multi-Agent Formation Tracking for Autonomous Surface Vehicles
    Ringback, Rasmus
    Wei, Jieqiang
    Erstorp, Elias Strandell
    Kuttenkeuler, Jakob
    Johansen, Tor Arne
    Johansson, Karl Henrik
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2021, 29 (06) : 2287 - 2298
  • [4] Limited Information Aggregation for Collaborative Driving in Multi-Agent Autonomous Vehicles
    Liang, Qingyi
    Liu, Jia
    Jiang, Zhengmin
    Yin, Jianwen
    Xu, Kun
    Li, Huiyun
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07): : 6624 - 6631
  • [5] A study on multi-agent reinforcement learning for autonomous distribution vehicles
    Serap Ergün
    Iran Journal of Computer Science, 2023, 6 (4) : 297 - 305
  • [6] Multi-Agent DRL-Controlled Connected and Automated Vehicles in Mixed Traffic With Time Delays
    Wang, Zhuwei
    Xue, Yi
    Liu, Lihan
    Zhang, Haijun
    Qu, Chunhui
    Fang, Chao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 17676 - 17688
  • [7] Multi-agent Generalized Probabilistic RoadMaps : MAGPRM
    Kumar, Sandip
    Chakravorty, Suman
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 3747 - 3753
  • [8] Collaborative Uncertainty Benefits Multi-Agent Multi-Modal Trajectory Forecasting
    Tang, Bohan
    Zhong, Yiqi
    Xu, Chenxin
    Wu, Wei-Tao
    Neumann, Ulrich
    Zhang, Ya
    Chen, Siheng
    Wang, Yanfeng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13297 - 13313
  • [9] Optimizing Mixed Autonomy Traffic Flow with Decentralized Autonomous Vehicles and Multi-Agent Reinforcement Learning
    Vinitsky, Eugene
    Lichtle, Nathan
    Parvate, Kanaad
    Bayen, Alexandre
    ACM TRANSACTIONS ON CYBER-PHYSICAL SYSTEMS, 2023, 7 (02) : 1 - 22
  • [10] Scalable Autonomous Separation Assurance With Heterogeneous Multi-Agent Reinforcement Learning
    Brittain, Marc
    Wei, Peng
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2022, 19 (04) : 2837 - 2848