An Efficient Message Dissemination Scheme for Cooperative Drivings via Cooperative Hierarchical Attention Reinforcement Learning

被引:4
作者
Liu, Bingyi [1 ,2 ,3 ]
Han, Weizhen [1 ]
Wang, Enshu [4 ]
Xiong, Shengwu [1 ]
Qiao, Chunming [5 ]
Wang, Jianping [6 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan, Peoples R China
[2] Wuhan Univ Technol, Sanya Sci & Educ Innovat Pk, Sanya 572025, Peoples R China
[3] Wuhan Univ Technol, Chongqing Res Inst, Chongqing 401120, Peoples R China
[4] Soochow Univ, Dept Future Sci & Engn, Suzhou 214998, Peoples R China
[5] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
[6] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Decision making; Vehicle dynamics; Games; Electronic mail; Collaboration; Time division multiple access; Cooperative driving; multi-agent reinforcement learning; hierarchical reinforcement learning; graph attention network; CONGESTION CONTROL;
D O I
10.1109/TMC.2023.3312220
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A group of connected and autonomous vehicles with common interests can drive in a cooperative manner, namely cooperative driving. In such a networked control system, an efficient message dissemination scheme is critical for cooperative drivings to periodically broadcast their kinetic status, i.e., beacon. However, most existing researches are designed for a simple or specific scenario, e.g., ignoring the impacts of the complex communication environment and emerging hybrid traffic scenarios. Worse still, the inevitable message transmission interference and the limited interaction among vehicles in harsh communication environments seriously hinder cooperation among cooperative drivings and deteriorate the beaconing performance. In this paper, we formulate the decision-making process of cooperative drivings as a Markov game. Furthermore, we propose a cooperative hierarchical attention reinforcement learning (CHA) framework to solve this Markov game. Specifically, the hierarchical structure of CHA leads cooperative drivings to be foresighted. Besides, we integrate each hierarchical level of CHA separately with graph attention networks to incorporate agents' mutual influences in the decision-making process. Moreover, each hierarchical level learns a cooperative reward function to motivate each agent to cooperate with others under harsh communication conditions. Finally, we set up a simulator and conduct extensive experiments to validate the effectiveness of CHA.
引用
收藏
页码:5527 / 5542
页数:16
相关论文
共 46 条
  • [11] Foerster JN, 2018, AAAI CONF ARTIF INTE, P2974
  • [12] Distributed Multichannel and Mobility-Aware Cluster-Based MAC Protocol for Vehicular Ad Hoc Networks
    Hafeez, Khalid Abdel
    Zhao, Lian
    Mark, Jon W.
    Shen, Xuemin
    Niu, Zhisheng
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2013, 62 (08) : 3886 - 3902
  • [13] Hairi F., 2022, P INT C LEARN REPR, P1
  • [14] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
  • [15] Deep Reinforcement Learning for Online Computation Offloading in Wireless Powered Mobile-Edge Computing Networks
    Huang, Liang
    Bi, Suzhi
    Zhang, Ying-Jun Angela
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2020, 19 (11) : 2581 - 2593
  • [16] A Platoon-Centric Multi-Channel Access Scheme for Hybrid Traffic
    Huang, Yan
    Shen, Yuan
    Wang, Jian
    Zhang, Xudong
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (06) : 5404 - 5418
  • [17] A Survey on Platoon-Based Vehicular Cyber-Physical Systems
    Jia, Dongyao
    Lu, Kejie
    Wang, Jianping
    Zhang, Xiang
    Shen, Xuemin
    [J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2016, 18 (01) : 263 - 284
  • [18] Ke N. R., 2018, P 31 ADV NEUR INF PR, P7651
  • [19] Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning
    Li, Minne
    Qin, Zhiwei
    Jiao, Yan
    Yang, Yaodong
    Gong, Zhichen
    Wang, Jun
    Wang, Chenxi
    Wu, Guobin
    Ye, Jieping
    [J]. WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 983 - 994
  • [20] TCGMAC: A TDMA-based MAC protocol with collision alleviation based on slot declaration and game theory in VANETS
    Li, Shujing
    Liu, Yanheng
    Wang, Jian
    Ge, Yuming
    Deng, Lingyue
    Deng, Weiwen
    [J]. TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2019, 30 (12)