An Efficient Message Dissemination Scheme for Cooperative Drivings via Cooperative Hierarchical Attention Reinforcement Learning

被引：4

作者：

Liu, Bingyi ^{[1
,2
,3
]}

Han, Weizhen ^{[1
]}

Wang, Enshu ^{[4
]}

Xiong, Shengwu ^{[1
]}

Qiao, Chunming ^{[5
]}

Wang, Jianping ^{[6
]}

机构：

[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan, Peoples R China

[2] Wuhan Univ Technol, Sanya Sci & Educ Innovat Pk, Sanya 572025, Peoples R China

[3] Wuhan Univ Technol, Chongqing Res Inst, Chongqing 401120, Peoples R China

[4] Soochow Univ, Dept Future Sci & Engn, Suzhou 214998, Peoples R China

[5] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA

[6] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China

来源：

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024年 / 23卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning; Decision making; Vehicle dynamics; Games; Electronic mail; Collaboration; Time division multiple access; Cooperative driving; multi-agent reinforcement learning; hierarchical reinforcement learning; graph attention network; CONGESTION CONTROL;

D O I：

10.1109/TMC.2023.3312220

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

A group of connected and autonomous vehicles with common interests can drive in a cooperative manner, namely cooperative driving. In such a networked control system, an efficient message dissemination scheme is critical for cooperative drivings to periodically broadcast their kinetic status, i.e., beacon. However, most existing researches are designed for a simple or specific scenario, e.g., ignoring the impacts of the complex communication environment and emerging hybrid traffic scenarios. Worse still, the inevitable message transmission interference and the limited interaction among vehicles in harsh communication environments seriously hinder cooperation among cooperative drivings and deteriorate the beaconing performance. In this paper, we formulate the decision-making process of cooperative drivings as a Markov game. Furthermore, we propose a cooperative hierarchical attention reinforcement learning (CHA) framework to solve this Markov game. Specifically, the hierarchical structure of CHA leads cooperative drivings to be foresighted. Besides, we integrate each hierarchical level of CHA separately with graph attention networks to incorporate agents' mutual influences in the decision-making process. Moreover, each hierarchical level learns a cooperative reward function to motivate each agent to cooperate with others under harsh communication conditions. Finally, we set up a simulator and conduct extensive experiments to validate the effectiveness of CHA.

引用

页码：5527 / 5542

页数：16

共 46 条

[11] Foerster JN, 2018, AAAI CONF ARTIF INTE, P2974
[12] Distributed Multichannel and Mobility-Aware Cluster-Based MAC Protocol for Vehicular Ad Hoc Networks
Hafeez, Khalid Abdel
Zhao, Lian
Mark, Jon W.
Shen, Xuemin
Niu, Zhisheng
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2013, 62 (08) : 3886 - 3902
[13] Hairi F., 2022, P INT C LEARN REPR, P1
[14] Densely Connected Convolutional Networks
Huang, Gao
Liu, Zhuang
van der Maaten, Laurens
Weinberger, Kilian Q.
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
[15] Deep Reinforcement Learning for Online Computation Offloading in Wireless Powered Mobile-Edge Computing Networks
Huang, Liang
Bi, Suzhi
Zhang, Ying-Jun Angela
[J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2020, 19 (11) : 2581 - 2593
[16] A Platoon-Centric Multi-Channel Access Scheme for Hybrid Traffic
Huang, Yan
Shen, Yuan
Wang, Jian
Zhang, Xudong
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (06) : 5404 - 5418
[17] A Survey on Platoon-Based Vehicular Cyber-Physical Systems
Jia, Dongyao
Lu, Kejie
Wang, Jianping
Zhang, Xiang
Shen, Xuemin
[J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2016, 18 (01) : 263 - 284
[18] Ke N. R., 2018, P 31 ADV NEUR INF PR, P7651
[19] Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning
Li, Minne
Qin, Zhiwei
Jiao, Yan
Yang, Yaodong
Gong, Zhichen
Wang, Jun
Wang, Chenxi
Wu, Guobin
Ye, Jieping
[J]. WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 983 - 994
[20] TCGMAC: A TDMA-based MAC protocol with collision alleviation based on slot declaration and game theory in VANETS
Li, Shujing
Liu, Yanheng
Wang, Jian
Ge, Yuming
Deng, Lingyue
Deng, Weiwen
[J]. TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2019, 30 (12)

← 1 2 3 4 5 →