Dense Multiagent Reinforcement Learning Aided Multi-UAV Information Coverage for Vehicular Networks

被引：5

作者：

Fu, Hang ^{[1
,2
]}

Wang, Jingjing ^{[1
,2
]}

Chen, Jianrui ^{[1
,3
]}

Ren, Pengfei ^{[1
]}

Zhang, Zheng ^{[1
]}

Zhao, Guodong ^{[4
]}

机构：

[1] Beihang Univ, Sch Cyber Sci & Technol, Beijing 100191, Peoples R China

[2] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China

[3] Peng Cheng Lab, Shenzhen 518000, Peoples R China

[4] Beihang Univ, Sch Aeronaut Sci & Engn, Beijing 100191, Peoples R China

来源：

IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 12期

关键词：

Heuristic algorithms; Autonomous aerial vehicles; Vehicle dynamics; Training; Internet of Things; Energy consumption; Decision making; Communication coverage; dense reinforcement learning; distributed multiunmanned aerial vehicle (UAV); multiagent reinforcement learning (MARL); vehicular networks; RESOURCE-ALLOCATION; COMMUNICATION; OPTIMIZATION; ALTITUDE; INTERNET;

D O I：

10.1109/JIOT.2024.3367005

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the rapid development of wireless communication networks, UAVs serving as base stations are increasingly being applied in various scenarios which not only include edge computation and task offloading, but also involve emergency communication, vehicular network enhancement, etc. In order to enhance the utility of UAV base stations' allocation and deployment, a series of algorithms have been proposed, utilizing heuristic methods, learning-based algorithms, or optimization approaches. However, it is intractable for current algorithms to handle the exponential computation increment with UAV base stations increasing, and complicated application scenarios with high dynamic demands. To solve the above issues, we formulate a decision problem with a long sequence to optimize the deployment of multi-UAV base stations for maximizing vehicular networks' communication coverage ratio, which needs to be subject to co-constraints consisting of moving velocity, energy consumption, and communication coverage radius. To solve this optimization problem, we creatively propose an algorithm named dense multiagent reinforcement learning (DMARL), which is under the dual-layer nested decision-making framework, centralized training with decentralized deployment, and accelerates training by only collecting critical states into the dense sampling buffer. To prove our proposed algorithm's effectiveness and generalization ability, we conduct experimental simulations in scenarios with different scales. Corresponding results have been provided to verify our algorithm's superiority in training efficiency and performance metrics, including coverage ratio and energy consumption, compared with other algorithms.

引用

页码：21274 / 21286

页数：13

共 50 条

[31] Throughput Maximization in NOMA Enhanced RIS-Assisted Multi-UAV Networks: A Deep Reinforcement Learning Approach
Tang, Runzhi
Wang, Junxuan
Zhang, Yanyan
Jiang, Fan
Zhang, Xuewei
Du, Jianbo
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (01) : 730 - 745
[32] Multiagent Reinforcement Learning in Controlling Offloading Ratio and Trajectory for Multi-UAV Mobile-Edge Computing
Lee, Wonseok
Kim, Taejoon
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (02) : 3417 - 3429
[33] Optimizing Virtual Functions Deployment in Multi-UAV IoT Networks
Forghani, Athena
Chin, Kwan-Wu
Ros, Montserrat
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (11): : 20367 - 20378
[34] Deep Reinforcement Learning-Based Dual-Timescale Service Caching and Computation Offloading for Multi-UAV Assisted MEC Systems
Lin, Na
Han, Xiao
Hawbani, Ammar
Sun, Yunhe
Guan, Yunchong
Zhao, Liang
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2025, 22 (01): : 605 - 617
[35] Extrinsic-and-Intrinsic Reward-Based Multi-Agent Reinforcement Learning for Multi-UAV Cooperative Target Encirclement
Chen, Jinchao
Wang, Yang
Zhang, Ying
Lu, Yantao
Shu, Qiuhao
Hu, Yujiao
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025,
[36] Channel-State Information-Driven Data Rate Optimization for Multi-UAV IoT Networks
Bera, Abhishek
Misra, Sudip
Chatterjee, Chandranath
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (21) : 19177 - 19186
[37] UAV-Assisted Heterogeneous Multi-Server Computation Offloading With Enhanced Deep Reinforcement Learning in Vehicular Networks
Song, Xiaoqin
Zhang, Wenjing
Lei, Lei
Zhang, Xinting
Zhang, Lijuan
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (06): : 5323 - 5335
[38] Deep Reinforcement Learning for Energy-Efficient Data Dissemination Through UAV Networks
Ali, Abubakar S.
Al-Habob, Ahmed A.
Naser, Shimaa
Bariah, Lina
Dobre, Octavia A.
Muhaidat, Sami
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2024, 5 : 5567 - 5583
[39] A Learning-Based Cooperative Navigation Approach for Multi-UAV Systems Under Communication Coverage
Wu, Di
Cao, Zhuang
Lin, Xudong
Shu, Feng
Feng, Zikai
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2025, 12 (02): : 763 - 773
[40] Deep Reinforcement Learning Approach for Joint Trajectory Design in Multi-UAV IoT Networks
Xu, Shu
Zhan, Xiangyu
Li, Chunguo
Wang, Dongming
Yang, Luxi
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (03) : 3389 - 3394

← 1 2 3 4 5 →