Dense Multiagent Reinforcement Learning Aided Multi-UAV Information Coverage for Vehicular Networks

被引:5
|
作者
Fu, Hang [1 ,2 ]
Wang, Jingjing [1 ,2 ]
Chen, Jianrui [1 ,3 ]
Ren, Pengfei [1 ]
Zhang, Zheng [1 ]
Zhao, Guodong [4 ]
机构
[1] Beihang Univ, Sch Cyber Sci & Technol, Beijing 100191, Peoples R China
[2] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518000, Peoples R China
[4] Beihang Univ, Sch Aeronaut Sci & Engn, Beijing 100191, Peoples R China
来源
IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 12期
关键词
Heuristic algorithms; Autonomous aerial vehicles; Vehicle dynamics; Training; Internet of Things; Energy consumption; Decision making; Communication coverage; dense reinforcement learning; distributed multiunmanned aerial vehicle (UAV); multiagent reinforcement learning (MARL); vehicular networks; RESOURCE-ALLOCATION; COMMUNICATION; OPTIMIZATION; ALTITUDE; INTERNET;
D O I
10.1109/JIOT.2024.3367005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid development of wireless communication networks, UAVs serving as base stations are increasingly being applied in various scenarios which not only include edge computation and task offloading, but also involve emergency communication, vehicular network enhancement, etc. In order to enhance the utility of UAV base stations' allocation and deployment, a series of algorithms have been proposed, utilizing heuristic methods, learning-based algorithms, or optimization approaches. However, it is intractable for current algorithms to handle the exponential computation increment with UAV base stations increasing, and complicated application scenarios with high dynamic demands. To solve the above issues, we formulate a decision problem with a long sequence to optimize the deployment of multi-UAV base stations for maximizing vehicular networks' communication coverage ratio, which needs to be subject to co-constraints consisting of moving velocity, energy consumption, and communication coverage radius. To solve this optimization problem, we creatively propose an algorithm named dense multiagent reinforcement learning (DMARL), which is under the dual-layer nested decision-making framework, centralized training with decentralized deployment, and accelerates training by only collecting critical states into the dense sampling buffer. To prove our proposed algorithm's effectiveness and generalization ability, we conduct experimental simulations in scenarios with different scales. Corresponding results have been provided to verify our algorithm's superiority in training efficiency and performance metrics, including coverage ratio and energy consumption, compared with other algorithms.
引用
收藏
页码:21274 / 21286
页数:13
相关论文
共 50 条
  • [31] Throughput Maximization in NOMA Enhanced RIS-Assisted Multi-UAV Networks: A Deep Reinforcement Learning Approach
    Tang, Runzhi
    Wang, Junxuan
    Zhang, Yanyan
    Jiang, Fan
    Zhang, Xuewei
    Du, Jianbo
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (01) : 730 - 745
  • [32] Multiagent Reinforcement Learning in Controlling Offloading Ratio and Trajectory for Multi-UAV Mobile-Edge Computing
    Lee, Wonseok
    Kim, Taejoon
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (02) : 3417 - 3429
  • [33] Optimizing Virtual Functions Deployment in Multi-UAV IoT Networks
    Forghani, Athena
    Chin, Kwan-Wu
    Ros, Montserrat
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (11): : 20367 - 20378
  • [34] Deep Reinforcement Learning-Based Dual-Timescale Service Caching and Computation Offloading for Multi-UAV Assisted MEC Systems
    Lin, Na
    Han, Xiao
    Hawbani, Ammar
    Sun, Yunhe
    Guan, Yunchong
    Zhao, Liang
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2025, 22 (01): : 605 - 617
  • [35] Extrinsic-and-Intrinsic Reward-Based Multi-Agent Reinforcement Learning for Multi-UAV Cooperative Target Encirclement
    Chen, Jinchao
    Wang, Yang
    Zhang, Ying
    Lu, Yantao
    Shu, Qiuhao
    Hu, Yujiao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025,
  • [36] Channel-State Information-Driven Data Rate Optimization for Multi-UAV IoT Networks
    Bera, Abhishek
    Misra, Sudip
    Chatterjee, Chandranath
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (21) : 19177 - 19186
  • [37] UAV-Assisted Heterogeneous Multi-Server Computation Offloading With Enhanced Deep Reinforcement Learning in Vehicular Networks
    Song, Xiaoqin
    Zhang, Wenjing
    Lei, Lei
    Zhang, Xinting
    Zhang, Lijuan
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (06): : 5323 - 5335
  • [38] Deep Reinforcement Learning for Energy-Efficient Data Dissemination Through UAV Networks
    Ali, Abubakar S.
    Al-Habob, Ahmed A.
    Naser, Shimaa
    Bariah, Lina
    Dobre, Octavia A.
    Muhaidat, Sami
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2024, 5 : 5567 - 5583
  • [39] A Learning-Based Cooperative Navigation Approach for Multi-UAV Systems Under Communication Coverage
    Wu, Di
    Cao, Zhuang
    Lin, Xudong
    Shu, Feng
    Feng, Zikai
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2025, 12 (02): : 763 - 773
  • [40] Deep Reinforcement Learning Approach for Joint Trajectory Design in Multi-UAV IoT Networks
    Xu, Shu
    Zhan, Xiangyu
    Li, Chunguo
    Wang, Dongming
    Yang, Luxi
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (03) : 3389 - 3394