DRL-Based Joint Resource Allocation and Platoon Control Optimization for UAV-Hosted Platoon Digital Twin

被引:0
作者
Wang, Lei [1 ,2 ]
Liang, Hongbin [1 ,2 ]
Tang, Yanmei [3 ]
Mao, Guotao [1 ,2 ]
Zhang, Han [1 ,2 ]
Zhao, Dongmei [4 ]
机构
[1] Southwest Jiaotong Univ, Sch Transportat & Logist, Natl United Engn Lab Integrated & Intelligent Tran, Chengdu 611756, Peoples R China
[2] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data Appl, Chengdu 611756, Peoples R China
[3] Southwest Jiaotong Univ, Informatizat & Network Management Off, Chengdu 611756, Peoples R China
[4] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4K1, Canada
来源
IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 22期
基金
中国国家自然科学基金;
关键词
Resource management; Vehicle dynamics; Autonomous aerial vehicles; Optimization; Numerical stability; Synchronization; Perturbation methods; Age of Information (AoI); deep reinforcement learning (DRL); digital twin (DT); platoon control; platoon; resource allocation; DELAY;
D O I
10.1109/JIOT.2024.3439576
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Digital twin (DT)-empowered platoon can improve platoon management efficiency and driving safety. However, the resource allocation scheme of low-latency platoon DT (PDT) and the interactions with platoon control strategy are important issues in the study of PDTs. In this article, we study the resource allocation in the PDT network and the interaction mechanism between PDT and platoon control for an unmanned aerial vehicle (UAV)-hosted PDT. We introduce the Age of Information (AoI) metrics to characterize the freshness of the DTs. To explore the impact of the PDT resource allocation scheme on the platoon control strategy, we propose a joint optimization model for power resource allocation and platoon control. Specifically, the allocation of power resources affects the PDT's AoI, and the high-latency PDT in turn affects the platoon control strategy. Our objective is minimize the weighted sum of the system's average energy consumption and the PDT's average peak AoI. To solve the problem, we first reformulate the power resource allocation problem over a period of time as a Markov decision process (MDP) model, and then propose the Dirichlet deep deterministic policy gradient (DDPG)-based power allocation (D3PGPA) method based on Dirichlet distribution and DDPG algorithm. The method can not only effectively explores the state space while satisfying the constraints of limited resources but also improve the stability of the algorithm. Numerical results show that the D3PGPA method can host a PDT with low AoI and improve the stability of the platoon. Besides, our proposed method performs stably and outperforms other benchmark methods.
引用
收藏
页码:37114 / 37126
页数:13
相关论文
共 22 条
  • [1] Joint Optimization of Trajectory Control, Resource Allocation, and User Association Based on DRL for Multi-Fixed-Wing UAV Networks
    Yin, Baolin
    Fang, Xuming
    Wang, Xianbin
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (10) : 13330 - 13343
  • [2] Joint UAV Placement Optimization, Resource Allocation, and Computation Offloading for THz Band: A DRL Approach
    Wang, Heng
    Zhang, Haijun
    Liu, Xiangnan
    Long, Keping
    Nallanathan, Arumugam
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (07) : 4890 - 4900
  • [3] A Dynamic Resource Allocation Model Based on SMDP and DRL Algorithm for Truck Platoon in Vehicle Network
    Liang, Hongbin
    Zhou, Shuya
    Liu, Xiaobo
    Zheng, Fangfang
    Hong, Xintao
    Zhou, Xuemei
    Zhao, Lian
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (12) : 10295 - 10305
  • [4] Resource Allocation for Dynamic Platoon Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning Method
    Wang, Lei
    Liang, Hongbin
    Mao, Guotao
    Zhao, Dongmei
    Liu, Qian
    Yao, Yiting
    Zhang, Han
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (10) : 15609 - 15620
  • [5] DRL-Based Resource Allocation and Trajectory Planning for NOMA-Enabled Multi-UAV Collaborative Caching 6G Network
    Qin, Peng
    Fu, Yang
    Zhang, Jing
    Geng, Suiyan
    Liu, Jiayan
    Zhao, Xiongwen
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (06) : 8750 - 8764
  • [6] A Digital Twin and Consensus Empowered Cooperative Control Framework for Platoon-Based Autonomous Driving
    Cao, Jiayu
    Leng, Supeng
    Xiong, Kai
    Chen, Xiaosha
    TSINGHUA SCIENCE AND TECHNOLOGY, 2025, 30 (03): : 1096 - 1111
  • [7] DRL-Based Joint Path Planning and Jamming Power Allocation Optimization for Suppressing Netted Radar System
    Li, Shengxiang
    Liu, Guangyi
    Zhang, Kai
    Qian, Zhisheng
    Ding, Siyuan
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 548 - 552
  • [8] Joint Service Caching, Communication and Computing Resource Allocation in Collaborative MEC Systems: A DRL-Based Two-Timescale Approach
    Liu, Qianqian
    Zhang, Haixia
    Zhang, Xin
    Yuan, Dongfeng
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (10) : 15493 - 15506
  • [9] Resource Allocation for Edge Computing-Based Vehicle Platoon on Freeway: A Contract-Optimization Approach
    Yang, Chao
    Lou, Wei
    Liu, Yi
    Xie, Shengli
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (12) : 15988 - 16000
  • [10] DRL-Based Joint Resource Allocation and Device Orchestration for Hierarchical Federated Learning in NOMA-Enabled Industrial IoT
    Zhao, Tantan
    Li, Fan
    He, Lijun
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (06) : 7468 - 7479