DRL-Based Joint Resource Allocation and Platoon Control Optimization for UAV-Hosted Platoon Digital Twin

被引:1
作者
Wang, Lei [1 ,2 ]
Liang, Hongbin [1 ,2 ]
Tang, Yanmei [3 ]
Mao, Guotao [1 ,2 ]
Zhang, Han [1 ,2 ]
Zhao, Dongmei [4 ]
机构
[1] Southwest Jiaotong Univ, Sch Transportat & Logist, Natl United Engn Lab Integrated & Intelligent Tran, Chengdu 611756, Peoples R China
[2] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data Appl, Chengdu 611756, Peoples R China
[3] Southwest Jiaotong Univ, Informatizat & Network Management Off, Chengdu 611756, Peoples R China
[4] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4K1, Canada
基金
中国国家自然科学基金;
关键词
Resource management; Vehicle dynamics; Autonomous aerial vehicles; Optimization; Numerical stability; Synchronization; Perturbation methods; Age of Information (AoI); deep reinforcement learning (DRL); digital twin (DT); platoon control; platoon; resource allocation; DELAY;
D O I
10.1109/JIOT.2024.3439576
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Digital twin (DT)-empowered platoon can improve platoon management efficiency and driving safety. However, the resource allocation scheme of low-latency platoon DT (PDT) and the interactions with platoon control strategy are important issues in the study of PDTs. In this article, we study the resource allocation in the PDT network and the interaction mechanism between PDT and platoon control for an unmanned aerial vehicle (UAV)-hosted PDT. We introduce the Age of Information (AoI) metrics to characterize the freshness of the DTs. To explore the impact of the PDT resource allocation scheme on the platoon control strategy, we propose a joint optimization model for power resource allocation and platoon control. Specifically, the allocation of power resources affects the PDT's AoI, and the high-latency PDT in turn affects the platoon control strategy. Our objective is minimize the weighted sum of the system's average energy consumption and the PDT's average peak AoI. To solve the problem, we first reformulate the power resource allocation problem over a period of time as a Markov decision process (MDP) model, and then propose the Dirichlet deep deterministic policy gradient (DDPG)-based power allocation (D3PGPA) method based on Dirichlet distribution and DDPG algorithm. The method can not only effectively explores the state space while satisfying the constraints of limited resources but also improve the stability of the algorithm. Numerical results show that the D3PGPA method can host a PDT with low AoI and improve the stability of the platoon. Besides, our proposed method performs stably and outperforms other benchmark methods.
引用
收藏
页码:37114 / 37126
页数:13
相关论文
共 32 条
[1]   D3PG: Dirichlet DDPG for Task Partitioning and Offloading With Constrained Hybrid Action Space in Mobile-Edge Computing [J].
Ale, Laha ;
King, Scott A. ;
Zhang, Ning ;
Sattar, Abdul Rahman ;
Skandaraniyam, Janahan .
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (19) :19260-19272
[2]   String Stability and a Delay-Based Spacing Policy for Vehicle Platoons Subject to Disturbances [J].
Besselink, Bart ;
Johansson, Karl H. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (09) :4376-4391
[3]   Better Platooning Control Toward Autonomous Driving [J].
Campolo, Claudia ;
Molinaro, Antonella ;
Araniti, Giuseppe ;
Berthet, Antoine O. .
IEEE VEHICULAR TECHNOLOGY MAGAZINE, 2017, 12 (01) :30-38
[4]   Cooperative Adaptive Cruise Control: A Reinforcement Learning Approach [J].
Desjardins, Charles ;
Chaib-draa, Brahim .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 12 (04) :1248-1260
[5]  
Du H., Digit. Commun. Netw.
[6]   Digital Twin Based Trajectory Prediction for Platoons of Connected Intelligent Vehicles [J].
Du, Hao ;
Leng, Supeng ;
He, Jianhua ;
Zhou, Longyu .
2021 IEEE 29TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP 2021), 2021,
[7]   Joint Communication and Control Optimization of a UAV-Assisted Multi-Vehicle Platooning System in Uncertain Communication Environment [J].
Duan, Xuting ;
Zhao, Yihan ;
Tian, Daxin ;
Zhou, Jianshan ;
Ma, Linru ;
Zhang, Long .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (03) :3177-3190
[8]   Weighted Energy-Efficiency Maximization for a UAV-Assisted Multiplatoon Mobile-Edge Computing System [J].
Duan, Xuting ;
Zhou, Yukang ;
Tian, Daxin ;
Zhou, Jianshan ;
Sheng, Zhengguo ;
Shen, Xuemin .
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (19) :18208-18220
[9]   String Stable and Collision-Safe Model Predictive Platoon Control [J].
Gratzer, Alexander L. ;
Thormann, Sebastian ;
Schirrer, Alexander ;
Jakubek, Stefan .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) :19358-19373
[10]  
Graves A, 2012, STUD COMPUT INTELL, V385, P37