AoI Minimization using Multi-agent Proximal Policy Optimization in UAVs-assisted Sensor Networks

被引:3
作者
Emami, Yousef [1 ]
Li, Kai [1 ]
Niu, Yong [2 ]
Tovar, Eduardo [1 ]
机构
[1] CISTER Res Ctr, Porto, Portugal
[2] Beijing Jiaotong Univ, Beijing, Peoples R China
来源
ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS | 2023年
基金
中国国家自然科学基金;
关键词
UAVs; Age of information; Proximal policy optimization; INFORMATION; AGE;
D O I
10.1109/ICC45041.2023.10278748
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Unmanned Aerial Vehicle (UAV) swarm can be employed to collect time-sensitive data of ground sensors in remote and hostile areas. Inadequate design of UAVs ' trajectories and data collection schedule incur delay and negatively impact the information freshness of ground sensors. This paper aims to jointly optimize the trajectories and data collection schedules of multiple UAVs to minimize the average Age of Information (AoI), adapting to the AoI of the ground sensors, and the trajectories of the UAVs. The optimization is formulated as a multi-agent Markov decision process (MMDP), where network states consist of AoI at the ground sensors and the flight trajectories. In practice, a multi-UAV-assisted sensor network contains a large number of network states and actions in MMDP. Exploring the actions of multiple agents in a large state space results in considerable training uncertainties that destabilize the AoI minimization. For stabilizing the formulated MMDP, we propose an onboard Proximal Policy Optimization-based flight resource allocation scheme (PPO-FRAS), which conducts an on-policy learning to optimize the trajectories of the UAVs and data collection schedule of the ground sensors. Numerical results show that the proposed PPO-FRAS achieves 28% and 59% lower AoI than the existing trajectory planning solution based on Deep QNetwork and the greedy algorithm, respectively.
引用
收藏
页码:228 / 233
页数:6
相关论文
共 22 条
[1]  
Abd-Elmagid MA, 2019, IEEE GLOB COMM CONF, DOI [10.1109/globecom38437.2019.9013924, 10.1109/ITCE.2019.8646549, 10.1109/itce.2019.8646549]
[2]   Average Peak Age-of-Information Minimization in UAV-Assisted IoT Networks [J].
Abd-Elmagid, Mohamed A. ;
Dhillon, Harpreet S. .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (02) :2003-2008
[3]   Optimal LAP Altitude for Maximum Coverage [J].
Al-Hourani, Akram ;
Kandeepan, Sithamparanathan ;
Lardner, Simon .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2014, 3 (06) :569-572
[4]   Optimum Sensors Allocation for a Forest Fires Monitoring System [J].
Azevedo, Beatriz Flamia ;
Brito, Thadeu ;
Lima, Jose ;
Pereira, Ana, I .
FORESTS, 2021, 12 (04)
[5]   Mobile Crowdsensing for Data Freshness: A Deep Reinforcement Learning Approach [J].
Dai, Zipeng ;
Wang, Hao ;
Liu, Chi Harold ;
Han, Rui ;
Tang, Jian ;
Wang, Guoren .
IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2021), 2021,
[6]   Joint Communication Scheduling and Velocity Control in Multi-UAV-Assisted Sensor Networks: A Deep Reinforcement Learning Approach [J].
Emami, Yousef ;
Wei, Bo ;
Li, Kai ;
Ni, Wei ;
Tovar, Eduardo .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (10) :10986-10998
[7]  
Kurunathan H., 2022, arXiv
[8]  
Li Kai, 2022, IEEE Internet of Things Magazine, V5, P136, DOI [10.1109/iotm.001.2100161, 10.1109/IOTM.001.2100161]
[9]   Deep-Graph-Based Reinforcement Learning for Joint Cruise Control and Task Offloading for Aerial Edge Internet of Things (EdgeIoT) [J].
Li, Kai ;
Ni, Wei ;
Yuan, Xin ;
Noor, Alam ;
Jamalipour, Abbas .
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (21) :21676-21686
[10]   Data-Driven Flight Control of Internet-of-Drones for Sensor Data Aggregation Using Multi-Agent Deep Reinforcement Learning [J].
Li, Kai ;
Ni, Wei ;
Emami, Yousef ;
Dressler, Falko .
IEEE WIRELESS COMMUNICATIONS, 2022, 29 (04) :18-23