Age of Information Minimization Using Multi-Agent UAVs Based on AI-Enhanced Mean Field Resource Allocation

被引：6

作者：

Emami, Yousef ^{[1
]}

Gao, Hao ^{[2
]}

Li, Kai ^{[3
,4
]}

Almeida, Luis ^{[5
,6
]}

Tovar, Eduardo ^{[1
]}

Han, Zhu ^{[2
,7
]}

机构：

[1] Real Time & Embedded Comp Syst Res Ctr CISTER, P-4200135 Porto, Portugal

[2] Univ Houston, Dept Elect & Comp Engn, Houston, TX 77004 USA

[3] Univ Cambridge, Dept Engn, Cambridge CB3 0FA, England

[4] Real Time & Embedded Comp Syst Res Ctr CISTER, P-4249015 Porto, Portugal

[5] CISTER Res Ctr, P-4200135 Porto, Portugal

[6] Univ Porto, Fac Engn Sci, P-4200465 Porto, Portugal

[7] Kyung Hee Univ, Dept Comp Sci & Engn, Seoul 446701, South Korea

来源：

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2024年 / 73卷 / 09期

基金：

日本科学技术振兴机构;

关键词：

Autonomous aerial vehicles; Sensors; Resource management; Trajectory; Optimization; Cruise control; Data collection; UAV; mean-field game; age of information; proximal policy optimization; long short term memory; FLIGHT CONTROL; DEEP; INTERNET; NETWORKS;

D O I：

10.1109/TVT.2024.3394235

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Unmanned Aerial Vehicle (UAV) swarms play an effective role in timely data collection from ground sensors in remote and hostile areas. Optimizing the collective behavior of swarms can improve data collection performance. This paper puts forth a new mean field flight resource allocation optimization to minimize age of information (AoI) of sensory data, where balancing the trade-off between the UAVs' movements and AoI is formulated as a mean field game (MFG). The MFG optimization yields an expansive solution space encompassing continuous state and action, resulting in significant computational complexity. To address practical situations, we propose, a new mean field hybrid proximal policy optimization (MF-HPPO) scheme to minimize the average AoI by optimizing the UAV's trajectories and data collection scheduling of the ground sensors given mixed continuous and discrete actions. Furthermore, a long short term memory (LSTM) is leveraged in MF-HPPO to predict the time-varying network state and stabilize the training. Numerical results demonstrate that the proposed MF-HPPO reduces the average AoI by up to 45% and 57% in the considered simulation setting, as compared to multi-agent deep Q-learning (MADQN) method and non-learning random algorithm, respectively.

引用

页码：13368 / 13380

页数：13

共 51 条

[1]

Abd-Elmagid M. A., 2019, P IEEE GLOB COMM C, P1

[2] Optimal LAP Altitude for Maximum Coverage [J].

Al-Hourani, Akram ;

Kandeepan, Sithamparanathan ;

Lardner, Simon .

IEEE WIRELESS COMMUNICATIONS LETTERS, 2014, 3 (06) :569-572

[3] Mean Field Deep Reinforcement Learning for Fair and Efficient UAV Control [J].

Chen, Dezhi ;

Qi, Qi ;

Zhuang, Zirui ;

Wang, Jingyu ;

Liao, Jianxin ;

Han, Zhu .

IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (02) :813-828

[4]

Chi Kai, 2022, 2022 IEEE 5th International Conference on Electronic Information and Communication Technology (ICEICT), P57, DOI 10.1109/ICEICT55736.2022.9909005

[5] Age Minimization in Massive IoT via UAV Swarm: A Multi-agent Reinforcement Learning Approach [J].

Eldeeb, Eslam ;

Shehab, Mohammad ;

Alves, Hirley .

2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,

[6] Multi-UAV Path Learning for Age and Power Optimization in IoT With UAV Battery Recharge [J].

Eldeeb, Eslam ;

Sant'Ana, Jean Michel de Souza ;

Perez, Dian Echevarria ;

Shehab, Mohammad ;

Mahmood, Nurul Huda ;

Alves, Hirley .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (04) :5356-5360

[7] A Learning-Based Trajectory Planning of Multiple UAVs for AoI Minimization in IoT Networks [J].

Eldeeb, Eslam ;

Perez, Dian Echevarria ;

Sant'Ana, Jean Michel de Souza ;

Shehab, Mohammad ;

Mahmood, Nurul Huda ;

Alves, Hirley ;

Latva-Aho, Matti .

2022 JOINT EUROPEAN CONFERENCE ON NETWORKS AND COMMUNICATIONS & 6G SUMMIT (EUCNC/6G SUMMIT), 2022, :172-177

[8] Joint Communication Scheduling and Velocity Control in Multi-UAV-Assisted Sensor Networks: A Deep Reinforcement Learning Approach [J].

Emami, Yousef ;

Wei, Bo ;

Li, Kai ;

Ni, Wei ;

Tovar, Eduardo .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (10) :10986-10998

[9] Energy-Efficient Velocity Control for Massive Numbers of UAVs: A Mean Field Game Approach [J].

Gao, Hao ;

Lee, Wonjun ;

Kang, Yuhan ;

Li, Wuchen ;

Han, Zhu ;

Osher, Stanley ;

Poor, H. Vincent .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (06) :6266-6278

[10] A Review on UAV-Based Remote Sensing Technologies for Construction and Civil Applications [J].

Guan, Shanyue ;

Zhu, Zhen ;

Wang, George .

DRONES, 2022, 6 (05)

← 1 2 3 4 5 6 →