Exploiting Deep Reinforcement Learning for Stochastic AoI Minimization in Multi-UAV-assisted Wireless Networks

被引：0

作者：

Long, Yusi ^{[1
,2
]}

Zhuang, Jialin ^{[1
]}

Gong, Shimin ^{[1
,2
]}

Gu, Bo ^{[1
]}

Xu, Jing ^{[3
]}

Deng, Jing ^{[4
]}

机构：

[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen Campus, Shenzhen, Peoples R China

[2] Guangdong Prov Key Lab Fire Sci & Intelligent Eme, Guangzhou, Peoples R China

[3] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan, Hubei, Peoples R China

[4] UNC Greensboro, Dept Comp Sci, Greensboro, NC USA

来源：

2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024 | 2024年

基金：

中国国家自然科学基金;

关键词：

UAV; backscatter; NOMA; DRL; trajectory planning; Lyapunov optimization; INFORMATION; AGE;

D O I：

10.1109/WCNC57260.2024.10570857

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we consider a multiple unmanned aerial vehicles (UAVs)-assisted wireless sensing network, where low-power ground users (GUs) periodically sense the environmental information and upload the recent sensing information to a base station (BS). The GUs firstly backscatter their information to the UAVs and then the UAVs transmit the information to the BS by the non-orthogonal multiple access (NOMA) transmissions. Our goal is to minimize the long-term age-of-information (AoI) by jointly optimizing the UAV's sensing scheduling, transmission control, and trajectories. To solve this problem, we propose the Lyapunov-driven hierarchical proximal policy optimization framework, named Lya-HPPO, to decouple the multi-stage AoI minimization problem into several control subproblems. In each control subproblem, the UAVs' sensing scheduling and transmission control are firstly determined by the outer-loop deep reinforcement learning (DRL) approach, and then the inner-loop optimization module is to update the UAVs' trajectories. Simulation results verify that the proposed Lya-HPPO framework converges very fast to a stable value and can make online decisions in real time, while guaranteeing the long-term data buffer and AoI stability.

引用

页数：6

共 50 条

[41] Collaborative computation offloading and wireless charging scheduling in multi-UAV-assisted MEC networks: A TD3-based approach [J].

Zhao, Liang ;

Yao, Yujun ;

Guo, Jianmeng ;

Zuo, Qingjun ;

Leung, Victor C. M. .

COMPUTER NETWORKS, 2024, 251

[42] UAV-Assisted NOMA for Enhancing ISAC: A Deep Reinforcement Learning Solution [J].

Amhaz, Ali ;

Elhattab, Mohamed ;

Sharafeddine, Sanaa ;

Assi, Chadi .

IEEE COMMUNICATIONS LETTERS, 2025, 29 (02) :249-253

[43] RIS-Assisted UAV Communications for IoT With Wireless Power Transfer Using Deep Reinforcement Learning [J].

Khoi Khac Nguyen ;

Masaracchia, Antonino ;

Sharma, Vishal ;

Poor, H. Vincent ;

Duong, Trung Q. .

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (05) :1086-1096

[44] Deep Reinforcement Learning for Multi-Hop Offloading in UAV-Assisted Edge Computing [J].

Nguyen Tien Hoa ;

Do Van Dai ;

Le Hoang Lan ;

Nguyen Cong Luong ;

Duc Van Le ;

Niyato, Dusit .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (12) :16917-16922

[45] Average AoI Minimization for Energy Harvesting Relay-Aided Status Update Network Using Deep Reinforcement Learning [J].

Huang, Sin-Yu ;

Liu, Kuang-Hao .

IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (08) :1464-1468

[46] Joint AoI-Aware UAVs Trajectory Planning and Data Collection in UAV-Based IoT Systems: A Deep Reinforcement Learning Approach [J].

Xiao, Xiongbing ;

Wang, Xiumin ;

Lin, Weiwei .

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (04) :6484-6495

[47] AoI Minimization using Multi-agent Proximal Policy Optimization in UAVs-assisted Sensor Networks [J].

Emami, Yousef ;

Li, Kai ;

Niu, Yong ;

Tovar, Eduardo .

ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, :228-233

[48] Multi-Agent Deep Reinforcement Learning Based Transmission Latency Minimization for Delay-Sensitive Cognitive Satellite-UAV Networks [J].

Guo, Shaoai ;

Zhao, Xiaohui .

IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (01) :131-144

[49] Multi-UAV-enabled AoI-aware WPCN: A Multi-agent Reinforcement Learning Strategy [J].

Oubbati, Omar Sami ;

Atiquzzaman, Mohammed ;

Lakas, Abderrahmane ;

Baz, Abdullah ;

Alhakami, Hosam ;

Alhakami, Wajdi .

IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM WKSHPS 2021), 2021,

[50] Toward Optimal Resource Allocation: A Multi-Agent DRL Based Task Offloading Approach in Multi-UAV-Assisted MEC Networks [J].

Tariq, Muhammad Naqqash ;

Wang, Jingyu ;

Raza, Salman ;

Siraj, Mohammad ;

Altamimi, Majid ;

Memon, Saifullah .

IEEE ACCESS, 2024, 12 :81428-81440

← 1 2 3 4 5 →