Exploiting Deep Reinforcement Learning for Stochastic AoI Minimization in Multi-UAV-assisted Wireless Networks

被引：0

作者：

Long, Yusi ^{[1
,2
]}

Zhuang, Jialin ^{[1
]}

Gong, Shimin ^{[1
,2
]}

Gu, Bo ^{[1
]}

Xu, Jing ^{[3
]}

Deng, Jing ^{[4
]}

机构：

[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen Campus, Shenzhen, Peoples R China

[2] Guangdong Prov Key Lab Fire Sci & Intelligent Eme, Guangzhou, Peoples R China

[3] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan, Hubei, Peoples R China

[4] UNC Greensboro, Dept Comp Sci, Greensboro, NC USA

来源：

2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024 | 2024年

基金：

中国国家自然科学基金;

关键词：

UAV; backscatter; NOMA; DRL; trajectory planning; Lyapunov optimization; INFORMATION; AGE;

D O I：

10.1109/WCNC57260.2024.10570857

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we consider a multiple unmanned aerial vehicles (UAVs)-assisted wireless sensing network, where low-power ground users (GUs) periodically sense the environmental information and upload the recent sensing information to a base station (BS). The GUs firstly backscatter their information to the UAVs and then the UAVs transmit the information to the BS by the non-orthogonal multiple access (NOMA) transmissions. Our goal is to minimize the long-term age-of-information (AoI) by jointly optimizing the UAV's sensing scheduling, transmission control, and trajectories. To solve this problem, we propose the Lyapunov-driven hierarchical proximal policy optimization framework, named Lya-HPPO, to decouple the multi-stage AoI minimization problem into several control subproblems. In each control subproblem, the UAVs' sensing scheduling and transmission control are firstly determined by the outer-loop deep reinforcement learning (DRL) approach, and then the inner-loop optimization module is to update the UAVs' trajectories. Simulation results verify that the proposed Lya-HPPO framework converges very fast to a stable value and can make online decisions in real time, while guaranteeing the long-term data buffer and AoI stability.

引用

页数：6

共 50 条

[31] AoI-Energy Tradeoff for Data Collection in UAV-Assisted Wireless Networks
Zhang, Xin
Chang, Zheng
Hamalainen, Timo
Min, Geyong
[J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (03) : 1849 - 1861
[32] Throughput Maximization in NOMA Enhanced RIS-Assisted Multi-UAV Networks: A Deep Reinforcement Learning Approach
Tang, Runzhi
Wang, Junxuan
Zhang, Yanyan
Jiang, Fan
Zhang, Xuewei
Du, Jianbo
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (01) : 730 - 745
[33] Reconfigurable Intelligent Surface-Assisted Multi-UAV Networks: Efficient Resource Allocation With Deep Reinforcement Learning
Khoi Khac Nguyen
Khosravirad, Saeed R.
da Costa, Daniel Benevides
Nguyen, Long D.
Duong, Trung Q.
[J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (03) : 358 - 368
[34] Minimizing age of information in multi-UAV-assisted IoT networks: a graph theoretical approach
Omid Rahimi
Alireza Shafieinejad
[J]. Wireless Networks, 2024, 30 : 533 - 555
[35] Decentralized Trajectory and Power Control Based on Multi-Agent Deep Reinforcement Learning in UAV Networks
Chen, Binqiang
Liu, Dong
Hanzo, Lajos
[J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 3983 - 3988
[36] Minimizing age of information in multi-UAV-assisted IoT networks: a graph theoretical approach
Rahimi, Omid
Shafieinejad, Alireza
[J]. WIRELESS NETWORKS, 2024, 30 (01) : 533 - 555
[37] AoI-Minimal Trajectory Planning and Data Collection in UAV-Assisted Wireless Powered IoT Networks
Hu, Huimin
Xiong, Ke
Qu, Gang
Ni, Qiang
Fan, Pingyi
Ben Letaief, Khaled
[J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (02) : 1211 - 1223
[38] Deep Reinforcement Learning for Age of Information Minimization in Reservation Multi-Access Networks
Ji, Zhengyang
Song, Xiaoshi
[J]. 2024 13TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS, ICCCAS 2024, 2024, : 385 - 390
[39] Collaborative computation offloading and wireless charging scheduling in multi-UAV-assisted MEC networks: A TD3-based approach
Zhao, Liang
Yao, Yujun
Guo, Jianmeng
Zuo, Qingjun
Leung, Victor C. M.
[J]. COMPUTER NETWORKS, 2024, 251
[40] UAV-Assisted NOMA for Enhancing ISAC: A Deep Reinforcement Learning Solution
Amhaz, Ali
Elhattab, Mohamed
Sharafeddine, Sanaa
Assi, Chadi
[J]. IEEE COMMUNICATIONS LETTERS, 2025, 29 (02) : 249 - 253

← 1 2 3 4 5 →