Exploiting Deep Reinforcement Learning for Stochastic AoI Minimization in Multi-UAV-assisted Wireless Networks

被引:0
作者
Long, Yusi [1 ,2 ]
Zhuang, Jialin [1 ]
Gong, Shimin [1 ,2 ]
Gu, Bo [1 ]
Xu, Jing [3 ]
Deng, Jing [4 ]
机构
[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen Campus, Shenzhen, Peoples R China
[2] Guangdong Prov Key Lab Fire Sci & Intelligent Eme, Guangzhou, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan, Hubei, Peoples R China
[4] UNC Greensboro, Dept Comp Sci, Greensboro, NC USA
来源
2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024 | 2024年
基金
中国国家自然科学基金;
关键词
UAV; backscatter; NOMA; DRL; trajectory planning; Lyapunov optimization; INFORMATION; AGE;
D O I
10.1109/WCNC57260.2024.10570857
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we consider a multiple unmanned aerial vehicles (UAVs)-assisted wireless sensing network, where low-power ground users (GUs) periodically sense the environmental information and upload the recent sensing information to a base station (BS). The GUs firstly backscatter their information to the UAVs and then the UAVs transmit the information to the BS by the non-orthogonal multiple access (NOMA) transmissions. Our goal is to minimize the long-term age-of-information (AoI) by jointly optimizing the UAV's sensing scheduling, transmission control, and trajectories. To solve this problem, we propose the Lyapunov-driven hierarchical proximal policy optimization framework, named Lya-HPPO, to decouple the multi-stage AoI minimization problem into several control subproblems. In each control subproblem, the UAVs' sensing scheduling and transmission control are firstly determined by the outer-loop deep reinforcement learning (DRL) approach, and then the inner-loop optimization module is to update the UAVs' trajectories. Simulation results verify that the proposed Lya-HPPO framework converges very fast to a stable value and can make online decisions in real time, while guaranteeing the long-term data buffer and AoI stability.
引用
收藏
页数:6
相关论文
共 50 条
[41]   Collaborative computation offloading and wireless charging scheduling in multi-UAV-assisted MEC networks: A TD3-based approach [J].
Zhao, Liang ;
Yao, Yujun ;
Guo, Jianmeng ;
Zuo, Qingjun ;
Leung, Victor C. M. .
COMPUTER NETWORKS, 2024, 251
[42]   UAV-Assisted NOMA for Enhancing ISAC: A Deep Reinforcement Learning Solution [J].
Amhaz, Ali ;
Elhattab, Mohamed ;
Sharafeddine, Sanaa ;
Assi, Chadi .
IEEE COMMUNICATIONS LETTERS, 2025, 29 (02) :249-253
[43]   RIS-Assisted UAV Communications for IoT With Wireless Power Transfer Using Deep Reinforcement Learning [J].
Khoi Khac Nguyen ;
Masaracchia, Antonino ;
Sharma, Vishal ;
Poor, H. Vincent ;
Duong, Trung Q. .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (05) :1086-1096
[44]   Deep Reinforcement Learning for Multi-Hop Offloading in UAV-Assisted Edge Computing [J].
Nguyen Tien Hoa ;
Do Van Dai ;
Le Hoang Lan ;
Nguyen Cong Luong ;
Duc Van Le ;
Niyato, Dusit .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (12) :16917-16922
[45]   Average AoI Minimization for Energy Harvesting Relay-Aided Status Update Network Using Deep Reinforcement Learning [J].
Huang, Sin-Yu ;
Liu, Kuang-Hao .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (08) :1464-1468
[46]   Joint AoI-Aware UAVs Trajectory Planning and Data Collection in UAV-Based IoT Systems: A Deep Reinforcement Learning Approach [J].
Xiao, Xiongbing ;
Wang, Xiumin ;
Lin, Weiwei .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (04) :6484-6495
[47]   AoI Minimization using Multi-agent Proximal Policy Optimization in UAVs-assisted Sensor Networks [J].
Emami, Yousef ;
Li, Kai ;
Niu, Yong ;
Tovar, Eduardo .
ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, :228-233
[48]   Multi-Agent Deep Reinforcement Learning Based Transmission Latency Minimization for Delay-Sensitive Cognitive Satellite-UAV Networks [J].
Guo, Shaoai ;
Zhao, Xiaohui .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (01) :131-144
[49]   Multi-UAV-enabled AoI-aware WPCN: A Multi-agent Reinforcement Learning Strategy [J].
Oubbati, Omar Sami ;
Atiquzzaman, Mohammed ;
Lakas, Abderrahmane ;
Baz, Abdullah ;
Alhakami, Hosam ;
Alhakami, Wajdi .
IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM WKSHPS 2021), 2021,
[50]   Toward Optimal Resource Allocation: A Multi-Agent DRL Based Task Offloading Approach in Multi-UAV-Assisted MEC Networks [J].
Tariq, Muhammad Naqqash ;
Wang, Jingyu ;
Raza, Salman ;
Siraj, Mohammad ;
Altamimi, Majid ;
Memon, Saifullah .
IEEE ACCESS, 2024, 12 :81428-81440