Exploiting Deep Reinforcement Learning for Stochastic AoI Minimization in Multi-UAV-assisted Wireless Networks

被引:0
作者
Long, Yusi [1 ,2 ]
Zhuang, Jialin [1 ]
Gong, Shimin [1 ,2 ]
Gu, Bo [1 ]
Xu, Jing [3 ]
Deng, Jing [4 ]
机构
[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen Campus, Shenzhen, Peoples R China
[2] Guangdong Prov Key Lab Fire Sci & Intelligent Eme, Guangzhou, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan, Hubei, Peoples R China
[4] UNC Greensboro, Dept Comp Sci, Greensboro, NC USA
来源
2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024 | 2024年
基金
中国国家自然科学基金;
关键词
UAV; backscatter; NOMA; DRL; trajectory planning; Lyapunov optimization; INFORMATION; AGE;
D O I
10.1109/WCNC57260.2024.10570857
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we consider a multiple unmanned aerial vehicles (UAVs)-assisted wireless sensing network, where low-power ground users (GUs) periodically sense the environmental information and upload the recent sensing information to a base station (BS). The GUs firstly backscatter their information to the UAVs and then the UAVs transmit the information to the BS by the non-orthogonal multiple access (NOMA) transmissions. Our goal is to minimize the long-term age-of-information (AoI) by jointly optimizing the UAV's sensing scheduling, transmission control, and trajectories. To solve this problem, we propose the Lyapunov-driven hierarchical proximal policy optimization framework, named Lya-HPPO, to decouple the multi-stage AoI minimization problem into several control subproblems. In each control subproblem, the UAVs' sensing scheduling and transmission control are firstly determined by the outer-loop deep reinforcement learning (DRL) approach, and then the inner-loop optimization module is to update the UAVs' trajectories. Simulation results verify that the proposed Lya-HPPO framework converges very fast to a stable value and can make online decisions in real time, while guaranteeing the long-term data buffer and AoI stability.
引用
收藏
页数:6
相关论文
共 50 条
[21]   Freshness-in-air: An AoI-inspired UAV-assisted wireless sensor networks [J].
Basnayaka, Chathuranga M. Wijerathna ;
Jayakody, Dushantha Nalin K. ;
Beko, Marko .
ICT EXPRESS, 2024, 10 (05) :1103-1109
[22]   Multi-UAV-Assisted Federated Learning for Energy-Aware Distributed Edge Training [J].
Tang, Jianhang ;
Nie, Jiangtian ;
Zhang, Yang ;
Xiong, Zehui ;
Jiang, Wenchao ;
Guizani, Mohsen .
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (01) :280-294
[23]   Deep Reinforcement Learning Assisted UAV Trajectory and Resource Optimization for NOMA Networks [J].
Chen, Peixin ;
Zhao, Jian ;
Shen, Furao .
2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, :933-938
[24]   Deep Reinforcement Learning Based AoI Minimization for NOMA-Enabled Integrated Satellite-Terrestrial Networks [J].
He, Xinyu ;
Yang, Yang ;
Lee, Jemin ;
He, Gang ;
Yan, Qing .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (02) :3567-3572
[25]   Deep RL-based Trajectory Planning for AoI Minimization in UAV-assisted IoT [J].
Zhou, Conghao ;
He, Hongli ;
Yang, Peng ;
Lyu, Feng ;
Wu, Wen ;
Cheng, Nan ;
Shen, Xuemin .
2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
[26]   Secure Transmission Design of RIS Enabled UAV Communication Networks Exploiting Deep Reinforcement Learning [J].
Dong, Runze ;
Wang, Buhong ;
Cao, Kunrui ;
Tian, Jiwei ;
Cheng, Tianhao .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (06) :8404-8419
[27]   Deep Reinforcement Learning Based Resource Allocation in Multi-UAV-Aided MEC Networks [J].
Chen, Jingxuan ;
Cao, Xianbin ;
Yang, Peng ;
Xiao, Meng ;
Ren, Siqiao ;
Zhao, Zhongliang ;
Wu, Dapeng Oliver .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (01) :296-309
[28]   Online Altitude Control and Scheduling Policy for Minimizing AoI in UAV-Assisted IoT Wireless Networks [J].
Samir, Moataz ;
Assi, Chadi ;
Sharafeddine, Sanaa ;
Ghrayeb, Ali .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (07) :2493-2505
[29]   Autonomous Maintenance in IoT Networks via AoI-driven Deep Reinforcement Learning [J].
Stamatakis, George ;
Pappas, Nikolaos ;
Fragkiadakis, Alexandros ;
Traganitis, Apostolos .
IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM WKSHPS 2021), 2021,
[30]   Up-Downlink AoI-Driven Multi-Source Data Collection in UAV-Assisted Wireless Sensor Networks [J].
Zhao, Mingxiong ;
Xiao, Yiming ;
Yao, Jianping ;
Wang, Tongda ;
Lee, Jemin ;
Quek, Tony Q. S. .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2025, 24 (02) :1178-1192