Exploiting Deep Reinforcement Learning for Stochastic AoI Minimization in Multi-UAV-assisted Wireless Networks

被引:0
作者
Long, Yusi [1 ,2 ]
Zhuang, Jialin [1 ]
Gong, Shimin [1 ,2 ]
Gu, Bo [1 ]
Xu, Jing [3 ]
Deng, Jing [4 ]
机构
[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen Campus, Shenzhen, Peoples R China
[2] Guangdong Prov Key Lab Fire Sci & Intelligent Eme, Guangzhou, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan, Hubei, Peoples R China
[4] UNC Greensboro, Dept Comp Sci, Greensboro, NC USA
来源
2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024 | 2024年
基金
中国国家自然科学基金;
关键词
UAV; backscatter; NOMA; DRL; trajectory planning; Lyapunov optimization; INFORMATION; AGE;
D O I
10.1109/WCNC57260.2024.10570857
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we consider a multiple unmanned aerial vehicles (UAVs)-assisted wireless sensing network, where low-power ground users (GUs) periodically sense the environmental information and upload the recent sensing information to a base station (BS). The GUs firstly backscatter their information to the UAVs and then the UAVs transmit the information to the BS by the non-orthogonal multiple access (NOMA) transmissions. Our goal is to minimize the long-term age-of-information (AoI) by jointly optimizing the UAV's sensing scheduling, transmission control, and trajectories. To solve this problem, we propose the Lyapunov-driven hierarchical proximal policy optimization framework, named Lya-HPPO, to decouple the multi-stage AoI minimization problem into several control subproblems. In each control subproblem, the UAVs' sensing scheduling and transmission control are firstly determined by the outer-loop deep reinforcement learning (DRL) approach, and then the inner-loop optimization module is to update the UAVs' trajectories. Simulation results verify that the proposed Lya-HPPO framework converges very fast to a stable value and can make online decisions in real time, while guaranteeing the long-term data buffer and AoI stability.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Deep Reinforcement Learning Based AoI Minimization for NOMA-Enabled Integrated Satellite-Terrestrial Networks
    He, Xinyu
    Yang, Yang
    Lee, Jemin
    He, Gang
    Yan, Qing
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (02) : 3567 - 3572
  • [22] Deep Reinforcement Learning Assisted UAV Trajectory and Resource Optimization for NOMA Networks
    Chen, Peixin
    Zhao, Jian
    Shen, Furao
    2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, : 933 - 938
  • [23] Deep RL-based Trajectory Planning for AoI Minimization in UAV-assisted IoT
    Zhou, Conghao
    He, Hongli
    Yang, Peng
    Lyu, Feng
    Wu, Wen
    Cheng, Nan
    Shen, Xuemin
    2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
  • [24] Secure Transmission Design of RIS Enabled UAV Communication Networks Exploiting Deep Reinforcement Learning
    Dong, Runze
    Wang, Buhong
    Cao, Kunrui
    Tian, Jiwei
    Cheng, Tianhao
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (06) : 8404 - 8419
  • [25] Deep Reinforcement Learning Based Resource Allocation in Multi-UAV-Aided MEC Networks
    Chen, Jingxuan
    Cao, Xianbin
    Yang, Peng
    Xiao, Meng
    Ren, Siqiao
    Zhao, Zhongliang
    Wu, Dapeng Oliver
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (01) : 296 - 309
  • [26] Online Altitude Control and Scheduling Policy for Minimizing AoI in UAV-Assisted IoT Wireless Networks
    Samir, Moataz
    Assi, Chadi
    Sharafeddine, Sanaa
    Ghrayeb, Ali
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (07) : 2493 - 2505
  • [27] Up-Downlink AoI-Driven Multi-Source Data Collection in UAV-Assisted Wireless Sensor Networks
    Zhao, Mingxiong
    Xiao, Yiming
    Yao, Jianping
    Wang, Tongda
    Lee, Jemin
    Quek, Tony Q. S.
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2025, 24 (02) : 1178 - 1192
  • [28] AoI-Energy Tradeoff for Data Collection in UAV-Assisted Wireless Networks
    Zhang, Xin
    Chang, Zheng
    Hamalainen, Timo
    Min, Geyong
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (03) : 1849 - 1861
  • [29] Multi-UAV Reinforcement Learning for Data Collection in Cellular MIMO Networks
    Diaz-Vilor, Carles
    Abdelhady, Amr M.
    Eltawil, Ahmed M.
    Jafarkhani, Hamid
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (10) : 15462 - 15476
  • [30] Autonomous Maintenance in IoT Networks via AoI-driven Deep Reinforcement Learning
    Stamatakis, George
    Pappas, Nikolaos
    Fragkiadakis, Alexandros
    Traganitis, Apostolos
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM WKSHPS 2021), 2021,