Exploiting Deep Reinforcement Learning for Stochastic AoI Minimization in Multi-UAV-assisted Wireless Networks

被引:0
作者
Long, Yusi [1 ,2 ]
Zhuang, Jialin [1 ]
Gong, Shimin [1 ,2 ]
Gu, Bo [1 ]
Xu, Jing [3 ]
Deng, Jing [4 ]
机构
[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen Campus, Shenzhen, Peoples R China
[2] Guangdong Prov Key Lab Fire Sci & Intelligent Eme, Guangzhou, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan, Hubei, Peoples R China
[4] UNC Greensboro, Dept Comp Sci, Greensboro, NC USA
来源
2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024 | 2024年
基金
中国国家自然科学基金;
关键词
UAV; backscatter; NOMA; DRL; trajectory planning; Lyapunov optimization; INFORMATION; AGE;
D O I
10.1109/WCNC57260.2024.10570857
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we consider a multiple unmanned aerial vehicles (UAVs)-assisted wireless sensing network, where low-power ground users (GUs) periodically sense the environmental information and upload the recent sensing information to a base station (BS). The GUs firstly backscatter their information to the UAVs and then the UAVs transmit the information to the BS by the non-orthogonal multiple access (NOMA) transmissions. Our goal is to minimize the long-term age-of-information (AoI) by jointly optimizing the UAV's sensing scheduling, transmission control, and trajectories. To solve this problem, we propose the Lyapunov-driven hierarchical proximal policy optimization framework, named Lya-HPPO, to decouple the multi-stage AoI minimization problem into several control subproblems. In each control subproblem, the UAVs' sensing scheduling and transmission control are firstly determined by the outer-loop deep reinforcement learning (DRL) approach, and then the inner-loop optimization module is to update the UAVs' trajectories. Simulation results verify that the proposed Lya-HPPO framework converges very fast to a stable value and can make online decisions in real time, while guaranteeing the long-term data buffer and AoI stability.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] AoI-Energy Tradeoff for Data Collection in UAV-Assisted Wireless Networks
    Zhang, Xin
    Chang, Zheng
    Hamalainen, Timo
    Min, Geyong
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (03) : 1849 - 1861
  • [32] Throughput Maximization in NOMA Enhanced RIS-Assisted Multi-UAV Networks: A Deep Reinforcement Learning Approach
    Tang, Runzhi
    Wang, Junxuan
    Zhang, Yanyan
    Jiang, Fan
    Zhang, Xuewei
    Du, Jianbo
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (01) : 730 - 745
  • [33] Reconfigurable Intelligent Surface-Assisted Multi-UAV Networks: Efficient Resource Allocation With Deep Reinforcement Learning
    Khoi Khac Nguyen
    Khosravirad, Saeed R.
    da Costa, Daniel Benevides
    Nguyen, Long D.
    Duong, Trung Q.
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (03) : 358 - 368
  • [34] Minimizing age of information in multi-UAV-assisted IoT networks: a graph theoretical approach
    Omid Rahimi
    Alireza Shafieinejad
    [J]. Wireless Networks, 2024, 30 : 533 - 555
  • [35] Decentralized Trajectory and Power Control Based on Multi-Agent Deep Reinforcement Learning in UAV Networks
    Chen, Binqiang
    Liu, Dong
    Hanzo, Lajos
    [J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 3983 - 3988
  • [36] Minimizing age of information in multi-UAV-assisted IoT networks: a graph theoretical approach
    Rahimi, Omid
    Shafieinejad, Alireza
    [J]. WIRELESS NETWORKS, 2024, 30 (01) : 533 - 555
  • [37] AoI-Minimal Trajectory Planning and Data Collection in UAV-Assisted Wireless Powered IoT Networks
    Hu, Huimin
    Xiong, Ke
    Qu, Gang
    Ni, Qiang
    Fan, Pingyi
    Ben Letaief, Khaled
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (02) : 1211 - 1223
  • [38] Deep Reinforcement Learning for Age of Information Minimization in Reservation Multi-Access Networks
    Ji, Zhengyang
    Song, Xiaoshi
    [J]. 2024 13TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS, ICCCAS 2024, 2024, : 385 - 390
  • [39] Collaborative computation offloading and wireless charging scheduling in multi-UAV-assisted MEC networks: A TD3-based approach
    Zhao, Liang
    Yao, Yujun
    Guo, Jianmeng
    Zuo, Qingjun
    Leung, Victor C. M.
    [J]. COMPUTER NETWORKS, 2024, 251
  • [40] UAV-Assisted NOMA for Enhancing ISAC: A Deep Reinforcement Learning Solution
    Amhaz, Ali
    Elhattab, Mohamed
    Sharafeddine, Sanaa
    Assi, Chadi
    [J]. IEEE COMMUNICATIONS LETTERS, 2025, 29 (02) : 249 - 253