Exploiting Deep Reinforcement Learning for Stochastic AoI Minimization in Multi-UAV-assisted Wireless Networks

被引：0

作者：

Long, Yusi ^{[1
,2
]}

Zhuang, Jialin ^{[1
]}

Gong, Shimin ^{[1
,2
]}

Gu, Bo ^{[1
]}

Xu, Jing ^{[3
]}

Deng, Jing ^{[4
]}

机构：

[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen Campus, Shenzhen, Peoples R China

[2] Guangdong Prov Key Lab Fire Sci & Intelligent Eme, Guangzhou, Peoples R China

[3] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan, Hubei, Peoples R China

[4] UNC Greensboro, Dept Comp Sci, Greensboro, NC USA

来源：

2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024 | 2024年

基金：

中国国家自然科学基金;

关键词：

UAV; backscatter; NOMA; DRL; trajectory planning; Lyapunov optimization; INFORMATION; AGE;

D O I：

10.1109/WCNC57260.2024.10570857

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we consider a multiple unmanned aerial vehicles (UAVs)-assisted wireless sensing network, where low-power ground users (GUs) periodically sense the environmental information and upload the recent sensing information to a base station (BS). The GUs firstly backscatter their information to the UAVs and then the UAVs transmit the information to the BS by the non-orthogonal multiple access (NOMA) transmissions. Our goal is to minimize the long-term age-of-information (AoI) by jointly optimizing the UAV's sensing scheduling, transmission control, and trajectories. To solve this problem, we propose the Lyapunov-driven hierarchical proximal policy optimization framework, named Lya-HPPO, to decouple the multi-stage AoI minimization problem into several control subproblems. In each control subproblem, the UAVs' sensing scheduling and transmission control are firstly determined by the outer-loop deep reinforcement learning (DRL) approach, and then the inner-loop optimization module is to update the UAVs' trajectories. Simulation results verify that the proposed Lya-HPPO framework converges very fast to a stable value and can make online decisions in real time, while guaranteeing the long-term data buffer and AoI stability.

引用

页数：6

共 50 条

[21] Freshness-in-air: An AoI-inspired UAV-assisted wireless sensor networks [J].

Basnayaka, Chathuranga M. Wijerathna ;

Jayakody, Dushantha Nalin K. ;

Beko, Marko .

ICT EXPRESS, 2024, 10 (05) :1103-1109

[22] Multi-UAV-Assisted Federated Learning for Energy-Aware Distributed Edge Training [J].

Tang, Jianhang ;

Nie, Jiangtian ;

Zhang, Yang ;

Xiong, Zehui ;

Jiang, Wenchao ;

Guizani, Mohsen .

IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (01) :280-294

[23] Deep Reinforcement Learning Assisted UAV Trajectory and Resource Optimization for NOMA Networks [J].

Chen, Peixin ;

Zhao, Jian ;

Shen, Furao .

2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, :933-938

[24] Deep Reinforcement Learning Based AoI Minimization for NOMA-Enabled Integrated Satellite-Terrestrial Networks [J].

He, Xinyu ;

Yang, Yang ;

Lee, Jemin ;

He, Gang ;

Yan, Qing .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (02) :3567-3572

[25] Deep RL-based Trajectory Planning for AoI Minimization in UAV-assisted IoT [J].

Zhou, Conghao ;

He, Hongli ;

Yang, Peng ;

Lyu, Feng ;

Wu, Wen ;

Cheng, Nan ;

Shen, Xuemin .

2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,

[26] Secure Transmission Design of RIS Enabled UAV Communication Networks Exploiting Deep Reinforcement Learning [J].

Dong, Runze ;

Wang, Buhong ;

Cao, Kunrui ;

Tian, Jiwei ;

Cheng, Tianhao .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (06) :8404-8419

[27] Deep Reinforcement Learning Based Resource Allocation in Multi-UAV-Aided MEC Networks [J].

Chen, Jingxuan ;

Cao, Xianbin ;

Yang, Peng ;

Xiao, Meng ;

Ren, Siqiao ;

Zhao, Zhongliang ;

Wu, Dapeng Oliver .

IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (01) :296-309

[28] Online Altitude Control and Scheduling Policy for Minimizing AoI in UAV-Assisted IoT Wireless Networks [J].

Samir, Moataz ;

Assi, Chadi ;

Sharafeddine, Sanaa ;

Ghrayeb, Ali .

IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (07) :2493-2505

[29] Autonomous Maintenance in IoT Networks via AoI-driven Deep Reinforcement Learning [J].

Stamatakis, George ;

Pappas, Nikolaos ;

Fragkiadakis, Alexandros ;

Traganitis, Apostolos .

IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM WKSHPS 2021), 2021,

[30] Up-Downlink AoI-Driven Multi-Source Data Collection in UAV-Assisted Wireless Sensor Networks [J].

Zhao, Mingxiong ;

Xiao, Yiming ;

Yao, Jianping ;

Wang, Tongda ;

Lee, Jemin ;

Quek, Tony Q. S. .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2025, 24 (02) :1178-1192

← 1 2 3 4 5 →