Data Collection Mechanism for UAV-Assisted Cellular Network Based on PPO

被引:3
作者
Chen, Tuo [1 ]
Dong, Feihong [2 ]
Ye, Hu [2 ]
Wang, Yun [3 ]
Wu, Bin [4 ]
机构
[1] Beijing Inst Satellite Informat Engn, Beijing 100080, Peoples R China
[2] Acad Mil Sci PLA, Syst Engn Inst, Beijing 100091, Peoples R China
[3] Chongqing Univ, Coll Microelect & Commun Engn, Chongqing 400044, Peoples R China
[4] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300072, Peoples R China
关键词
UAV; RL; PPO; data collection; path planning; ALLOCATION; ALTITUDE; DRL;
D O I
10.3390/electronics12061376
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unmanned aerial vehicles (UAVs) are increasingly gaining in application value in many fields because of their low cost, small size, high mobility and other advantages. In the scenario of traditional cellular networks, UAVs can be used as a kind of aerial mobile base station to collect information of edge users in time. Therefore, UAVs provide a promising communication tool for edge computing. However, due to the limited battery capacity, these may not be able to completely collect all the information. The path planning can ensure that the UAV collects as much data as possible under the limited flight distance, so it is very important to study the path planning of the UAV. In addition, due to the particularity of air-to-ground communication, the flying altitude of the UAV can have a crucial impact on the channel quality between the UAV and the user. As a mature technology, deep reinforcement learning (DRL) is an important algorithm in the field of machine learning which can be deployed in unknown environments. Deep reinforcement learning is applied to the data collection of UAV-assisted cellular networks, so that UAVs can find the best path planning and height joint optimization scheme, which ensures that UAVs can collect more information under the condition of limited energy consumption, save human and material resources as much as possible, and finally achieve higher application value. In this work, we transform the UAV path planning problem into an Markov decision process (MDP) problem. By applying the proximal policy optimization (PPO) algorithm, our proposed algorithm realizes the adaptive path planning of UAV. Simulations are conducted to verify the performance of the proposed scheme compared to the conventional scheme.
引用
收藏
页数:12
相关论文
共 28 条
[1]   UAV SECaaS: Game-Theoretic Formulation for Security as a Service in UAV Swarms [J].
Bansal, Gaurang ;
Chamola, Vinay ;
Sikdar, Biplab ;
Yu, Fei Richard .
IEEE SYSTEMS JOURNAL, 2022, 16 (04) :6209-6218
[2]   QoE Optimization for Live Video Streaming in UAV-to-UAV Communications via Deep Reinforcement Learning [J].
Burhanuddin, Liyana Adilla Binti ;
Liu, Xiaonan ;
Deng, Yansha ;
Challita, Ursula ;
Zahemszky, Andras .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (05) :5358-5370
[3]   Capacity Fade and Aging Effect on Lithium Battery Cells: A Real Case Vibration Test with UAV [J].
Caposciutti G. ;
Bandini G. ;
Marracci M. ;
Buffi A. ;
Tellini B. .
IEEE Journal on Miniaturization for Air and Space Systems, 2021, 2 (02) :76-83
[4]   Fast or Slow: An Autonomous Speed Control Approach for UAV-assisted IoT Data Collection Networks [J].
Chu, Nam H. ;
Dinh Thai Hoang ;
Nguyen, Diep N. ;
Nguyen Van Huynh ;
Dutkiewicz, Eryk .
2021 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2021,
[5]   Q-Learning: Theory and Applications [J].
Clifton, Jesse ;
Laber, Eric .
ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 7, 2020, 2020, 7 :279-301
[6]   Joint Air-to-Ground Scheduling in UAV-Aided Vehicular Communication: A DRL Approach With Partial Observations [J].
Deng, Danhao ;
Wang, Chaowei ;
Wang, Weidong .
IEEE COMMUNICATIONS LETTERS, 2022, 26 (07) :1628-1632
[7]   On the Optimal Mounting Angle for a Spinning LiDAR on a UAV [J].
Diels, Laurens ;
Vlaminck, Michiel ;
De Wit, Bart ;
Philips, Wilfried ;
Luong, Hiep .
IEEE SENSORS JOURNAL, 2022, 22 (21) :21240-21247
[8]  
Huiru Cao, 2020, 2020 IEEE 9th Joint International Information Technology and Artificial Intelligence Conference (ITAIC), P2133, DOI 10.1109/ITAIC49862.2020.9338964
[9]   Deep Reinforcement Learning for Persistent Cruise Control in UAV-aided Data Collection [J].
Kurunathan, Harrison ;
Li, Kai ;
Ni, Wei ;
Tovar, Eduardo ;
Dressler, Falko .
PROCEEDINGS OF THE IEEE 46TH CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN 2021), 2021, :347-350
[10]   A Novel UAV-Enabled Data Collection Scheme for Intelligent Transportation System Through UAV Speed Control [J].
Li, Xiong ;
Tan, Jiawei ;
Liu, Anfeng ;
Vijayakumar, Pandi ;
Kumar, Neeraj ;
Alazab, Mamoun .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (04) :2100-2110