Energy-Efficient UAV-Enabled Data Collection via Wireless Charging: A Reinforcement Learning Approach

被引:91
作者
Fu, Shu [1 ,2 ]
Tang, Yujie [3 ]
Wu, Yuan [4 ,5 ]
Zhang, Ning [6 ]
Gu, Huaxi [2 ]
Chen, Chen [1 ]
Liu, Min [1 ]
机构
[1] Chongqing Univ, Sch Microelect & Commun Engn, Chongqing 400044, Peoples R China
[2] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[3] Algoma Univ, Dept Comp Sci & Math, Sault Ste Marie, ON P6A 2G4, Canada
[4] Univ Macau, State Key Lab Internet Things Smart City, Macau, Peoples R China
[5] Univ Macau, Dept Comp Informat Sci, Macau, Peoples R China
[6] Univ Windsor, Dept Elect & Comp Engn, Windsor, ON N9B 3P4, Canada
基金
中国国家自然科学基金;
关键词
Data collection; design of reward function; energy efficiency; Q-learning; reinforcement learning; unmanned aerial vehicle (UAV); DEPLOYMENT; NETWORKS; DESIGN; COMMUNICATION; ALLOCATION; INTERNET; CHANNEL;
D O I
10.1109/JIOT.2021.3051370
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we study the application of unmanned aerial vehicle (UAV) for data collection with wireless charging, which is crucial for providing seamless coverage and improving system performance in the next-generation wireless networks. To this end, we propose a reinforcement learning-based approach to plan the route of UAV to collect sensor data from sensor devices scattered in the physical environment. Specifically, the physical environment is divided into multiple grids, where one spot for UAV hovering as well as the wireless charging of UAV is located at the center of each grid. Each grid has a spot for the UAV to hover, and moreover, there is a wireless charger at the center of each grid, which can provide wireless charging to UAV when it is hovering in the grid. When the UAV lacks energy, it can be charged by the wireless charger at the spot. By taking into account the collected data amount as well as the energy consumption, we formulate the problem of data collection with UAV as a Markov decision problem, and exploit Q-learning to find the optimal policy. In particular, we design the reward function considering the energy efficiency of UAV flight and data collection, based on which Q-table is updated for guiding the route of UAV. Through extensive simulation results, we verify that our proposed reward function can achieve a better performance in terms of the average throughput, delay of data collection, as well as the energy efficiency of UAV, in comparison with the conventional capacity-based reward function.
引用
收藏
页码:10209 / 10219
页数:11
相关论文
共 44 条
[1]   An Autonomous Learning-Based Algorithm for Joint Channel and Power Level Selection by D2D Pairs in Heterogeneous Cellular Networks [J].
Asheralieva, Alia ;
Miyanaga, Yoshikazu .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2016, 64 (09) :3996-4012
[2]   Design of Future UAV-Relay Tactical Data Link for Reliable UAV Control and Situational Awareness [J].
Baek, Hoki ;
Lim, Jaesung .
IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (10) :144-150
[3]  
Bayerlein H, 2018, CONF REC ASILOMAR C, P724, DOI 10.1109/ACSSC.2018.8645103
[4]   Online Trajectory and Radio Resource Optimization of Cache-Enabled UAV Wireless Networks With Content and Energy Recharging [J].
Chai, Shuqi ;
Lau, Vincent K. N. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 :1286-1299
[5]   Interference Management for Cellular-Connected UAVs: A Deep Reinforcement Learning Approach [J].
Challita, Ursula ;
Saad, Walid ;
Bettstetter, Christian .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (04) :2125-2140
[6]  
Dai HB, 2020, CHINA COMMUN, V17, P89, DOI 10.23919/JCC.2020.01.007
[7]   Q-Learning-Based Damping Control of Wide-Area Power Systems Under Cyber Uncertainties [J].
Duan, Jiajun ;
Xu, Hao ;
Liu, Wenxin .
IEEE TRANSACTIONS ON SMART GRID, 2018, 9 (06) :6408-6418
[8]   Autonomous UAV Trajectory for Localizing Ground Objects: A Reinforcement Learning Approach [J].
Ebrahimi, Dariush ;
Sharafeddine, Sanaa ;
Ho, Pin-Han ;
Assi, Chadi .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2021, 20 (04) :1312-1324
[9]   Genetic diversity of the enterohaemolysin gene (ehxA) in non-O157 Shiga toxin-producing Escherichia coli strains in China [J].
Fu, Shanshan ;
Bai, Xiangning ;
Fan, Ruyue ;
Sun, Hui ;
Xu, Yanmei ;
Xiong, Yanwen .
SCIENTIFIC REPORTS, 2018, 8
[10]   Joint Unmanned Aerial Vehicle (UAV) Deployment and Power Control for Internet of Things Networks [J].
Fu, Shu ;
Tang, Yujie ;
Zhang, Ning ;
Zhao, Lian ;
Wu, Shaohua ;
Jian, Xin .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (04) :4367-4378