Delay-Sensitive Energy-Efficient UAV Crowdsensing by Deep Reinforcement Learning

被引:57
作者
Dai, Zipeng [1 ]
Liu, Chi Harold [1 ]
Han, Rui [1 ]
Wang, Guoren [1 ]
Leung, Kin K. K. [2 ]
Tang, Jian [3 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100081, Peoples R China
[2] Imperial Coll, Elect & Elect Engn Dept, London SW7 2BT, England
[3] Midea Grp, Beijing 100088, Peoples R China
基金
中国国家自然科学基金;
关键词
Sensors; Task analysis; Crowdsensing; Data collection; Navigation; Delays; Computational modeling; UAV crowdsensing; delay-sensitive applications; energy-efficiency; deep reinforcement learning; TRAJECTORY DESIGN; TASK ASSIGNMENT; DATA-COLLECTION; NAVIGATION;
D O I
10.1109/TMC.2021.3113052
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mobile crowdsensing (MCS) by unmanned aerial vehicles (UAVs) servicing delay-sensitive applications becomes popular by navigating a group of UAVs to take advantage of their equipped high-precision sensors and durability for data collection in harsh environments. In this paper, we aim to simultaneously maximize collected data amount, geographical fairness, and minimize the energy consumption of all UAVs, as well as to guarantee the data freshness by setting a deadline in each timeslot. Specifically, we propose a centralized control, distributed execution framework by decentralized deep reinforcement learning (DRL) for delay-sensitive and energy-efficient UAV crowdsensing, called "DRL-eFresh". It includes a synchronous computational architecture with GRU sequential modeling to generate multi-UAV navigation decisions. Also, we derive an optimal time allocation solution for data collection while considering all UAV efforts and avoiding much data dropout due to limited data upload time and wireless data rate. Simulation results show that DRL-eFresh significantly improves the energy efficiency, as compared to the best baseline DPPO, by 14% and 22% on average when varying different sensing ranges and number of PoIs, respectively.
引用
收藏
页码:2038 / 2052
页数:15
相关论文
共 53 条
[31]  
Singhal C, 2017, ADV WIREL TECHNOL TE, P1, DOI 10.4018/978-1-5225-2023-8
[32]   Crowd Navigation in an Unknown and Dynamic Environment Based on Deep Reinforcement Learning [J].
Sun, Libo ;
Zhai, Jinfeng ;
Qin, Wenhu .
IEEE ACCESS, 2019, 7 :109544-109554
[33]  
Sun Y, 2016, IEEE INFOCOM SER
[34]   Social-Aware UAV-Assisted Mobile Crowd Sensing in Stochastic and Dynamic Environments for Disaster Relief Networks [J].
Wang, Bowen ;
Sun, Yanjing ;
Liu, Dianxiong ;
Nguyen, Hien M. ;
Duong, Trung Q. .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (01) :1070-1074
[35]   Deep-Reinforcement-Learning-Based Autonomous UAV Navigation With Sparse Rewards [J].
Wang, Chao ;
Wang, Jian ;
Wang, Jingjing ;
Zhang, Xudong .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07) :6180-6190
[36]   An Efficient Prediction-Based User Recruitment for Mobile Crowdsensing [J].
Wang, En ;
Yang, Yongjian ;
Wu, Jie ;
Liu, Wenbin ;
Wang, Xingbo .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2018, 17 (01) :16-28
[37]   Energy-Efficient 3D Vehicular Crowdsourcing For Disaster Response by Distributed Deep Reinforcement Learning [J].
Wang, Hao ;
Liu, Chi Harold ;
Dai, Zipeng ;
Tang, Jian ;
Wang, Guoren .
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, :3679-3687
[38]   Multi-Agent Deep Reinforcement Learning-Based Trajectory Planning for Multi-UAV Assisted Mobile Edge Computing [J].
Wang, Liang ;
Wang, Kezhi ;
Pan, Cunhua ;
Xu, Wei ;
Aslam, Nauman ;
Hanzo, Lajos .
IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (01) :73-84
[39]   Location-Aware Crowdsensing: Dynamic Task Assignment and Truth Inference [J].
Wang, Xiong ;
Jia, Riheng ;
Tian, Xiaohua ;
Gan, Xiaoying ;
Fu, Luoyi ;
Wang, Xinbing .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2020, 19 (02) :362-375
[40]  
Wang X, 2018, IEEE INFOCOM SER, P2420, DOI 10.1109/INFOCOM.2018.8485914