Timely Data Collection for UAV-Based IoT Networks: A Deep Reinforcement Learning Approach

被引:15
作者
Hu, Yingmeng [1 ]
Liu, Yan [1 ]
Kaushik, Aryan [2 ]
Masouros, Christos [3 ]
Thompson, John S. [4 ]
机构
[1] China Satellite Network Innovat Co Ltd, Beijing 100029, Peoples R China
[2] Univ Sussex, Sch Engn & Informat, Brighton BN1 9RH, England
[3] UCL, Dept Elect & Elect Engn, London WC1E 7JE, England
[4] Univ Edinburgh, Inst Digital Commun, Sch Engn, Edinburgh EH9 3JL, Scotland
基金
英国工程与自然科学研究理事会;
关键词
Data collection; Sensors; Internet of Things; Trajectory; Task analysis; Reinforcement learning; Memory; Age of information (AoI); data collection; deep reinforcement learning (DRL); unmanned aerial vehicle (UAV) trajectory optimization; INTERNET; THINGS; MODEL; AOI;
D O I
10.1109/JSEN.2023.3265935
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In some real-time Internet of Things (IoT) applications, the timeliness of sensor data is very important for the performance of a system. How to collect the data of sensor nodes (SNs) is a problem to be solved for an unmanned aerial vehicle (UAV) in a specified area, where different nodes have different timeliness priorities. To efficiently collect the data, a guided search deep reinforcement learning (GSDRL) algorithm is presented to help the UAV with different initial positions to independently complete the task of data collection and forwarding. First, the data collection process is modeled as a sequential decision problem for minimizing the average age of information (AoI) or maximizing the number of collected nodes according to specific environment. Then, the data collection strategy is optimized by the GSDRL algorithm. After training the network using the GSDRL algorithm, the UAV has the ability to perform autonomous navigation and decision-making to complete the complexity task more efficiently and rapidly. Simulation experiments show that the GSDRL algorithm has strong adaptability to adverse environments and obtains a good strategy for UAV data collection and forwarding.
引用
收藏
页码:12295 / 12308
页数:14
相关论文
共 30 条
[1]   Average Peak Age-of-Information Minimization in UAV-Assisted IoT Networks [J].
Abd-Elmagid, Mohamed A. ;
Dhillon, Harpreet S. .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (02) :2003-2008
[2]   Correlated Link Shadow Fading in Multi-Hop Wireless Networks [J].
Agrawal, Piyush ;
Patwari, Neal .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2009, 8 (08) :4024-4036
[3]  
Al-Hourani A, 2014, IEEE GLOB COMM CONF, P2898, DOI 10.1109/GLOCOM.2014.7037248
[4]   Enabling Drone Services: Drone Crowdsourcing and Drone Scripting [J].
Alwateer, Majed ;
Loke, Seng W. ;
Fernando, Niroshinie .
IEEE ACCESS, 2019, 7 :110035-110049
[5]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38
[6]   Regret Based Learning for UAV assisted LTE-U/WiFi Public Safety Networks [J].
Athukoralage, Dasun ;
Guvenc, Ismail ;
Saad, Walid ;
Bennis, Mehdi .
2016 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2016,
[7]   Crowdsourcing with Smartphones [J].
Chatzimilioudis, Georgios ;
Konstantinidis, Andreas ;
Laoudias, Christos ;
Zeinalipour-Yazti, Demetrios .
IEEE INTERNET COMPUTING, 2012, 16 (05) :36-44
[8]   On the Age of Information in Status Update Systems With Packet Management [J].
Costa, Maice ;
Codreanu, Marian ;
Ephremides, Anthony .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2016, 62 (04) :1897-1910
[9]   A Spatiotemporal Model for Peak AoI in Uplink IoT Networks: Time Versus Event-Triggered Traffic [J].
Emara, Mustafa ;
ElSawy, Hesham ;
Bauch, Gerhard .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (08) :6762-6777
[10]   Survey on UAV Cellular Communications: Practical Aspects, Standardization Advancements, Regulation, and Security Challenges [J].
Fotouhi, Azade ;
Qiang, Haoran ;
Ding, Ming ;
Hassan, Mahbub ;
Giordano, Lorenzo Galati ;
Garcia-Rodriguez, Adrian ;
Yuan, Jinhong .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2019, 21 (04) :3417-3442