Energy-Aware Deep Reinforcement Learning Scheduling for Sensors Correlated in Time and Space

被引:15
作者
Hribar, Jernej [1 ]
Marinescu, Andrei [2 ]
Chiumento, Alessandro [3 ]
Dasilva, Luiz A. [4 ]
机构
[1] Trinity Coll Dublin, CONNECT, Dublin D02 PN40 2, Ireland
[2] Ctr Intelligent Power, Dublin D04 YOC2 4, Ireland
[3] Univ Twente, Math & Comp Sci Fac, Pervas Syst Elect Engn, NL-7522 NB Enschede, Netherlands
[4] Virginia Tech, Commonwealth Cyber Initiat, Blacksburg, VA 24061 USA
关键词
Sensor systems; Intelligent sensors; Sensor phenomena and characterization; Internet of Things; Job shop scheduling; Logic gates; Reinforcement learning; Deep reinforcement learning (DRL); Internet of Things (IoT); low-power sensors; reinforcement learning; NETWORKS; INFORMATION; INTERNET; IOT; AGE;
D O I
10.1109/JIOT.2021.3114102
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Millions of battery-powered sensors deployed for monitoring purposes in a multitude of scenarios, e.g., agriculture, smart cities, industry, etc., require energy-efficient solutions to prolong their lifetime. When these sensors observe a phenomenon distributed in space and evolving in time, it is expected that collected observations will be correlated in time and space. This article proposes a deep reinforcement learning (DRL)-based scheduling mechanism capable of taking advantage of correlated information. The designed solution employs deep deterministic policy gradient (DDPG) algorithm. The proposed mechanism can determine the frequency with which sensors should transmit their updates, to ensure accurate collection of observations, while simultaneously considering the energy available. The solution is evaluated with multiple data sets containing environmental observations obtained in multiple real deployments. The real observations are leveraged to model the environment with which the mechanism interacts as realistically as possible. The proposed solution can significantly extend the sensors' lifetime and is compared to an idealized, all-knowing scheduler to demonstrate that its performance is near optimal. Additionally, the results highlight the unique feature of the proposed design, energy-awareness, by displaying the impact of sensors' energy levels on the frequency of updates.
引用
收藏
页码:6732 / 6744
页数:13
相关论文
共 44 条
[1]   Energy conservation in wireless sensor networks: A survey [J].
Anastasi, Giuseppe ;
Conti, Marco ;
Di Francesco, Mario ;
Passarella, Andrea .
AD HOC NETWORKS, 2009, 7 (03) :537-568
[2]  
[Anonymous], 2019, SX127273 860 MHZ 102
[3]   RLMan: An Energy Manager Based on Reinforcement Learning for Energy Harvesting Wireless Sensor Networks [J].
Aoudia, Faycal Ait ;
Gautier, Matthieu ;
Berder, Olivier .
IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2018, 2 (02) :408-417
[4]  
Bhatnagar S., 2008, Advances in Neural Information Processing Systems (NeurIPS), P105
[5]  
Bodik P., 2004, Intel Lab Data
[6]  
Bormann C., 2014, RFC7228
[7]   Survey and Taxonomy of Duty Cycling Mechanisms in Wireless Sensor Networks [J].
Carrano, Ricardo C. ;
Passos, Diego ;
Magalhaes, Luiz C. S. ;
Albuquerque, Celio V. N. .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2014, 16 (01) :181-194
[8]   On the lifetime of wireless sensor networks [J].
Chen, YX ;
Zhao, Q .
IEEE COMMUNICATIONS LETTERS, 2005, 9 (11) :976-978
[9]  
Costa M, 2017, 2017 IEEE CONFERENCE ON STANDARDS FOR COMMUNICATIONS AND NETWORKING (CSCN), P258, DOI 10.1109/CSCN.2017.8088631
[10]   Classes of nonseparable, spatio-temporal stationary covariance functions [J].
Cressie, N ;
Huang, HC .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (448) :1330-1340