Optimizing of Q-Learning Day/Night Energy Strategy for Solar Harvesting Environmental Wireless Sensor Networks Nodes

被引:14
作者
Prauzek, Michal [1 ]
Konecny, Jaromir [1 ]
机构
[1] VSB Tech Univ Ostrava, Fac Elect Engn & Comp Sci, Ostrava, Czech Republic
基金
欧盟地平线“2020”;
关键词
Energy management; Microcontrollers; Semi-supervised learning; Wireless sensor networks;
D O I
10.5755/j02.eie.28875
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This research article presents the application of the Q-learning algorithm in the operational duty cycle control of solar-powered environmental wireless sensor network (EWSN) nodes. Those nodes are commonly implemented as embedded devices using low-power and low-cost microcontrollers. Therefore, there is a significant need for an effective and easy way to implement a machine learning (ML) algorithm in terms of computer performance. This approach uses a Q-learning-based policy implementing a sleep/run switching algorithm driven by the state of charge. The presented algorithm is based on two modes: daylight and nighttime, which is a suitable solution for solar-powered systems. The study includes the complete process of design EWSN node strategy with an optimal reward policy. The presented algorithm was tested and verified on an EWSN node model and a 5-year data set of solar irradiance values was used for the learning process and its validation. As part of the study, we are also presenting the validation in terms of Q-learning parameters, which include the learning rate and discount factor. The result section shows that the overall performance of the presented solution is more suitable for solar-powered EWSN then state-of-the-art studies. Both day/night experiments reached 828 203 measurement/transmission cycles, which is 12.7 % more than in the previous studies using the strategy defined by the state of energy storage.
引用
收藏
页码:50 / 56
页数:7
相关论文
共 18 条
[1]  
Alberta Agriculture and Rural Developement, 2013, ALB AGR RUR DEV
[2]  
[Anonymous], 2020, SOLAR CALCULATION DE
[3]  
Christopher John Cornish Hellaby Watkins, 1989, Learning from delayed rewards
[4]  
de la Piedra A, 2013, 2013 IEEE EUROCON, P267, DOI 10.1109/EUROCON.2013.6624996
[5]  
Dick RP, 2020, IEEE DES TEST, V37, P7, DOI 10.1109/MDAT.2019.2957352
[6]  
Konecny J., 2020, ADV INTELLIGENT SYST, V1156, DOI [10.1007/978-3-030-50097-9, DOI 10.1007/978-3-030-50097-9]
[7]   A Simulation Framework for Energy Harvesting in Wireless Sensor Networks: Single Node Architecture Perspective [J].
Konecny, Jaromir ;
Prauzek, Michal ;
Borova, Monika ;
Janosova, Karolina ;
Musilek, Petr .
PROCEEDINGS OF THE 2019 23RD INTERNATIONAL CONFERENCE ELECTRONICS (ELECTRONICS 2019), 2019,
[8]   Feasibility of Harvesting Solar Energy for Self-Powered Environmental Wireless Sensor Nodes [J].
Li, Yuyang ;
Hamed, Ehab A. ;
Zhang, Xincheng ;
Luna, Daniel ;
Lin, Jeen-Shang ;
Liang, Xu ;
Lee, Inhee .
ELECTRONICS, 2020, 9 (12) :1-13
[9]  
Lian LY, 2011, PROTEIN NMR SPECTROSCOPY: PRACTICAL TECHNIQUES AND APPLICATIONS, P1
[10]   Two thermocouples low power wireless sensors network [J].
Markevicius, Vytautas ;
Navikas, Dangirutis ;
Andriukaitis, Darius ;
Cepenas, Mindaugas ;
Valinevicius, Algimantas ;
Zilys, Mindaugas ;
Malekian, Reza ;
Janeliauskas, Arturas ;
Walendziuk, Wojciech ;
Idzkowski, Adam .
AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2018, 84 :242-250