Power Management of Wireless Sensor Nodes with Coordinated Distributed Reinforcement Learning

Cited by: 7
Authors
Shresthamali, Shaswot [1 ]
Kondo, Masaaki [1 ]
Nakamura, Hiroshi [1 ]
Affiliations
[1] Univ Tokyo, Tokyo, Japan
Source
2019 IEEE 37TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2019) | 2019
Funding
Japan Science and Technology Agency;
Keywords
Distributed Reinforcement Learning; Deep Reinforcement Learning; Energy Harvesting Wireless Sensor Nodes; Energy Neutral Operation; Internet of Things; epsilon-greedy exploration;
DOI
10.1109/ICCD46524.2019.00092
CLC Number
TP3 [Computing Technology, Computer Technology];
Discipline Code
0812 ;
Abstract
Energy Harvesting Wireless Sensor Nodes (EHWSNs) require adaptive energy management policies for uninterrupted, perpetual operation in their physical environments. Contemporary online Reinforcement Learning (RL) solutions take an unrealistically long time exploring the environment before converging on working policies. Our work accelerates learning by partitioning the state-space for simultaneous exploration by multiple agents. We achieve this with a novel coordinated epsilon-greedy method and implement it via Distributed RL (DiRL) in an EHWSN network. Our simulation results show a four-fold increase in state-space penetration and a reduction in the time to achieve optimal operation by an order of magnitude (50x). Moreover, we propose methods to reduce the instances of disastrous outcomes associated with learning and exploration, which cuts node downtimes by one-third in simulations corresponding to a real-world scenario.
Pages: 638-647
Page count: 10
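The coordinated epsilon-greedy idea summarized in the abstract — each agent explores only its assigned slice of the state-space while all agents pool experience in a shared value function — can be illustrated with a minimal sketch. All names, the 1-D discretised battery-level state, and the disjoint equal-width partitioning are illustrative assumptions; the paper's actual EHWSN model and coordination scheme are more involved.

```python
import random

N_STATES = 100   # hypothetical discretised battery levels
N_ACTIONS = 4    # hypothetical duty-cycle settings
N_AGENTS = 4
EPSILON = 0.1

# Shared Q-table: agents pool their experience (the "coordination").
Q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]

def partition(agent_id):
    """Disjoint slice of the state-space assigned to this agent."""
    width = N_STATES // N_AGENTS
    return range(agent_id * width, (agent_id + 1) * width)

def act(agent_id, state, rng=random):
    """Coordinated epsilon-greedy: an agent takes random exploratory
    actions only in states inside its own partition; elsewhere it
    acts greedily on the shared Q-table."""
    if state in partition(agent_id) and rng.random() < EPSILON:
        return rng.randrange(N_ACTIONS)                           # explore
    return max(range(N_ACTIONS), key=lambda a: Q[state][a])       # exploit
```

Because the partitions are disjoint, the team of agents covers the whole state-space in parallel instead of each agent redundantly exploring everywhere, which is the mechanism behind the faster state-space penetration the abstract reports.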