Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges

被引:191
作者
Lei, Lei [1 ]
Tan, Yue [2 ]
Zheng, Kan [2 ]
Liu, Shiwen [2 ]
Zhang, Kuan [3 ]
Shen, Xuemin [4 ]
机构
[1] Univ Guelph, Coll Engn & Phys Sci, Guelph, ON N1G 2W1, Canada
[2] Beijing Univ Posts & Telecommun, Intelligent Comp & Commun Lab, Key Lab Universal Wireless Commun, Minist Educ, Beijing 100876, Peoples R China
[3] Univ Nebraska, Dept Elect & Comp Engn, Lincoln, NE 68182 USA
[4] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
基金
中国国家自然科学基金;
关键词
Internet of Things; Servers; Machine learning; Actuators; Tutorials; Approximation algorithms; Autonomous Internet of Things; deep reinforcement learning; DEMAND RESPONSE; RESOURCE-ALLOCATION; ENERGY MANAGEMENT; WIRELESS NETWORKS; BIG DATA; IOT; OPTIMIZATION; ALGORITHM; SENSOR;
D O I
10.1109/COMST.2020.2988367
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Internet of Things (IoT) extends the Internet connectivity into billions of IoT devices around the world, where the IoT devices collect and share information to reflect status of the physical world. The Autonomous Control System (ACS), on the other hand, performs control functions on the physical systems without external intervention over an extended period of time. The integration of IoT and ACS results in a new concept - autonomous IoT (AIoT). The sensors collect information on the system status, based on which the intelligent agents in the IoT devices as well as the Edge/Fog/Cloud servers make control decisions for the actuators to react. In order to achieve autonomy, a promising method is for the intelligent agents to leverage the techniques in the field of artificial intelligence, especially reinforcement learning (RL) and deep reinforcement learning (DRL) for decision making. In this paper, we first provide a tutorial of DRL, and then propose a general model for the applications of RL/DRL in AIoT. Next, a comprehensive survey of the state-of-art research on DRL for AIoT is presented, where the existing works are classified and summarized under the umbrella of the proposed general DRL model. Finally, the challenges and open issues for future research are identified.
引用
收藏
页码:1722 / 1760
页数:39
相关论文
共 163 条
[1]   A Review of Fog Computing and Machine Learning: Concepts, Applications, Challenges, and Open Issues [J].
Abdulkareem, Karrar Hameed ;
Mohammed, Mazin Abed ;
Gunasekaran, Saraswathy Shamini ;
Al-Mhiqani, Mohammed Nasser ;
Mutlag, Ammar Awad ;
Mostafa, Salama A. ;
Ali, Nabeel Salih ;
Ibrahim, Dheyaa Ahmed .
IEEE ACCESS, 2019, 7 :153123-153140
[2]   Machine Learning in Wireless Sensor Networks: Algorithms, Strategies, and Applications [J].
Abu Alsheikh, Mohammad ;
Lin, Shaowei ;
Niyato, Dusit ;
Tan, Hwee-Pink .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2014, 16 (04) :1996-2018
[3]   Natural gradient works efficiently in learning [J].
Amari, S .
NEURAL COMPUTATION, 1998, 10 (02) :251-276
[4]  
[Anonymous], 2016, LEARNING COMMUNICATE
[5]  
[Anonymous], 2018, SMARTER THINGS AUTON
[6]  
[Anonymous], 2016, SAMPLE EFFICIENT ACT
[7]  
[Anonymous], 2019, WIRELESS NETWORKS DE
[8]  
[Anonymous], 2016, PREPRINT
[9]  
Antsaklis P. J., 1991, IEEE Control Systems Magazine, V11, P5, DOI 10.1109/37.88585
[10]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38