Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges

被引:191
作者
Lei, Lei [1 ]
Tan, Yue [2 ]
Zheng, Kan [2 ]
Liu, Shiwen [2 ]
Zhang, Kuan [3 ]
Shen, Xuemin [4 ]
机构
[1] Univ Guelph, Coll Engn & Phys Sci, Guelph, ON N1G 2W1, Canada
[2] Beijing Univ Posts & Telecommun, Intelligent Comp & Commun Lab, Key Lab Universal Wireless Commun, Minist Educ, Beijing 100876, Peoples R China
[3] Univ Nebraska, Dept Elect & Comp Engn, Lincoln, NE 68182 USA
[4] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
基金
中国国家自然科学基金;
关键词
Internet of Things; Servers; Machine learning; Actuators; Tutorials; Approximation algorithms; Autonomous Internet of Things; deep reinforcement learning; DEMAND RESPONSE; RESOURCE-ALLOCATION; ENERGY MANAGEMENT; WIRELESS NETWORKS; BIG DATA; IOT; OPTIMIZATION; ALGORITHM; SENSOR;
D O I
10.1109/COMST.2020.2988367
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Internet of Things (IoT) extends the Internet connectivity into billions of IoT devices around the world, where the IoT devices collect and share information to reflect status of the physical world. The Autonomous Control System (ACS), on the other hand, performs control functions on the physical systems without external intervention over an extended period of time. The integration of IoT and ACS results in a new concept - autonomous IoT (AIoT). The sensors collect information on the system status, based on which the intelligent agents in the IoT devices as well as the Edge/Fog/Cloud servers make control decisions for the actuators to react. In order to achieve autonomy, a promising method is for the intelligent agents to leverage the techniques in the field of artificial intelligence, especially reinforcement learning (RL) and deep reinforcement learning (DRL) for decision making. In this paper, we first provide a tutorial of DRL, and then propose a general model for the applications of RL/DRL in AIoT. Next, a comprehensive survey of the state-of-art research on DRL for AIoT is presented, where the existing works are classified and summarized under the umbrella of the proposed general DRL model. Finally, the challenges and open issues for future research are identified.
引用
收藏
页码:1722 / 1760
页数:39
相关论文
共 163 条
[31]   Reinforcement Learning Approach for Optimal Distributed Energy Management in a Microgrid [J].
Foruzan, Elham ;
Soh, Leen-Kiat ;
Asgarpoor, Sohrab .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2018, 33 (05) :5749-5758
[32]  
Francois-Lavet V., 2016, P EUR WORKSH REINF L
[33]   An Introduction to Deep Reinforcement Learning [J].
Francois-Lavet, Vincent ;
Henderson, Peter ;
Islam, Riashat ;
Bellemare, Marc G. ;
Pineau, Joelle .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2018, 11 (3-4) :219-354
[34]  
Fujimoto S., 2018, PROC INT C MACH LEAR
[35]  
Gamage H., 2017, PROC IEEE 86 VEH TEC, P1
[36]  
Gu S., 2016, Q-Prop: Sample-Efficient Policy Gradient with an Off-Policy Critic
[37]  
Haarnoja T, 2018, P INT C MACH LEARN
[38]  
Hausknecht M., 2015, P AAAI C ART INT
[39]   Integrated Networking, Caching, and Computing for Connected Vehicles: A Deep Reinforcement Learning Approach [J].
He, Ying ;
Zhao, Nan ;
Yin, Hongxi .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (01) :44-55
[40]  
Hochreiter S, 2001, A Field Guide to Dynamical Recurrent Networks, P237, DOI DOI 10.1109/9780470544037.CH14