Applications of Deep Reinforcement Learning in Communications and Networking: A Survey

被引：1276

作者：

Luong, Nguyen Cong ^{[1
]}

Hoang, Dinh Thai ^{[2
]}

Gong, Shimin ^{[3
]}

Niyato, Dusit ^{[1
]}

Wang, Ping ^{[4
]}

Liang, Ying-Chang ^{[5
]}

Kim, Dong In ^{[6
]}

机构：

[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore

[2] Univ Technol Sydney, Fac Engn & Informat Technol, Sydney, NSW 2007, Australia

[3] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Guangzhou 510275, Guangdong, Peoples R China

[4] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON M3J 1P3, Canada

[5] Univ Elect Sci & Technol China, Ctr Intelligent Networking & Commun, Chengdu 610054, Sichuan, Peoples R China

[6] Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon 16419, South Korea

来源：

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS | 2019年 / 21卷 / 04期

基金：

中国国家自然科学基金; 新加坡国家研究基金会;

关键词：

Reinforcement learning; Deep learning; Security; Markov processes; Unmanned aerial vehicles; Ad hoc networks; Tutorials; Deep reinforcement learning; deep Q-learning; networking; communications; spectrum access; rate control; security; caching; data offloading; data collection; RESOURCE-ALLOCATION; CHANNEL ESTIMATION; USER ASSOCIATION; MASSIVE MIMO; FRAMEWORK; TRANSMISSION; SYSTEMS; ACCESS; OPTIMIZATION; PERFORMANCE;

D O I：

10.1109/COMST.2019.2916583

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a comprehensive literature review on applications of deep reinforcement learning (DRL) in communications and networking. Modern networks, e.g., Internet of Things (IoT) and unmanned aerial vehicle (UAV) networks, become more decentralized and autonomous. In such networks, network entities need to make decisions locally to maximize the network performance under uncertainty of network environment. Reinforcement learning has been efficiently used to enable the network entities to obtain the optimal policy including, e.g., decisions or actions, given their states when the state and action spaces are small. However, in complex and large-scale networks, the state and action spaces are usually large, and the reinforcement learning may not be able to find the optimal policy in reasonable time. Therefore, DRL, a combination of reinforcement learning with deep learning, has been developed to overcome the shortcomings. In this survey, we first give a tutorial of DRL from fundamental concepts to advanced models. Then, we review DRL approaches proposed to address emerging issues in communications and networking. The issues include dynamic network access, data rate control, wireless caching, data offloading, network security, and connectivity preservation which are all important to next generation networks, such as 5G and beyond. Furthermore, we present applications of DRL for traffic routing, resource sharing, and data collection. Finally, we highlight important challenges, open issues, and future research directions of applying DRL.

引用

页码：3133 / 3174

页数：42

共 195 条

[1]

[Anonymous], 2013, IEEE INT SYMP INFO

[2]

[Anonymous], 2014, Markov decision processes: discrete stochastic dynamic programming

[3]

[Anonymous], 2016, PROC INT C MACH LEAR

[4]

[Anonymous], 2018, Proceedings of the AAAI Conference on Artificial Intelligence

[5]

[Anonymous], 2016, MEASURING FIXED BROA

[6]

[Anonymous], IEEE T NETW SCI ENG

[7]

[Anonymous], 2016, DEEP LEARNING

[8]

[Anonymous], 2018, 2018 IEEE INT C COMM

[9]

[Anonymous], 2018, IEEE COMMUN LETT

[10]

[Anonymous], 2012, COMM NETW SMART

← 1 2 3 4 5 6 7 8 9 10 →