Deep Reinforcement Learning for Scheduling in Cellular Networks

被引：19

作者：

Wang, Jian ^{[1
]}

Xu, Chen ^{[1
]}

Huangfu, Yourui ^{[1
]}

Li, Rong ^{[1
]}

Ge, Yiqun ^{[2
]}

Wang, Jun ^{[1
]}

机构：

[1] Huawei Technol, Hangzhou Res Ctr, Hangzhou, Zhejiang, Peoples R China

[2] Huawei Technol, Ottawa Res Ctr, Ottawa, ON, Canada

来源：

2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP) | 2019年

关键词：

artificial intelligence; cellular networks; deep reinforcement learning; scheduling; proportional fair;

D O I：

10.1109/wcsp.2019.8927868

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Integrating artificial intelligence (AI) into wireless networks has drawn significant interest in both industry and academia. A common solution is to replace partial or even all modules in the conventional systems, which is often lack of efficiency and robustness due to their ignoring of expert knowledge. In this paper, we take deep reinforcement learning (DRL) based scheduling as an example to investigate how expert knowledge can help with AI module in cellular networks. A simulation platform, which has considered link adaption, feedback and other practical mechanisms, is developed to facilitate the investigation. Besides the traditional way, which is learning directly from the environment, for training DRL agent, we propose two novel methods, i.e., learning from a dual AI module and learning from the expert solution. The results show that, for the considering scheduling problem, DRL training procedure can be improved on both performance and convergence speed by involving the expert knowledge. Hence, instead of replacing conventional scheduling module in the system, adding a newly introduced AI module, which is capable to interact with the conventional module and provide more flexibility, is a more feasible solution.

引用

页数：6

共 14 条

[1]

Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265

[2]

[Anonymous], 2001, WIRELESS COMMUNICATI

[3]

[Anonymous], 2019, IEEE COMMUNICATIONS

[4]

[Anonymous], IEEE INTERNET THINGS

[5]

[Anonymous], 2018, BOUND VALUE PROBLEM

[6]

Atallah R, 2017, 2017 15TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT)

[7]

Bertsekas D. P., 2005, Dynamic Programming and Optimal Control, V1

[8] Downlink Packet Scheduling in LTE Cellular Networks: Key Design Issues and a Survey [J].

Capozzi, F. ;

Piro, G. ;

Grieco, L. A. ;

Boggia, G. ;

Camarda, P. .

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2013, 15 (02) :678-700

[9]

Chinchali S, 2018, AAAI CONF ARTIF INTE, P766

[10]

Jain R., 1984, DEC TECHNICAL REPORT

← 1 2 →