Deep Reinforcement Learning for Dynamic Uplink/Downlink Resource Allocation in High Mobility 5G HetNet

被引:130
作者
Tang, Fengxiao [1 ]
Zhou, Yibo [1 ]
Kato, Nei [1 ]
机构
[1] Tohoku Univ, Grad Sch Informat Sci GSIS, Sendai, Miyagi 9808579, Japan
关键词
5G mobile communication; Resource management; Heuristic algorithms; Vehicle dynamics; Interference; Dynamic scheduling; Device-to-device communication; Vehicular ad hoc networks; 5G; high mobility; resource allocation; unmanned aerial vehicle (UAV); time division duplex (TDD); reinforcement learning (RL); Q-learning; deep learning; deep belief network; heterogeneous network (HetNet); INTERFERENCE MANAGEMENT; INTELLIGENT; TDD; ASSIGNMENT; NETWORK; CHANNELS; DOWNLINK;
D O I
10.1109/JSAC.2020.3005495
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, the 5G is widely deployed for supporting communications of high mobility nodes including train, vehicular and unmanned aerial vehicles (UAVs) largely emerged as the main components for constructing the wireless heterogeneous network (HetNet). To further improve the radio utilization, the Time Division Duplex (TDD) is considered to be the potential full-duplex communication technology in the high mobility 5G network. However, the high mobility of users leads to the high dynamic network traffic and unpredicted link state change. A new method to predict the dynamic traffic and channel condition and schedule the TDD configuration in real-time is essential for the high mobility environment. In this paper, we investigate the channel model in the high mobility and heterogeneous network and proposed a novel deep reinforcement learning based intelligent TDD configuration algorithm to dynamically allocate radio resources in an online manner. In the proposal, the deep neural network is employed to extract the features of the complex network information, and the dynamic Q-value iteration based reinforcement learning with experience replay memory mechanism is proposed to adaptively change TDD Up/Down-link ratio by evaluated rewards. The simulation results show that the proposal achieves significant network performance improvement in terms of both network throughput and packet loss rate, comparing with conventional TDD resource allocation algorithms.
引用
收藏
页码:2773 / 2782
页数:10
相关论文
共 34 条
[1]  
3GPP, TR 22.885 v14.0.0
[2]  
Al-Imari M, 2015, IEEE VTS VEH TECHNOL
[3]   Model-Free Training of End-to-End Communication Systems [J].
Aoudia, Faycal Ait ;
Hoydis, Jakob .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (11) :2503-2516
[4]   On the capacity of some channels with channel state information [J].
Caire, G ;
Shamai, S .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1999, 45 (06) :2007-2019
[5]   Adaptive Downlink and Uplink Channel Split Ratio Determination for TCP-Based Best Effort Traffic in TDD-Based WiMAX Networks [J].
Chiang, Chih-He ;
Liao, Wanjiun ;
Liu, Tehuang ;
Chan, Iam Kin ;
Chao, Hsi-Lu .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2009, 27 (02) :182-190
[6]   On Dynamic Time-Division-Duplex Transmissions for Small-Cell Networks [J].
Ding, Min ;
Lopez-Perez, David ;
Xue, Ruiqi ;
Vasilakos, Athanasios V. ;
Chen, Wen .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2016, 65 (11) :8933-8951
[7]  
Ding M, 2014, IEEE INT CONF COMM, P616, DOI 10.1109/ICCW.2014.6881267
[8]   Performance Analysis of Dense Small Cell Networks With Dynamic TDD [J].
Ding, Tian ;
Ding, Ming ;
Mao, Guoqiang ;
Lin, Zihuai ;
Zomaya, Albert Y. ;
Lopez-Perez, David .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (10) :9816-9830
[9]   State-of-the-Art Deep Learning: Evolving Machine Intelligence Toward Tomorrow's Intelligent Network Traffic Control Systems [J].
Fadlullah, Zubair Md. ;
Tang, Fengxiao ;
Mao, Bomin ;
Kato, Nei ;
Akashi, Osamu ;
Inoue, Takeru ;
Mizutani, Kimihiro .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2017, 19 (04) :2432-2455
[10]  
Fazio P, 2011, WISEC 11: PROCEEDINGS OF THE FOURTH ACM CONFERENCE ON WIRELESS NETWORK SECURITY, P5