Deep Reinforcement Learning based Dynamic Resource Allocation Method for NOMA in AeroMACS

被引：0

作者：

Yu, Lanchenhui ^{[1
]}

Zhao, Jingjing ^{[1
]}

Zhu, Yanbo ^{[1
]}

Chen, RunZe ^{[1
]}

Cai, Kaiquan ^{[1
]}

机构：

[1] Beihang Univ, Sch Elect & Informat Engn, Beijing, Peoples R China

来源：

2024 INTEGRATED COMMUNICATIONS, NAVIGATION AND SURVEILLANCE CONFERENCE, ICNS | 2024年

基金：

中国国家自然科学基金;

关键词：

Aeronautical Mobile Airport Communications system; non-orthogonal multiple access; communication resource allocation; deep reinforcement learning; NONORTHOGONAL MULTIPLE-ACCESS;

D O I：

10.1109/ICNS60906.2024.10550718

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

To overcome the constraints posed by the scarcity of spectrum resources in the dedicated frequency band and the challenge of fulfilling real-time requirements across various services in civil airport surface operations, we propose a dynamic resource allocation method for airport communication system. This innovative approach is based on the non-orthogonal multiple access (NOMA) architecture. To account for variations in service priority among different entities on the surface, we design a multi-objective utility function that considers both transmission rate and service priority. We establish a joint optimization problem model for sub-channel allocation and power control in the scenario of airport uplink communication. Since the problem model exhibits non-convexity and highly coupled parameters, the multi-agent proximal policy optimization based on multi-discrete (MD-MAPPO) algorithm is introduced. Simulation results demonstrate that the NOMA architecture significantly improves the spectral efficiency of the airport communication system. Furthermore, our proposed algorithm effectively meets the requirements of multiple services by achieving dynamic and efficient wireless resource allocation, surpassing traditional reinforcement learning algorithms in terms of cumulative reward, convergence, and learning efficiency.

引用

页数：8

共 15 条

[1] Resource Allocation in Uplink NOMA-IoT Networks: A Reinforcement-Learning Approach [J].

Ahsan, Waleed ;

Yi, Wenqiang ;

Qin, Zhijin ;

Liu, Yuanwei ;

Nallanathan, Arumugam .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (08) :5083-5098

[2] Dynamic User Clustering and Power Allocation for Uplink and Downlink Non-Orthogonal Multiple Access (NOMA) Systems [J].

Ali, Md Shipon ;

Tabassum, Hina ;

Hossain, Ekram .

IEEE ACCESS, 2016, 4 :6325-6343

[3]

Andrea A. N. D., 1997, Two-sided matching: A study in gametheoretic modeling and analysis

[4]

caac, Construction and application implementation plan of newgeneration Aero MACS

[5] Reinforcement Learning-Based Multiaccess Control and Battery Prediction With Energy Harvesting in IoT Systems [J].

Chu, Man ;

Li, Hang ;

Liao, Xuewen ;

Cui, Shuguang .

IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (02) :2009-2020

[6]

Dai LL, 2015, IEEE COMMUN MAG, V53, P74, DOI 10.1109/MCOM.2015.7263349

[7] On the Performance of Non-Orthogonal Multiple Access in 5G Systems with Randomly Deployed Users [J].

Ding, Zhiguo ;

Yang, Zheng ;

Fan, Pingzhi ;

Poor, H. Vincent .

IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (12) :1501-1505

[8]

ICAO, 2017, ICAO Doc.10044

[9]

ITU, 2007, ITU-R Recommendation M.1827.

[10] On the Complexity of Joint Subcarrier and Power Allocation for Multi-User OFDMA Systems [J].

Liu, Ya-Feng ;

Dai, Yu-Hong .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (03) :583-596

← 1 2 →