Learning Generalized Wireless MAC Communication Protocols via Abstraction

被引：14

作者：

Miuccio, Luciano ^{[1
]}

Riolo, Salvatore ^{[1
]}

Samarakoon, Sumudu ^{[2
]}

Panno, Daniela ^{[1
]}

Bennis, Mehdi ^{[2
]}

机构：

[1] Univ Catania, Dept Elect Elect & Comp Engn, Catania, Italy

[2] Univ Oulu, Ctr Wireless Commun, Oulu, Finland

来源：

2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022) | 2022年

关键词：

6G; MARL; abstraction; generalization; protocol; learning;

D O I：

10.1109/GLOBECOM48099.2022.10000805

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To tackle the heterogeneous requirements of beyond 5G (B5G) and future 6G wireless networks, conventional medium access control (MAC) procedures need to evolve to enable base stations (BSs) and user equipments (UEs) to automatically learn innovative MAC protocols catering to extremely diverse services. This topic has received significant attention, and several reinforcement learning (RL) algorithms, in which BSs and UEs are cast as agents, are available with the aim of learning a communication policy based on agents' local observations. However, current approaches are typically overfitted to the environment they are trained in, and lack robustness against unseen conditions, failing to generalize in different environments. To overcome this problem, in this work, instead of learning a policy in the high dimensional and redundant observation space, we leverage the concept of observation abstraction (OA) rooted in extracting useful information from the environment. This in turn allows learning communication protocols that are more robust and with much better generalization capabilities than current baselines. To learn the abstracted information from observations, we propose an architecture based on autoencoder (AE) and imbue it into a multi-agent proximal policy optimization (MAPPO) framework. Simulation results corroborate the effectiveness of leveraging abstraction when learning protocols by generalizing across environments, in terms of number of UEs, number of data packets to transmit, and channel conditions.

引用

页码：2322 / 2327

页数：6

共 11 条

[1]

ABEL D, 2018, P 35 INT C MACH LEAR, V80

[2] Learn to Schedule (LEASCH): A Deep Reinforcement Learning Approach for Radio Resource Scheduling in the 5G MAC Layer [J].

Al-Tam, Faroq ;

Correia, Noelia ;

Rodriguez, Jonathan .

IEEE ACCESS, 2020, 8 :108088-108101

[3]

[Anonymous], 2016, ABS160506676 CORR

[4]

[Anonymous], 2021, 2021 IEEE GLOB WORKS, DOI DOI 10.14232/EJQTDE.2021.1.35

[5] A survey of robot learning from demonstration [J].

Argall, Brenna D. ;

Chernova, Sonia ;

Veloso, Manuela ;

Browning, Brett .

ROBOTICS AND AUTONOMOUS SYSTEMS, 2009, 57 (05) :469-483

[6] Deep Reinforcement Learning A brief survey [J].

Arulkumaran, Kai ;

Deisenroth, Marc Peter ;

Brundage, Miles ;

Bharath, Anil Anthony .

IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38

[7] Artificial-Intelligence-Enabled Air Interface for 6G: Solutions, Challenges, and Standardization Impacts [J].

Han, Shuangfeng ;

Xie, Tian ;

Chih-Lin, I ;

Chai, Li ;

Liu, Zhiming ;

Yuan, Yifei ;

Cui, Chunfeng .

IEEE COMMUNICATIONS MAGAZINE, 2020, 58 (10) :73-79

[8] Towards a Learning-Based Framework for Self-Driving Design of Networking Protocols [J].

Pasandi, Hannaneh Barahouei ;

Nadeem, Tamer .

IEEE ACCESS, 2021, 9 :34829-34844

[9]

Terry J. K., 2020, ABS200513625 CORR

[10] Toward Joint Learning of Optimal MAC Signaling and Wireless Channel Access [J].

Valcarce, Alvaro ;

Hoydis, Jakob .

IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (04) :1233-1243

← 1 2 →