Toward Joint Learning of Optimal MAC Signaling and Wireless Channel Access

被引:15
作者
Valcarce, Alvaro [1 ]
Hoydis, Jakob [1 ]
机构
[1] Nokia Bell Labs France, Dept Radio Syst Res & AI, F-91620 Nozay, France
关键词
Communication system signaling; learning systems; mobile communication;
D O I
10.1109/TCCN.2021.3080677
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Communication protocols are the languages used by network nodes. Before a user equipment (UE) exchanges data with a base station (BS), it must first negotiate the conditions and parameters for that transmission. This negotiation is supported by signaling messages at all layers of the protocol stack. Each year, the telecoms industry defines and standardizes these messages, which are designed by humans during lengthy technical (and often political) debates. Following this standardization effort, the development phase begins, wherein the industry interprets and implements the resulting standards. But is this massive development undertaking the only way to implement a given protocol? We address the question of whether radios can learn a pre-given target protocol as an intermediate step towards evolving their own. Furthermore, we train cellular radios to emerge a channel access policy that performs optimally under the constraints of the target protocol. We show that multi-agent reinforcement learning (MARL) and learning-to-communicate (L2C) techniques achieve this goal with gains over expert systems. Finally, we provide insight into the transferability of these results to scenarios never seen during training.
引用
收藏
页码:1233 / 1243
页数:11
相关论文
共 27 条
[1]  
Al-Tam F., 2020, LEARN SCHEDULE LEASC
[2]  
[Anonymous], 2020, 3GPP Standard TS 38.300
[3]  
[Anonymous], 2019, REP 2221001
[4]   Model-Free Training of End-to-End Communication Systems [J].
Aoudia, Faycal Ait ;
Hoydis, Jakob .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (11) :2503-2516
[5]   Reinforcement Learning for Mixed Cooperative/Competitive Dynamic Spectrum Access [J].
Bowyer, Caleb ;
Greene, David ;
Ward, Tyler ;
Menendez, Marco ;
Shea, John ;
Wong, Tan .
2019 IEEE INTERNATIONAL SYMPOSIUM ON DYNAMIC SPECTRUM ACCESS NETWORKS (DYSPAN), 2019, :479-484
[6]  
Destounis A., 2019, LEARN2MAC ONLINE LEA
[7]  
Foerster JN, 2016, ADV NEUR IN, V29
[8]  
Foerster JN, 2017, PR MACH LEARN RES, V70
[9]   A survey and critique of multiagent deep reinforcement learning [J].
Hernandez-Leal, Pablo ;
Kartal, Bilal ;
Taylor, Matthew E. .
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2019, 33 (06) :750-797
[10]  
Hu HY, 2020, PR MACH LEARN RES, V119