Deep Reinforcement Learning for Power Controlled Channel Allocation in Wireless Avionics Intra-Communications

Citations: 0
Authors
Zuo, Yuanjun [1]
Li, Qiao [1]
Lu, Guangshan [1]
Xiong, Huagang [1]
Affiliations
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Wireless communication; Simulation; Power control; Reinforcement learning; Channel allocation; Aerospace electronics; Feature extraction; deep reinforcement learning; UWB; WAIC; NETWORKS; DESIGN
DOI
10.1109/ACCESS.2021.3100260
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Wireless avionics intra-communications (WAIC) can play an important role in alleviating the fuel consumption, stability, and maintenance cost issues of traditional wired avionics systems. However, for a WAIC system to coexist with other systems on the aircraft, its transmit power is strictly limited to 50 mW. Meanwhile, WAIC requires an extremely low outage probability for safety-critical avionics applications. Hence, there is an urgent need to allocate channels effectively and make good use of the limited power while ensuring the reliability and real-time performance of the WAIC system. In this paper, a deep reinforcement learning (DRL)-based power controlled channel allocation (DRL-PCCA) scheme is proposed for WAIC networks with a frequency hopping orthogonal frequency division multiplexing (FH-OFDM) physical layer. First, we formulate a sub-band allocation and power control optimization problem that aims to minimize the overall transmit power, provided that all end nodes achieve their requested data rates and desired bit error rate (BER). However, the formulated problem is non-convex and NP-hard. To tackle it, we propose a DRL-based scheme, which can effectively solve sequential decision-making optimization problems in complex environments by using neural networks to extract the spatial correlation features of WAIC. In particular, a reliability framework for WAIC is presented to analyse the performance of the proposed scheme against the system requirements of flight certification. Simulation results demonstrate that the proposed DRL-based scheme outperforms the traditional power control and channel allocation scheme.
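As an illustration only, the sub-band allocation and power control problem described in the abstract could take a generic form such as the one sketched below. This is not the paper's formulation. Every symbol is an assumption introduced here: N end nodes, K sub-bands, binary assignment variables x_{n,k}, transmit powers p_{n,k}, channel gains g_{n,k}, sub-band bandwidth B, noise power \sigma^2, requested rates R_n, an SNR gap \Gamma standing in for the target BER, and P_max taken as the 50 mW limit applied per node.

```latex
% Illustrative sketch; all symbols are assumptions, not taken from the paper.
% x_{n,k} = 1 if sub-band k is assigned to end node n, 0 otherwise.
% \Gamma is an assumed SNR gap that encodes the desired BER.
\begin{align}
\min_{x,\,p}\quad & \sum_{n=1}^{N}\sum_{k=1}^{K} x_{n,k}\,p_{n,k} \\
\text{s.t.}\quad
& \sum_{k=1}^{K} x_{n,k}\,B\log_2\!\left(1+\frac{p_{n,k}\,g_{n,k}}{\Gamma\,\sigma^{2}}\right) \ge R_n, && \forall n \\
& \sum_{k=1}^{K} x_{n,k}\,p_{n,k} \le P_{\max}, && \forall n \\
& \sum_{n=1}^{N} x_{n,k} \le 1, && \forall k \\
& x_{n,k}\in\{0,1\},\quad p_{n,k}\ge 0, && \forall n,k
\end{align}
```

In a formulation of this shape, the binary variables x_{n,k} multiplied by the continuous powers p_{n,k} give a mixed-integer problem, which is consistent with the abstract's statement that the problem is non-convex and NP-hard.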
Pages: 106964-106980
Page count: 17