Hierarchical multiagent reinforcement learning schemes for air traffic management

被引:0
作者
Christos Spatharis
Alevizos Bastas
Theocharis Kravaris
Konstantinos Blekas
George A. Vouros
Jose Manuel Cordero
机构
[1] University of Ioannina,Department of Computer Science and Engineering
[2] University of Piraeus,Department of Digital Systems
[3] CRIDA,undefined
来源
Neural Computing and Applications | 2023年 / 35卷
关键词
Multiagent reinforcement learning; Hierarchical learning; State abstraction; Congestion problems; Air traffic management;
D O I
暂无
中图分类号
学科分类号
摘要
In this work we investigate the use of hierarchical multiagent reinforcement learning methods for the computation of policies to resolve congestion problems in the air traffic management domain. To address cases where the demand of airspace use exceeds capacity, we consider agents representing flights, who need to decide on ground delays at the pre-tactical stage of operations, towards executing their trajectories while adhering to airspace capacity constraints. Hierarchical reinforcement learning manages to handle real-world problems with high complexity, by partitioning the task into hierarchies of states and/or actions. This provides an efficient way of exploring the state–action space and constructing an advantageous decision-making mechanism. We first establish a general framework of hierarchical multiagent reinforcement learning, and then, we further formulate four alternative schemes of abstractions, on states, actions, or both. To quantitatively assess the quality of solutions of the proposed approaches and show the potential of the hierarchical methods in resolving the demand–capacity balance problem, we provide experimental results on real-world evaluation cases, where we measure the average delay per flight and the number of flights with delays.
引用
收藏
页码:147 / 159
页数:12
相关论文
共 23 条
[1]  
Agogino AK(2012)A multiagent approach to managing air traffic flow Auton Agents Multiagent Syst 24 1-25
[2]  
Tumer K(2000)Hierarchical reinforcement learning with the maxq value function decomposition J Artif Intell Res 13 227-303
[3]  
Dietterich T(2006)Collaborative multiagent reinforcement learning by payoff propagation J Mach Learn Res 7 1789-1828
[4]  
Kok JR(2020)Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning Auton Robot 44 485-503
[5]  
Vlassis N(2004)Social optimality and cooperation in nonatomic congestion games J Econ Theory 114 56-87
[6]  
Ma A(2017)Deeploco: dynamic locomotion skills using hierarchical deep reinforcement learning ACM Trans Graph 36 1-13
[7]  
Ouimet M(2011)Congestion games with failures Discr Appl Math 159 1508-1525
[8]  
Cortés J(2017)A neural model of hierarchical reinforcement learning PLoS One 12 e0180234-67
[9]  
Milchtaich I(1973)A class of games processing pure-strategy nash equilibria Int J Game Theory 2 65-211
[10]  
Peng XB(1999)Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning Artif Intell 112 181-undefined