Hierarchical multiagent reinforcement learning schemes for air traffic management

Cited by: 0

Authors:

Christos Spatharis

Alevizos Bastas

Theocharis Kravaris

Konstantinos Blekas

George A. Vouros

Jose Manuel Cordero

Affiliations:

[1] University of Ioannina, Department of Computer Science and Engineering

[2] University of Piraeus, Department of Digital Systems

[3] CRIDA

Source:

Neural Computing and Applications | 2023 / Vol. 35

Keywords:

Multiagent reinforcement learning; Hierarchical learning; State abstraction; Congestion problems; Air traffic management

Abstract:

In this work, we investigate hierarchical multiagent reinforcement learning methods for computing policies that resolve congestion problems in the air traffic management domain. To address cases where demand for airspace use exceeds capacity, we consider agents representing flights, which must decide on ground delays at the pre-tactical stage of operations so that they can execute their trajectories while adhering to airspace capacity constraints. Hierarchical reinforcement learning can handle highly complex real-world problems by partitioning the task into hierarchies of states and/or actions. This provides an efficient way of exploring the state–action space and constructing an advantageous decision-making mechanism. We first establish a general framework for hierarchical multiagent reinforcement learning and then formulate four alternative abstraction schemes, on states, actions, or both. To quantitatively assess the solution quality of the proposed approaches and show the potential of hierarchical methods in resolving the demand–capacity balance problem, we provide experimental results on real-world evaluation cases, where we measure the average delay per flight and the number of flights with delays.
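To make the demand–capacity balance setting and the two reported metrics concrete, the following is a minimal toy sketch. It is not the authors' method and involves no learning; the flight data, sector names, capacity values, discrete time periods, and all function names are assumptions made purely for illustration. It shows how ground delays shift a flight's sector crossings in time, how over-capacity (sector, period) pairs can be detected, and how the average delay per flight and the number of delayed flights can be computed.

```python
from collections import Counter


def count_demand(flights, delays):
    """Count flights per (sector, period) after applying ground delays.

    flights: dict mapping flight id -> list of (sector, period) crossings.
    delays:  dict mapping flight id -> ground delay in periods (default 0).
    Both structures are hypothetical, chosen only for this illustration.
    """
    demand = Counter()
    for flight_id, crossings in flights.items():
        shift = delays.get(flight_id, 0)
        for sector, period in crossings:
            # A ground delay shifts the whole trajectory later in time.
            demand[(sector, period + shift)] += 1
    return demand


def hotspots(demand, capacity):
    """Return the (sector, period) pairs whose demand exceeds sector capacity."""
    return {key: n for key, n in demand.items() if n > capacity[key[0]]}


def delay_metrics(flights, delays):
    """Return (average delay per flight, number of flights with a delay)."""
    total = sum(delays.get(f, 0) for f in flights)
    delayed = sum(1 for f in flights if delays.get(f, 0) > 0)
    return total / len(flights), delayed


# Toy scenario (assumed data): three flights, two sectors, capacity 2 each.
flights = {
    "F1": [("A", 0), ("B", 1)],
    "F2": [("A", 0)],
    "F3": [("A", 0), ("B", 1)],
}
capacity = {"A": 2, "B": 2}

# Without delays, sector A is over capacity in period 0.
before = hotspots(count_demand(flights, {}), capacity)

# Delaying F2 by one period removes the hotspot.
delays = {"F2": 1}
after = hotspots(count_demand(flights, delays), capacity)
avg_delay, num_delayed = delay_metrics(flights, delays)
```

In this toy case a single one-period delay resolves the only hotspot. In the setting the abstract describes, each flight agent would instead learn which delay to choose, and the hierarchical schemes would abstract over the states and/or actions involved in that choice.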

Pages: 147-159

Page count: 12
