Hierarchical multiagent reinforcement learning schemes for air traffic management

被引:0
作者
Christos Spatharis
Alevizos Bastas
Theocharis Kravaris
Konstantinos Blekas
George A. Vouros
Jose Manuel Cordero
机构
[1] University of Ioannina,Department of Computer Science and Engineering
[2] University of Piraeus,Department of Digital Systems
[3] CRIDA,undefined
来源
Neural Computing and Applications | 2023年 / 35卷
关键词
Multiagent reinforcement learning; Hierarchical learning; State abstraction; Congestion problems; Air traffic management;
D O I
暂无
中图分类号
学科分类号
摘要
In this work we investigate the use of hierarchical multiagent reinforcement learning methods for the computation of policies to resolve congestion problems in the air traffic management domain. To address cases where the demand of airspace use exceeds capacity, we consider agents representing flights, who need to decide on ground delays at the pre-tactical stage of operations, towards executing their trajectories while adhering to airspace capacity constraints. Hierarchical reinforcement learning manages to handle real-world problems with high complexity, by partitioning the task into hierarchies of states and/or actions. This provides an efficient way of exploring the state–action space and constructing an advantageous decision-making mechanism. We first establish a general framework of hierarchical multiagent reinforcement learning, and then, we further formulate four alternative schemes of abstractions, on states, actions, or both. To quantitatively assess the quality of solutions of the proposed approaches and show the potential of the hierarchical methods in resolving the demand–capacity balance problem, we provide experimental results on real-world evaluation cases, where we measure the average delay per flight and the number of flights with delays.
引用
收藏
页码:147 / 159
页数:12
相关论文
共 50 条
  • [41] Air traffic complexity for a distributed air traffic management system
    Brazdilova, S. L.
    Casek, P.
    Kubalcik, J.
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2011, 225 (G6) : 665 - 674
  • [42] Potential-Based Difference Rewards for Multiagent Reinforcement Learning
    Devlin, Sam
    Yliniemi, Logan
    Kudenko, Daniel
    Tumer, Kagan
    AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 165 - 172
  • [43] Multiagent reinforcement learning in extensive form games with complete information
    Akramizadeh, Ali
    Menhaj, Mohammad-B.
    Afshar, Ahmad
    ADPRL: 2009 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2009, : 205 - 211
  • [44] V-Learning-A Simple, Efficient, Decentralized Algorithm for Multiagent Reinforcement Learning
    Jin, Chi
    Liu, Qinghua
    Wang, Yuanhao
    Yu, Tiancheng
    MATHEMATICS OF OPERATIONS RESEARCH, 2024, 49 (04) : 2295 - 2322
  • [45] Satisficing Paths and Independent Multiagent Reinforcement Learning in Stochastic Games
    Yongacoglu, Bora
    Arslan, Gurdal
    Yuksel, Serdar
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2023, 5 (03): : 745 - 773
  • [46] Learning a robust multiagent driving policy for traffic congestion reduction
    Zhang, Yulin
    Macke, William
    Cui, Jiaxun
    Hornstein, Sharon
    Urieli, Daniel
    Stone, Peter
    NEURAL COMPUTING & APPLICATIONS, 2023,
  • [47] An Information Fusion Approach to Intelligent Traffic Signal Control Using the Joint Methods of Multiagent Reinforcement Learning and Artificial Intelligence of Things
    Yang, Xiaoxian
    Xu, Yueshen
    Kuang, Li
    Wang, Zhiying
    Gao, Honghao
    Wang, Xuejie
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 9335 - 9345
  • [48] A new hierarchical architecture for Air Traffic Management: Optimisation of airway capacity in a Free Flight scenario
    Dell'Olmo, P
    Lulli, G
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2003, 144 (01) : 179 - 193
  • [49] Model primitives for hierarchical lifelong reinforcement learning
    Bohan Wu
    Jayesh K. Gupta
    Mykel Kochenderfer
    Autonomous Agents and Multi-Agent Systems, 2020, 34
  • [50] Model primitives for hierarchical lifelong reinforcement learning
    Wu, Bohan
    Gupta, Jayesh K.
    Kochenderfer, Mykel
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2020, 34 (01)