General real-time three-dimensional multi-aircraft conflict resolution method using multi-agent reinforcement learning

Cited by: 6
Authors
Chen, Yutong [1 ,2 ,3 ]
Xu, Yan [1 ]
Yang, Lei [2 ,3 ]
Hu, Minghua [2 ,3 ]
Affiliations
[1] Cranfield Univ, Cranfield MK43 0AL, Bedfordshire, England
[2] Nanjing Univ Aeronaut & Astronaut, Nanjing 210000, Peoples R China
[3] State Key Lab Air Traff Management Syst, Nanjing 210000, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China;
Keywords
Air traffic management; Three-dimensional multi-aircraft conflict resolution; Multi-agent reinforcement learning; Deep Q-learning network; Generalisation; Uncertainty;
DOI
10.1016/j.trc.2023.104367
Chinese Library Classification number
U [Transportation];
Discipline classification code
08 ; 0823 ;
Abstract
Reinforcement learning (RL) techniques have been studied for solving the conflict resolution (CR) problem in air traffic management, leveraging their potential for computation and ability to handle uncertainty. However, challenges remain that impede the application of RL methods to CR in practice, including three-dimensional manoeuvres, generalisation, trajectory recovery, and success rate. This paper proposes a general multi-agent reinforcement learning approach for real-time three-dimensional multi-aircraft conflict resolution, in which agents share a neural network and are deployed on each aircraft to form a distributed decision-making system. To address the challenges, several technologies are introduced, including a partial observation model based on imminent threats for generalisation, a safety separation relaxation model for multiple flight levels for three-dimensional manoeuvres, an adaptive manoeuvre strategy for trajectory recovery, and a conflict buffer model for success rate. The Rainbow Deep Q-learning Network (DQN) is used to enhance the efficiency of the RL process. A simulation environment that considers flight uncertainty (resulting from mechanical and navigation errors and wind) is constructed to train and evaluate the proposed approach. The experimental results demonstrate that the proposed method can resolve conflicts in scenarios with much higher traffic density than in today's real-world situations.
Pages: 28
Related papers
50 in total
  • [11] Train rescheduling method based on multi-agent reinforcement learning
    Cao, Yuli
    Xu, Zhongwei
    Mei, Meng
    2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022 : 301 - 305
  • [12] WRFMR: A Multi-Agent Reinforcement Learning Method for Cooperative Tasks
    Liu, Hui
    Zhang, Zhen
    Wang, Dongqing
    IEEE ACCESS, 2020, 8 : 216320 - 216331
  • [13] Joint autonomous decision-making of conflict resolution and aircraft scheduling based on triple-aspect improved multi-agent reinforcement learning
    Huang, Xiao
    Tian, Yong
    Li, Jiangchen
    Zhang, Naizhong
    Dong, Xingchen
    Lv, Yue
    Li, Zhixiong
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 275
  • [14] A Supply Chain Inventory Management Method for Civil Aircraft Manufacturing Based on Multi-Agent Reinforcement Learning
    Piao, Mingjie
    Zhang, Dongdong
    Lu, Hu
    Li, Rupeng
    APPLIED SCIENCES-BASEL, 2023, 13 (13)
  • [15] A Study on Real-Time Scheduling for Holonic Manufacturing Systems - Determination of Utility Values Based on Multi-agent Reinforcement Learning
    Iwamura, Koji
    Mayumi, Norihisa
    Tanimizu, Yoshitaka
    Sugimura, Nobuhiro
    HOLONIC AND MULTI-AGENT SYSTEMS FOR MANUFACTURING, PROCEEDINGS, 2009, 5696 : 135 - 144
  • [16] Smart Grid for Industry Using Multi-Agent Reinforcement Learning
    Roesch, Martin
    Linder, Christian
    Zimmermann, Roland
    Rudolf, Andreas
    Hohmann, Andrea
    Reinhart, Gunther
    APPLIED SCIENCES-BASEL, 2020, 10 (19) : 1 - 20
  • [17] Multi-UAV Escape Target Search: A Multi-Agent Reinforcement Learning Method
    Liao, Guang
    Wang, Jian
    Yang, Dujia
    Yang, Junan
    SENSORS, 2024, 24 (21)
  • [18] Three-dimensional cooperative guidance with impact angle constraints via value-policy decomposed multi-agent reinforcement learning
    Qiu, Xiaoqi
    Gao, Changsheng
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 155
  • [19] A Multi-Agent Reinforcement Learning Method for Omnidirectional Walking of Bipedal Robots
    Mou, Haiming
    Xue, Jie
    Liu, Jian
    Feng, Zhen
    Li, Qingdu
    Zhang, Jianwei
    BIOMIMETICS, 2023, 8 (08)
  • [20] Cooperative Multi-agent Reinforcement Learning for Multiple Anti-aircraft Target Surveillance
    Lee, Kangbeen
    Baek, Seungjae
    Jung, Philjoon
    Kim, Tae-Hyun
    Jeon, Jeong Hwan
    Journal of Institute of Control, Robotics and Systems, 2024, 30 (06) : 587 - 595