General real-time three-dimensional multi-aircraft conflict resolution method using multi-agent reinforcement learning

被引：6

作者：

Chen, Yutong ^{[1
,2
,3
]}

Xu, Yan ^{[1
]}

Yang, Lei ^{[2
,3
]}

Hu, Minghua ^{[2
,3
]}

机构：

[1] Cranfield Univ, Cranfield MK43 0AL, Bedfordshire, England

[2] Nanjing Univ Aeronaut & Astronaut, Nanjing 210000, Peoples R China

[3] State Key Lab Air Traff Management Syst, Nanjing 210000, Peoples R China

来源：

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES | 2023年 / 157卷

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

Air traffic management; Three-dimensional multi-aircraft conflict; resolution; Multi-agent reinforcement learning; Deep q-learning network; Generalisation; Uncertainty;

D O I：

10.1016/j.trc.2023.104367

中图分类号：

U [交通运输];

学科分类号：

08 ; 0823 ;

摘要：

Reinforcement learning (RL) techniques have been studied for solving the conflict resolution (CR) problem in air traffic management, leveraging their potential for computation and ability to handle uncertainty. However, challenges remain that impede the application of RL methods to CR in practice, including three-dimensional manoeuvres, generalisation, trajectory recovery, and success rate. This paper proposes a general multi-agent reinforcement learning approach for real-time three-dimensional multi-aircraft conflict resolution, in which agents share a neural network and are deployed on each aircraft to form a distributed decision-making system. To address the challenges, several technologies are introduced, including a partial observation model based on imminent threats for generalisation, a safety separation relaxation model for multiple flight levels for three-dimensional manoeuvres, an adaptive manoeuvre strategy for trajectory recovery, and a conflict buffer model for success rate. The Rainbow Deep Q-learning Network (DQN) is used to enhance the efficiency of the RL process. A simulation environment that considers flight uncertainty (resulting from mechanical and navigation errors and wind) is constructed to train and evaluate the proposed approach. The experimental results demonstrate that the proposed method can resolve conflicts in scenarios with much higher traffic density than in today's real-world situations.

引用

页数：28

共 50 条

[11] Train rescheduling method based on multi-agent reinforcement learning
Cao, Yuli
Xu, Zhongwei
Mei, Meng
2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 301 - 305
[12] WRFMR: A Multi-Agent Reinforcement Learning Method for Cooperative Tasks
Liu, Hui
Zhang, Zhen
Wang, Dongqing
IEEE ACCESS, 2020, 8 : 216320 - 216331
[13] Joint autonomous decision-making of conflict resolution and aircraft scheduling based on triple-aspect improved multi-agent reinforcement learning
Huang, Xiao
Tian, Yong
Li, Jiangchen
Zhang, Naizhong
Dong, Xingchen
Lv, Yue
Li, Zhixiong
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 275
[14] A Supply Chain Inventory Management Method for Civil Aircraft Manufacturing Based on Multi-Agent Reinforcement Learning
Piao, Mingjie
Zhang, Dongdong
Lu, Hu
Li, Rupeng
APPLIED SCIENCES-BASEL, 2023, 13 (13):
[15] A Study on Real-Time Scheduling for Holonic Manufacturing Systems - Determination of Utility Values Based on Multi-agent Reinforcement Learning
Iwamura, Koji
Mayumi, Norihisa
Tanimizu, Yoshitaka
Sugimura, Nobuhiro
HOLONIC AND MULTI-AGENT SYSTEMS FOR MANUFACTURING, PROCEEDINGS, 2009, 5696 : 135 - 144
[16] Smart Grid for Industry Using Multi-Agent Reinforcement Learning
Roesch, Martin
Linder, Christian
Zimmermann, Roland
Rudolf, Andreas
Hohmann, Andrea
Reinhart, Gunther
APPLIED SCIENCES-BASEL, 2020, 10 (19): : 1 - 20
[17] Multi-UAV Escape Target Search: A Multi-Agent Reinforcement Learning Method
Liao, Guang
Wang, Jian
Yang, Dujia
Yang, Junan
SENSORS, 2024, 24 (21)
[18] Three-dimensional cooperative guidance with impact angle constraints via value-policy decomposed multi-agent reinforcement learning
Qiu, Xiaoqi
Gao, Changsheng
AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 155
[19] A Multi-Agent Reinforcement Learning Method for Omnidirectional Walking of Bipedal Robots
Mou, Haiming
Xue, Jie
Liu, Jian
Feng, Zhen
Li, Qingdu
Zhang, Jianwei
BIOMIMETICS, 2023, 8 (08)
[20] Cooperative Multi-agent Reinforcement Learning for Multiple Anti-aircraft Target Surveillance
Lee, Kangbeen
Baek, Seungjae
Jung, Philjoon
Kim, Tae-Hyun
Jeon, Jeong Hwan
Journal of Institute of Control, Robotics and Systems, 2024, 30 (06) : 587 - 595

← 1 2 3 4 5 →