Multi-Agent Reinforcement Learning in Helicopter Airport Dispatching

被引:0
作者
Liu, Zhifei [1 ]
Dong, Qiang [1 ]
Lai, Jun [1 ]
Chen, Xiliang [1 ]
机构
[1] College of Command and Control Engineering, Army Engineering University, Nanjing
关键词
airport dispatching; multi-agent path finding; reinforcement learning; test platform;
D O I
10.3778/j.issn.1002-8331.2205-0370
中图分类号
学科分类号
摘要
Fast and efficient helicopter airport dispatching is the main challenge faced by modern helicopter airport dispatching system. Helicopter airport dispatching can be regarded as a classical multi-agent path finding problem. A helicopter airport dispatching test platform is designed, which uses a two-dimensional grid environment for rapid test of various algorithms. The airport dispatching test platform edits the map according to the actual terrain of the airport, and provides the traditional centralized planning algorithm and the algorithm based on multi-agent reinforcement learning to carry out fast and efficient simulation dispatching experiments. In order to explore the potential of multi-agent reinforcement learning in airport scheduling, a large number of experiments are carried out, and the applicability and characteristics of different types of algorithms are compared and analyzed. The experimental results show that the reinforcement learning method based on multi-agent has good scalability and real-time planning effect. The test platform provides a good starting point for further research on airport scheduling, and will have a beneficial impact on the application of multi-agent path finding in practical scenarios in the future. © 2023 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
引用
收藏
页码:285 / 294
页数:9
相关论文
共 35 条
[1]  
YAKOVLEV K, ANDEYCHUK A., Any-angle pathfinding for multiple agents based on SIPP algorithm, Proceedings of the 27th International Conference on Automated Planning and Scheduling, (2017)
[2]  
LI J, TINKA A, KIESEL S, Et al., Lifelong multi-agent path finding in large-scale warehouses[C], Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, pp. 1898-1900, (2020)
[3]  
MA H, YANG J, COHEN L, Et al., Feasibility study:moving non-homogeneous teams in congested video game environments[C], Proceedings of the 13th AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, pp. 270-272, (2017)
[4]  
LI J, CHEN Z, ZHENG Y, Et al., Scalable rail planning and replanning:winning the 2020 flatland challenge[C], Proceedings of the 31st International Conference on Automated Planning and Scheduling, pp. 477-485, (2021)
[5]  
CARTUCHO J, VENTURA R, VELOSO M., Robust object recognition through symbiotic deep learning in mobile robots[C], 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 2336-2341, (2018)
[6]  
FELNER A, STERN R, SHIMONY S E, Et al., Search-based optimal solvers for the multi-agent pathfinding problem:summary and challenges, 10th Annual Symposium on Combinatorial Search, (2017)
[7]  
SURYNEK P, FELNER A, Et al., An empirical comparison of the hardness of multi-agent path finding under the makespan and the sum of costs objectives, 9th Annual Symposium on Combinatorial Search, (2016)
[8]  
BARTAK R, SVANCARA J, SKOPKOVA V, Et al., Multiagent path finding on real robots:first experience with ozobots[C], 16th Ibero-American Conference on Artificial Intelligence, pp. 290-301, (2018)
[9]  
ELLIS J., Multi-agent path finding with reinforcement learning, (2021)
[10]  
MA H, HARABOR D, STUCKEY P J, Et al., Searching with consistent prioritization for multi-agent path finding[C], Proceedings of the 33rd AAAI Conference on Artificial Intelligence, pp. 7643-7650, (2019)