Train rescheduling method based on multi-agent reinforcement learning

被引：0

作者：

Cao, Yuli ^{[1
]}

Xu, Zhongwei ^{[1
]}

Mei, Meng ^{[1
]}

机构：

[1] Tongji Univ, Sch Elect & Informat Engn, Shanghai 200000, Peoples R China

来源：

2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC) | 2022年

基金：

中国国家自然科学基金;

关键词：

multi-agent reinforcement learning; vehicle rescheduling; deep Q-learning; dynamic schedule;

D O I：

10.1109/IAEAC54830.2022.9929607

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Many multi-agent pathfinding algorithms have been raised to arrange trains' scheduling effectively and have reached excellent results. However, these algorithms usually focus on the fixed schedule and have a poor ability to deal with dynamic problems. This paper presents a train rescheduling method based on multi-agent reinforcement learning. A new observation is adopted for trains to better interact with the environment and other trains. The improved DQN network is implemented to train to obtain the best performance, such as avoiding conflicts, handling trains' breakdowns and generating new paths. According to simulation results, the model achieved an aggregate completion rate of over 70% of ten agents after training. Compared with the traditional multi-agent pathfinding algorithm CBS, this method was 20% higher in terms of completion rate when the malfunction rate was over 20%. Conclusively, the method has better handled unexpected situations and has excellent adaptability to problems such as sudden train breakdowns.

引用

页码：301 / 305

页数：5

共 10 条

[1] An optimization study based on Dijkstra algorithm for a network with trapezoidal picture fuzzy numbers
Akram, Muhammad
Habib, Amna
Alcantud, Jose Carlos R.
[J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (04) : 1329 - 1342
[2] [Anonymous], 2018, PLAYING ATARI DEEP R
[3] CLASSIFICATION IN VEHICLE-ROUTING AND SCHEDULING
BODIN, L
GOLDEN, B
[J]. NETWORKS, 1981, 11 (02) : 97 - 108
[4] Gupta Jayesh K., 2017, Autonomous Agents and Multiagent Systems, AAMAS 2017: Workshops, Best Papers. Revised Selected Papers: LNAI 10642, P66, DOI 10.1007/978-3-319-71682-4_5
[5] Horgan D, 2018, Arxiv, DOI arXiv:1803.00933
[6] Islam Robiul, 2021, EVOLUTIONARY ALGORIT
[7] Jiang JC, 2018, ADV NEUR IN, V31
[8] Jonas Walter, 2020, THESIS HSR
[9] Li JQ, 2007, NETWORKS, V50, P211, DOI 10.1002/net.20l99
[10] Sharon Guni, 2012, NATL C ARTIFICIAL IN

← 1 →