Dynamic Dispatching for Large-Scale Heterogeneous Fleet via Multi-agent Deep Reinforcement Learning

被引：28

作者：

Zhang, Chi ^{[1
]}

Odonkor, Philip ^{[2
]}

Zheng, Shuai ^{[1
]}

Khorasgani, Hamed ^{[1
]}

Serita, Susumu ^{[1
]}

Gupta, Chetan ^{[1
]}

Wang, Haiyan ^{[1
]}

机构：

[1] Hitachi Amer Ltd, Ind AI Lab, Santa Clara, CA 95054 USA

[2] Stevens Inst Technol, Hoboken, NJ 07030 USA

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2020年

关键词：

Dispatching; Reinforcement Learning; Mining;

D O I：

10.1109/BigData50022.2020.9378191

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Dynamic dispatching is one of the core problems for operation optimization in traditional industries such as mining, as it is about how to smartly allocate the right resources to the right place at the right time. Conventionally, the industry relies on heuristics or even human intuitions which are often short-sighted and sub-optimal solutions. Leveraging the power of AI and Internet of Things (IoT), data-driven automation is reshaping this area. However, facing its own challenges such as large-scale and heterogenous trucks running in a highly dynamic environment, it can barely adopt methods developed in other domains (e.g., ride-sharing). In this paper, we propose a novel Deep Reinforcement Learning approach to solve the dynamic dispatching problem in mining. We first develop an event-based mining simulator with parameters calibrated in real mines. Then we propose an experience-sharing Deep Q Network with a novel abstract state/action representation to learn memories from heterogeneous agents altogether and realizes learning in a centralized way. We demonstrate that the proposed methods significantly outperform the most widely adopted approaches in the industry by 5.56% in terms of productivity. The proposed approach has great potential in a broader range of industries (e.g., manufacturing, logistics) which have a large-scale of heterogenous equipment working in a highly dynamic environment, as a general framework for dynamic resource allocation.

引用

页码：1436 / 1441

页数：6

共 50 条

[41] Transform networks for cooperative multi-agent deep reinforcement learning [J].

Wang, Hongbin ;

Xie, Xiaodong ;

Zhou, Lianke .

APPLIED INTELLIGENCE, 2023, 53 (08) :9261-9269

[42] Autonomous Separation Assurance with Deep Multi-Agent Reinforcement Learning [J].

Brittain, Marc W. ;

Yang, Xuxi ;

Wei, Peng .

JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2021, 18 (12) :890-905

[43] Cooperative Multi-Agent Deep Reinforcement Learning with Counterfactual Reward [J].

Shao, Kun ;

Zhu, Yuanheng ;

Tang, Zhentao ;

Zhao, Dongbin .

2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,

[44] Applications of Multi-Agent Deep Reinforcement Learning: Models and Algorithms [J].

Ibrahim, Abdikarim Mohamed ;

Yau, Kok-Lim Alvin ;

Chong, Yung-Wey ;

Wu, Celimuge .

APPLIED SCIENCES-BASEL, 2021, 11 (22)

[45] Decentralized Multi-Agent Pursuit Using Deep Reinforcement Learning [J].

de Souza, Cristino, Jr. ;

Newbury, Rhys ;

Cosgun, Akansel ;

Castillo, Pedro ;

Vidolov, Boris ;

Kulic, Dana .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (03) :4552-4559

[46] Deep Multi-agent Reinforcement Learning in a Homogeneous Open Population [J].

Radulescu, Roxana ;

Legrand, Manon ;

Efthymiadis, Kyriakos ;

Roijers, Diederik M. ;

Nowe, Ann .

ARTIFICIAL INTELLIGENCE, BNAIC 2018, 2019, 1021 :90-105

[47] Multi-Agent Deep Reinforcement Learning for Sectional AGC Dispatch [J].

Li, Jiawen ;

Yu, Tao ;

Zhu, Hanxin ;

Li, Fusheng ;

Lin, Dan ;

Li, Zhuohuan .

IEEE ACCESS, 2020, 8 :158067-158081

[48] Heterogeneous Multi-Robot Cooperation With Asynchronous Multi-Agent Reinforcement Learning [J].

Zhang, Han ;

Zhang, Xiaohui ;

Feng, Zhao ;

Xiao, Xiaohui .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01) :159-166

[49] A multi-agent reinforcement learning approach to dynamic service composition [J].

Wang, Hongbing ;

Wang, Xiaojun ;

Hu, Xingguo ;

Zhang, Xingzhi ;

Gu, Mingzhu .

INFORMATION SCIENCES, 2016, 363 :96-119

[50] Deep Reinforcement Learning for Routing a Heterogeneous Fleet of Vehicles [J].

Manuel Vera, Jose ;

Abad, Andres G. .

2019 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2019, :239-+

← 1 2 3 4 5 →