Deep reinforcement learning with combinatorial actions spaces: An to maintenance

被引:17
作者
Goby, Niklas [1 ]
Brandt, Tobias [2 ,3 ]
Neumann, Dirk [1 ]
机构
[1] Univ Freiburg, Chair Informat Syst, Freiburg, Germany
[2] WWU Munster, European Res Ctr Informat Syst, Munster, Germany
[3] Erasmus Univ, Rotterdam Sch Management, Rotterdam, Netherlands
关键词
Maintenance; Prescriptive analytics; Deep reinforcement learning; Combinatorial action space; Prescriptive maintenance; POLICIES; OPTIMIZATION; PROGNOSTICS; SYSTEMS;
D O I
10.1016/j.cie.2023.109165
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we leverage a prescriptive analytics approach based on deep reinforcement learning and adapt it for sequential decision-problems with large, noisy state spaces and combinatorial actions spaces. We implement a novel mechanism that uses deep learning to reduce the action space and apply the approach to the context of maintenance management. We show that our method substantially outperforms established baseline methods from practice and research, closing more than 90 percent of the cost gap between the next-best solution and the optimum under perfect information. In addition to reducing costs, the specifically-designed reward function incentivizes bundling maintenance actions in a way that fully utilizes the available number of workers. Thereby, the number of time steps in which any maintenance action occurs is reduced. This decreases the organizational and operational impact of maintenance in real-world settings as disruptions can be limited to a few days. Beyond this context, our work illustrates the potential of prescriptive approaches based on deep reinforcement learning in other applications that face similarly challenging problem settings.
引用
收藏
页数:13
相关论文
共 60 条
[1]   Group Maintenance: A Restless Bandits Approach [J].
Abbou, Abderrahmane ;
Makis, Viliam .
INFORMS JOURNAL ON COMPUTING, 2019, 31 (04) :719-731
[2]   Managing engineering systems with large state and action spaces through deep reinforcement learning [J].
Andriotis, C. P. ;
Papakonstantinou, K. G. .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2019, 191
[3]   Cost-based text understanding to improve maintenance knowledge intelligence in manufacturing enterprises [J].
Ansari, Fazel .
COMPUTERS & INDUSTRIAL ENGINEERING, 2020, 141
[4]   PriMa: a prescriptive maintenance model for cyber-physical production systems [J].
Ansari, Fazel ;
Glawar, Robert ;
Nemeth, Tanja .
INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2019, 32 (4-5) :482-503
[5]   Piecewise deterministic Markov process for condition-based maintenance models - Application to critical infrastructures with discrete-state deterioration [J].
Arismendi, Renny ;
Barros, Anne ;
Grall, Antoine .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2021, 212
[6]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38
[7]   OPTIMUM PREVENTIVE MAINTENANCE POLICIES [J].
BARLOW, R ;
HUNTER, L .
OPERATIONS RESEARCH, 1960, 8 (01) :90-100
[8]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[9]   From Predictive to Prescriptive Analytics [J].
Bertsimas, Dimitris ;
Kallus, Nathan .
MANAGEMENT SCIENCE, 2020, 66 (03) :1025-1044
[10]  
Bradbury S., 2018, DIGITALLY ENABLED RE