Reinforcement learning in reliability and maintenance optimization: A tutorial

被引:14
作者
Zhang, Qin [1 ]
Liu, Yu [1 ,2 ]
Xiang, Yisha [3 ]
Xiahou, Tangfan [1 ,2 ]
机构
[1] Univ Elect Sci & Technol China, Sch Mech & Elect Engn, Chengdu 611731, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Ctr Syst Reliabil & Safety, Chengdu 611731, Sichuan, Peoples R China
[3] Univ Houston, Dept Ind Engn, Houston, TX 77004 USA
关键词
Markov decision process; Reinforcement learning; Reliability optimization; Maintenance optimization; DEEP; POLICIES; SYSTEMS; LEVEL; GAME; GO;
D O I
10.1016/j.ress.2024.110401
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The increasing complexity of engineering systems presents significant challenges in addressing intricate reliability and maintenance optimization problems. Advanced computational techniques have become imperative. Reinforcement learning, with its strong capability of solving complicated sequential decision-making problems under uncertainty, has emerged as a powerful tool to reliability and maintainability community. This paper offers a step-by-step guideline on the utilization of reinforcement learning algorithms for resolving reliability optimization and maintenance optimization problems. We first introduce the Markov decision process modeling framework for these problems and elucidate the design and implementation of solution algorithms, including dynamic programming, reinforcement learning, and deep reinforcement learning. Case studies, including a pipeline system mission abort optimization and a manufacturing system condition-based maintenance decisionmaking, are included to demonstrate the utility of reinforcement learning in reliability and maintenance applications. This tutorial will assist researchers in the reliability and maintainability community by summarizing the state-of-the-art reinforcement learning algorithms and providing the hand-on implementations in reliability and maintenance optimization problems.
引用
收藏
页数:16
相关论文
共 76 条
[1]   A review on condition-based maintenance optimization models for stochastically deteriorating system [J].
Alaswad, Suzan ;
Xiang, Yisha .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2017, 157 :54-63
[2]   Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints [J].
Andriotis, C. P. ;
Papakonstantinou, K. G. .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2021, 212
[3]   Managing engineering systems with large state and action spaces through deep reinforcement learning [J].
Andriotis, C. P. ;
Papakonstantinou, K. G. .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2019, 191
[4]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38
[5]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[6]  
Berner Christopher, 2019, arXiv
[7]   Deep Reinforcement Learning and Its Neuroscientific Implications [J].
Botvinick, Matthew ;
Wang, Jane X. ;
Dabney, Will ;
Miller, Kevin J. ;
Kurth-Nelson, Zeb .
NEURON, 2020, 107 (04) :603-616
[8]   Joint condition-based maintenance and load-sharing optimization for two-unit systems with economic dependency [J].
Broek, Michiel A. J. Uit Het ;
Teunter, Ruud H. ;
de Jonge, Bram ;
Veldman, Jasper .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 295 (03) :1119-1131
[9]   Joint condition-based maintenance and condition-based production optimization [J].
Broek, Michiel A. J. Uit Het ;
Teunter, Ruud H. ;
de Jonge, Bram ;
Veldman, Jasper .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2021, 214
[10]  
Chang HS, 2013, COMMUN CONTROL ENG, P1, DOI 10.1007/978-1-4471-5022-0_1