A model-based reinforcement learning approach for maintenance optimization of degrading systems in a large state space

Cited by: 30

Authors:

Zhang, Ping [1,2]

Zhu, Xiaoyan [1]

Xie, Min [2,3]

Affiliations:

[1] Univ Chinese Acad Sci, Sch Econ & Management, Bldg 7,80 Zhongguancun East Rd, Beijing, Peoples R China

[2] City Univ Hong Kong, Dept Syst Engn & Engn Management, Hong Kong, Peoples R China

[3] City Univ Hong Kong, Sch Data Sci, Hong Kong, Peoples R China

Source:

COMPUTERS & INDUSTRIAL ENGINEERING | 2021 / Vol. 161

Funding:

National Natural Science Foundation of China

Keywords:

Maintenance optimization; Periodic inspection; Model-based reinforcement learning; Degrading system; PREDICTIVE MAINTENANCE; DEGRADATION; RELIABILITY; POLICY; ANALYTICS; SUBJECT; PARTS;

DOI:

10.1016/j.cie.2021.107622

Chinese Library Classification (CLC) number:

TP39 [Computer applications]

Discipline classification codes:

081203; 0835

Abstract:

Maintenance scheduling for deteriorating systems is usually built on degradation models. However, the formulas governing the degradation process are often unknown and hard to determine for a system operating in practice. In this study, we develop a model-based reinforcement learning approach for maintenance optimization. The approach determines a maintenance action for each degradation state at each inspection time over a finite planning horizon, whether or not the degradation formula is known. At each inspection time, it attempts to learn an optimal assessment value for each maintenance action that can be performed at each degradation state. The assessment value quantifies the goodness of each state-action pair in terms of minimizing the accumulated maintenance costs over the planning horizon. To optimize the assessment values when a well-defined degradation formula is known, we customize a Q-learning method with model-based acceleration. When the degradation formula is unknown or hard to determine, we develop a Dyna-Q method with maintenance-oriented improvements, in which an environment model capturing the degradation pattern under different maintenance actions is learned first; the assessment values are then optimized while accounting for the stochastic behavior of the system degradation. The final maintenance policy is obtained by performing the maintenance actions associated with the highest assessment values. Experimental studies are presented to illustrate the applications.
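
The Dyna-Q idea described in the abstract can be illustrated with a minimal tabular sketch. This is not the authors' implementation and includes none of their maintenance-oriented improvements. The six degradation levels, the two actions, the cost figures, and the `simulate` function are all hypothetical stand-ins chosen for illustration. The table here stores expected cost-to-go, so the preferred action is the one with the lowest value, which is equivalent to picking the highest assessment value when values are defined as negated costs.

```python
import random

N_STATES = 6            # degradation levels 0 (as new) .. 5 (failed); illustrative
FAILED = N_STATES - 1
ACTIONS = (0, 1)        # 0 = do nothing, 1 = replace; illustrative action set
HORIZON = 10            # inspection epochs in the finite planning horizon

def simulate(state, action, rng):
    """Hypothetical stand-in for the real system: returns (next_state, cost)."""
    base, cost = (0, 30.0) if action == 1 else (state, 0.0)
    nxt = min(FAILED, base + rng.choice((0, 1, 1, 2)))  # random wear per period
    if nxt == FAILED:
        cost += 100.0                                   # failure penalty
    return nxt, cost

def greedy(q, t, s):
    """Action with the lowest estimated cost-to-go at epoch t, state s."""
    return min(ACTIONS, key=lambda a: q[t][s][a])

def update(q, t, s, a, nxt, cost, alpha):
    """One Q-learning backup; costs are minimized, so the target uses min."""
    future = min(q[t + 1][nxt])  # q[HORIZON] stays all zeros (terminal)
    q[t][s][a] += alpha * (cost + future - q[t][s][a])

def dyna_q(episodes=2000, planning_steps=10, alpha=0.1, eps=0.2, seed=0):
    rng = random.Random(seed)
    q = [[[0.0] * len(ACTIONS) for _ in range(N_STATES)]
         for _ in range(HORIZON + 1)]
    model = {}  # (state, action) -> list of observed (next_state, cost) samples
    for _ in range(episodes):
        s = rng.randrange(N_STATES)  # random start so every state gets visited
        for t in range(HORIZON):
            # Epsilon-greedy choice between exploring and exploiting.
            a = rng.choice(ACTIONS) if rng.random() < eps else greedy(q, t, s)
            nxt, cost = simulate(s, a, rng)
            update(q, t, s, a, nxt, cost, alpha)      # learn from real experience
            model.setdefault((s, a), []).append((nxt, cost))
            for _ in range(planning_steps):           # learn from replayed samples
                ps, pa = rng.choice(list(model))
                pn, pc = rng.choice(model[(ps, pa)])
                update(q, rng.randrange(HORIZON), ps, pa, pn, pc, alpha)
            s = nxt
    return q

q = dyna_q()
policy_at_first_inspection = [greedy(q, 0, s) for s in range(N_STATES)]
print(policy_at_first_inspection)
```

The learned model keeps every observed outcome per state-action pair and samples from that list during planning, which is one simple way to preserve the stochastic behavior of the degradation. The sketch also assumes the degradation dynamics do not depend on the inspection epoch, which is why a replayed transition can be applied to a randomly chosen epoch.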

Pages: 14
