Joint optimization of preventive maintenance and production scheduling for multi-state production systems based on reinforcement learning

被引：79

作者：

Yang, Hongbing ^{[1
]}

Li, Wenchao ^{[2
]}

Wang, Bin ^{[3
]}

机构：

[1] Soochow Univ, Sch Mech & Elect Engn, Suzhou 215006, Peoples R China

[2] Jiangsu Univ, Sch Automot & Traff Engn, Zhenjiang 212013, Jiangsu, Peoples R China

[3] Jiangsu Acad Safety Sci & Technol, Nanjing 210042, Peoples R China

来源：

RELIABILITY ENGINEERING & SYSTEM SAFETY | 2021年 / 214卷

关键词：

Preventive maintenance; Production scheduling; Reinforcement learning; Markov decision process; Expected average rewards; INTEGRATED MAINTENANCE; POLICY;

D O I：

10.1016/j.ress.2021.107713

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Preventive maintenance and production scheduling are two important and interactive activities in production systems. In this work, the integrated optimization problem of production scheduling for multi-state single-machine production systems experiencing degradation processes is investigated. Preventive maintenance tasks and jobs scheduling are jointly considered to find the optimal production policy by considering the processing costs, the maintenance costs, and the completion rewards, simultaneously. We formulate the integrated optimization problem as Markov decision process framework. R-learning algorithm is introduced to maximize the long-run expected average rewards per time unit over infinite horizon. On the basis of the analysis of the optimal stationary policy, the appropriate condition to perform preventive maintenance following optimal stationary policy is presented. This provides the basis for the improvement in R-learning algorithm. Furthermore, a novel heuristic reinforcement learning method is proposed to deal with the integrated model more efficiently. Finally, we present the simulation results and analysis of the proposed algorithm's performance in terms of the number of job types and machine states. The simulation results and analysis show the effectiveness of the proposed approach for solving the integrated problems.

引用

页数：12

共 45 条

[1]

Abbasi-Yadkori Y, 2019, PR MACH LEARN RES, V97

[2] Reward-based Monte Carlo-Bayesian reinforcement learning for cyber preventive maintenance [J].

Allen, Theodore T. ;

Roychowdhury, Sayak ;

Liu, Enhao .

COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 126 :578-594

[3] Managing engineering systems with large state and action spaces through deep reinforcement learning [J].

Andriotis, C. P. ;

Papakonstantinou, K. G. .

RELIABILITY ENGINEERING & SYSTEM SAFETY, 2019, 191

[4]

[Anonymous], PROC ICML

[5] Research of an integrated decision model for production scheduling and maintenance planning with economic objective [J].

Ao, Yinhui ;

Zhang, Huiping ;

Wang, Cuifen .

COMPUTERS & INDUSTRIAL ENGINEERING, 2019, 137

[6] Integrated maintenance planning and production scheduling with Markovian deteriorating machine conditions [J].

Bajestani, Maliheh Aramon ;

Banjevic, Dragan ;

Beck, J. Christopher .

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2014, 52 (24) :7377-7400

[7] Reinforcement Learning for POMDP: Partitioned Rollout and Policy Iteration With Application to Autonomous Sequential Repair Problems [J].

Bhattacharya, Sushmita ;

Badyal, Sahil ;

Wheeler, Thomas ;

Gil, Stephanie ;

Bertsekas, Dimitri .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (03) :3967-3974

[8] Integrating preventive maintenance planning and production scheduling for a single machine [J].

Cassady, CR ;

Kutanoglu, E .

IEEE TRANSACTIONS ON RELIABILITY, 2005, 54 (02) :304-309

[9] Minimizing job tardiness using integrated preventive maintenance planning and production scheduling [J].

Cassady, CR ;

Kutanoglu, E .

IIE TRANSACTIONS, 2003, 35 (06) :503-513

[10] Integrated maintenance and operations decision making with imperfect degradation state observations [J].

Celen, Merve ;

Djurdjanovic, Dragan .

JOURNAL OF MANUFACTURING SYSTEMS, 2020, 55 :302-316

← 1 2 3 4 5 →