Multi-objective reinforcement learning-based framework for solving selective maintenance problems in reconfigurable cyber-physical manufacturing systems

被引：15

作者：

Achamrah, Fatima Ezzahra ^{[1
]}

Attajer, Ali ^{[2
,3
]}

机构：

[1] PSL Univ, Ctr management Sci CGS, CNRS i3 UMR9217, Mines Paris, Paris, France

[2] Univ Polytech Hauts De France, LAMIH, CNRS, UMR 8201, Valenciennes, France

[3] Univ Polytech Hauts De France, LAMIH, CNRS, UMR 8201, F-59313 Valenciennes, France

来源：

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH | 2024年 / 62卷 / 10期

关键词：

Selective maintenance; reconfigurable manufacturing systems; cyber-physical manufacturing systems; imperfect repairs; imperfect observations; multi-objective deep reinforcement learning; OPTIMIZATION;

D O I：

10.1080/00207543.2023.2240433

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Unlike mass production manufacturing systems, where configurations are rarely changed after the initial design, reconfigurable cyber-physical systems (RCPMS) self-change their structures throughout missions and thus self-adjust production in response to demand requirements. Accordingly, such a paradigm requires enhancing selective maintenance strategy to optimise scheduling maintenance actions, selecting configuration layouts for capacity and product family changes, and achieving maintenance cost reduction and reliability maximisation. This paper is the first to propose a robust model for a selective maintenance problem with imperfect repairs in the RCPMS context. The model also integrates uncertainties originating from the imperfect observations of components' health status. The model's objectives are to maximise the expected reliability and minimise the variance and maintenance cost under maintenance resource constraints. Moreover, we propose a new deep reinforcement learning framework for solving the resulting multi-objective and combinatorial optimisation problem. In addition, we use decision values to enhance the scalarisation process by permitting the priorities of specific objectives to be adjusted after the learning process. Furthermore, we employ Analytical Hierarchy Process to adjust the static priorities with respect to the objective functions and the actual learning context. Finally, broad experiments are conducted to highlight the performance of the proposed model and resolution framework.

引用

页码：3460 / 3482

页数：23

共 60 条

[1] An Artificial-Immune-System-Based Algorithm Enhanced with Deep Reinforcement Learning for Solving Returnable Transport Item Problems [J].

Achamrah, Fatima Ezzahra ;

Riane, Fouad ;

Sahin, Evren ;

Limbourg, Sabine .

SUSTAINABILITY, 2022, 14 (10)

[2] Solving inventory routing with transshipment and substitution under dynamic and stochastic demands using genetic algorithm and deep reinforcement learning [J].

Achamrah, Fatima Ezzahra ;

Riane, Fouad ;

Limbourg, Sabine .

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2022, 60 (20) :6187-6204

[3] Branch-and-price algorithms for large-scale mission-oriented maintenance planning problems [J].

Al-Jabouri, Hamzea ;

Saif, Ahmed ;

Diallo, Claver ;

Khatab, Abdelhakim .

COMPUTERS & OPERATIONS RESEARCH, 2023, 153

[4] Selective maintenance optimization: a condensed critical review and future research directions [J].

Al-Jabouri, Hamzea ;

Saif, Ahmed ;

Khatab, Abdelhakim ;

Diallo, Claver ;

Venkatadri, Uday .

IFAC PAPERSONLINE, 2022, 55 (10) :1213-1218

[5]

Attajer A, 2021, P INT WORKSHOP SERVI, P520

[6] An analytic hierarchy process augmented with expert rules for product driven control in cyber-physical manufacturing systems [J].

Attajer, Ali ;

Darmoul, Saber ;

Chaabane, Sondes ;

Sallez, Yves ;

Riane, Fouad .

COMPUTERS IN INDUSTRY, 2022, 143

[7] Distributed Maintenance: A Literature Analysis and Classification [J].

Attajer, Ali ;

Darmoul, Saber ;

Riane, Fouad ;

Bouras, Abdelghani .

IFAC PAPERSONLINE, 2019, 52 (13) :619-624

[8] AutoConf: New Algorithm for Reconfiguration of Cyber-Physical Production Systems [J].

Balzereit, Kaja ;

Niggemann, Oliver .

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (01) :739-749

[9] Remaining useful life in theory and practice [J].

Banjevic, Dragan .

METRIKA, 2009, 69 (2-3) :337-349

[10] Balancing consistency and expert judgment in AHP [J].

Benitez, J. ;

Delgado-Galvan, X. ;

Gutierrez, J. A. ;

Izquierdo, J. .

MATHEMATICAL AND COMPUTER MODELLING, 2011, 54 (7-8) :1785-1790

← 1 2 3 4 5 6 →