Multi-objective reinforcement learning-based framework for solving selective maintenance problems in reconfigurable cyber-physical manufacturing systems

被引:11
作者
Achamrah, Fatima Ezzahra [1 ]
Attajer, Ali [2 ,3 ]
机构
[1] PSL Univ, Ctr management Sci CGS, CNRS i3 UMR9217, Mines Paris, Paris, France
[2] Univ Polytech Hauts De France, LAMIH, CNRS, UMR 8201, Valenciennes, France
[3] Univ Polytech Hauts De France, LAMIH, CNRS, UMR 8201, F-59313 Valenciennes, France
关键词
Selective maintenance; reconfigurable manufacturing systems; cyber-physical manufacturing systems; imperfect repairs; imperfect observations; multi-objective deep reinforcement learning; OPTIMIZATION;
D O I
10.1080/00207543.2023.2240433
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Unlike mass production manufacturing systems, where configurations are rarely changed after the initial design, reconfigurable cyber-physical systems (RCPMS) self-change their structures throughout missions and thus self-adjust production in response to demand requirements. Accordingly, such a paradigm requires enhancing selective maintenance strategy to optimise scheduling maintenance actions, selecting configuration layouts for capacity and product family changes, and achieving maintenance cost reduction and reliability maximisation. This paper is the first to propose a robust model for a selective maintenance problem with imperfect repairs in the RCPMS context. The model also integrates uncertainties originating from the imperfect observations of components' health status. The model's objectives are to maximise the expected reliability and minimise the variance and maintenance cost under maintenance resource constraints. Moreover, we propose a new deep reinforcement learning framework for solving the resulting multi-objective and combinatorial optimisation problem. In addition, we use decision values to enhance the scalarisation process by permitting the priorities of specific objectives to be adjusted after the learning process. Furthermore, we employ Analytical Hierarchy Process to adjust the static priorities with respect to the objective functions and the actual learning context. Finally, broad experiments are conducted to highlight the performance of the proposed model and resolution framework.
引用
收藏
页码:3460 / 3482
页数:23
相关论文
共 60 条
  • [1] An Artificial-Immune-System-Based Algorithm Enhanced with Deep Reinforcement Learning for Solving Returnable Transport Item Problems
    Achamrah, Fatima Ezzahra
    Riane, Fouad
    Sahin, Evren
    Limbourg, Sabine
    [J]. SUSTAINABILITY, 2022, 14 (10)
  • [2] Solving inventory routing with transshipment and substitution under dynamic and stochastic demands using genetic algorithm and deep reinforcement learning
    Achamrah, Fatima Ezzahra
    Riane, Fouad
    Limbourg, Sabine
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2022, 60 (20) : 6187 - 6204
  • [3] Branch-and-price algorithms for large-scale mission-oriented maintenance planning problems
    Al-Jabouri, Hamzea
    Saif, Ahmed
    Diallo, Claver
    Khatab, Abdelhakim
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2023, 153
  • [4] Selective maintenance optimization: a condensed critical review and future research directions
    Al-Jabouri, Hamzea
    Saif, Ahmed
    Khatab, Abdelhakim
    Diallo, Claver
    Venkatadri, Uday
    [J]. IFAC PAPERSONLINE, 2022, 55 (10): : 1213 - 1218
  • [5] Attajer A, 2021, P INT WORKSHOP SERVI, P520
  • [6] An analytic hierarchy process augmented with expert rules for product driven control in cyber-physical manufacturing systems
    Attajer, Ali
    Darmoul, Saber
    Chaabane, Sondes
    Sallez, Yves
    Riane, Fouad
    [J]. COMPUTERS IN INDUSTRY, 2022, 143
  • [7] Distributed Maintenance: A Literature Analysis and Classification
    Attajer, Ali
    Darmoul, Saber
    Riane, Fouad
    Bouras, Abdelghani
    [J]. IFAC PAPERSONLINE, 2019, 52 (13): : 619 - 624
  • [8] AutoConf: New Algorithm for Reconfiguration of Cyber-Physical Production Systems
    Balzereit, Kaja
    Niggemann, Oliver
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (01) : 739 - 749
  • [9] Remaining useful life in theory and practice
    Banjevic, Dragan
    [J]. METRIKA, 2009, 69 (2-3) : 337 - 349
  • [10] Balancing consistency and expert judgment in AHP
    Benitez, J.
    Delgado-Galvan, X.
    Gutierrez, J. A.
    Izquierdo, J.
    [J]. MATHEMATICAL AND COMPUTER MODELLING, 2011, 54 (7-8) : 1785 - 1790