Hierarchical framework for interpretable and specialized deep reinforcement learning-based predictive maintenance

被引:6
作者
Abbas, Ammar N. [1 ,2 ]
Chasparis, Georgios C. [1 ]
Kelleher, John D. [3 ]
机构
[1] Software Competence Ctr Hagenberg, Data Sci, Softwarepk 32a, A-4232 Hagenberg, Austria
[2] Technol Univ Dublin, Dept Comp Sci, Dublin D02HW71, Ireland
[3] Maynooth Univ, ADAPT Res Ctr, Maynooth W23 A3HY, Ireland
基金
爱尔兰科学基金会;
关键词
Deep reinforcement learning; Probabilistic modeling; Input-output hidden Markov model; Predictive maintenance; Industry; 5.0; Interpretable reinforcement learning; GO;
D O I
10.1016/j.datak.2023.102240
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep reinforcement learning holds significant potential for application in industrial decision-making, offering a promising alternative to traditional physical models. However, its black-box learning approach presents challenges for real-world and safety-critical systems, as it lacks interpretability and explanations for the derived actions. Moreover, a key research question in deep reinforcement learning is how to focus policy learning on critical decisions within sparse domains. This paper introduces a novel approach that combines probabilistic modeling and reinforcement learning, providing interpretability and addressing these challenges in the context of safety-critical predictive maintenance. The methodology is activated in specific situations identified through the input-output hidden Markov model, such as critical conditions or near-failure scenarios. To mitigate the challenges associated with deep reinforcement learning in safety-critical predictive maintenance, the approach is initialized with a baseline policy using behavioral cloning, requiring minimal interactions with the environment. The effectiveness of this framework is demonstrated through a case study on predictive maintenance for turbofan engines, outperforming previous approaches and baselines, while also providing the added benefit of interpretability. Importantly, while the framework is applied to a specific use case, this paper aims to present a general methodology that can be applied to diverse predictive maintenance applications.
引用
收藏
页数:28
相关论文
共 54 条
[1]   Interpretable Input-Output Hidden Markov Model-Based Deep Reinforcement Learning for the Predictive Maintenance of Turbofan Engines [J].
Abbas, Ammar N. ;
Chasparis, Georgios C. ;
Kelleher, John D. .
BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2022, 2022, 13428 :133-148
[2]  
[Anonymous], 2003, Bayesian Filtering: from Kalman Filters to Particle Filters, and beyond, DOI [10.1080/02331880309257, DOI 10.1080/02331880309257]
[3]   Variable Compliance Control for Robotic Peg-in-Hole Assembly: A Deep-Reinforcement-Learning Approach [J].
Beltran-Hernandez, Cristian C. ;
Petit, Damien ;
Ramirez-Alpizar, Ixchel G. ;
Harada, Kensuke .
APPLIED SCIENCES-BASEL, 2020, 10 (19) :1-17
[4]  
Bengio Y., 1995, Advances in Neural Information Processing Systems 7, P427
[5]  
Bertsekas D. P., 1996, Neuro-Dynamic Programming, V1st
[6]   A RAMI 4.0 View of Predictive Maintenance: Software Architecture, Platform and Case Study in Steel Industry [J].
Bousdekis, Alexandros ;
Lepenioti, Katerina ;
Ntalaperas, Dimitrios ;
Vergeti, Danai ;
Apostolou, Dimitris ;
Boursinos, Vasilis .
ADVANCED INFORMATION SYSTEMS ENGINEERING WORKSHOPS (CAISE 2019), 2019, 349 :95-106
[7]  
Bremaud P., 2012, An Introduction to Probabilistic Modeling
[8]  
Brockman G, 2016, Arxiv, DOI arXiv:1606.01540
[9]  
Chao A., Prognostics and Diagnostics, V6, P1
[10]   Artificial intelligence in manufacturing and logistics systems: algorithms, applications, and case studies [J].
Chien, Chen-Fu ;
Dauzere-Peres, Stephane ;
Huh, Woonghee Tim ;
Jang, Young Jae ;
Morrison, James R. .
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2020, 58 (09) :2730-2731