Hierarchical framework for interpretable and specialized deep reinforcement learning-based predictive maintenance

被引：6

作者：

Abbas, Ammar N. ^{[1
,2
]}

Chasparis, Georgios C. ^{[1
]}

Kelleher, John D. ^{[3
]}

机构：

[1] Software Competence Ctr Hagenberg, Data Sci, Softwarepk 32a, A-4232 Hagenberg, Austria

[2] Technol Univ Dublin, Dept Comp Sci, Dublin D02HW71, Ireland

[3] Maynooth Univ, ADAPT Res Ctr, Maynooth W23 A3HY, Ireland

来源：

DATA & KNOWLEDGE ENGINEERING | 2024年 / 149卷

基金：

爱尔兰科学基金会;

关键词：

Deep reinforcement learning; Probabilistic modeling; Input-output hidden Markov model; Predictive maintenance; Industry; 5.0; Interpretable reinforcement learning; GO;

D O I：

10.1016/j.datak.2023.102240

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep reinforcement learning holds significant potential for application in industrial decision-making, offering a promising alternative to traditional physical models. However, its black-box learning approach presents challenges for real-world and safety-critical systems, as it lacks interpretability and explanations for the derived actions. Moreover, a key research question in deep reinforcement learning is how to focus policy learning on critical decisions within sparse domains. This paper introduces a novel approach that combines probabilistic modeling and reinforcement learning, providing interpretability and addressing these challenges in the context of safety-critical predictive maintenance. The methodology is activated in specific situations identified through the input-output hidden Markov model, such as critical conditions or near-failure scenarios. To mitigate the challenges associated with deep reinforcement learning in safety-critical predictive maintenance, the approach is initialized with a baseline policy using behavioral cloning, requiring minimal interactions with the environment. The effectiveness of this framework is demonstrated through a case study on predictive maintenance for turbofan engines, outperforming previous approaches and baselines, while also providing the added benefit of interpretability. Importantly, while the framework is applied to a specific use case, this paper aims to present a general methodology that can be applied to diverse predictive maintenance applications.

引用

页数：28

共 54 条

[1] Interpretable Input-Output Hidden Markov Model-Based Deep Reinforcement Learning for the Predictive Maintenance of Turbofan Engines [J].

Abbas, Ammar N. ;

Chasparis, Georgios C. ;

Kelleher, John D. .

BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2022, 2022, 13428 :133-148

[2]

[Anonymous], 2003, Bayesian Filtering: from Kalman Filters to Particle Filters, and beyond, DOI [10.1080/02331880309257, DOI 10.1080/02331880309257]

[3] Variable Compliance Control for Robotic Peg-in-Hole Assembly: A Deep-Reinforcement-Learning Approach [J].

Beltran-Hernandez, Cristian C. ;

Petit, Damien ;

Ramirez-Alpizar, Ixchel G. ;

Harada, Kensuke .

APPLIED SCIENCES-BASEL, 2020, 10 (19) :1-17

[4]

Bengio Y., 1995, Advances in Neural Information Processing Systems 7, P427

[5]

Bertsekas D. P., 1996, Neuro-Dynamic Programming, V1st

[6] A RAMI 4.0 View of Predictive Maintenance: Software Architecture, Platform and Case Study in Steel Industry [J].

Bousdekis, Alexandros ;

Lepenioti, Katerina ;

Ntalaperas, Dimitrios ;

Vergeti, Danai ;

Apostolou, Dimitris ;

Boursinos, Vasilis .

ADVANCED INFORMATION SYSTEMS ENGINEERING WORKSHOPS (CAISE 2019), 2019, 349 :95-106

[7]

Bremaud P., 2012, An Introduction to Probabilistic Modeling

[8]

Brockman G, 2016, Arxiv, DOI arXiv:1606.01540

[9]

Chao A., Prognostics and Diagnostics, V6, P1

[10] Artificial intelligence in manufacturing and logistics systems: algorithms, applications, and case studies [J].

Chien, Chen-Fu ;

Dauzere-Peres, Stephane ;

Huh, Woonghee Tim ;

Jang, Young Jae ;

Morrison, James R. .

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2020, 58 (09) :2730-2731

← 1 2 3 4 5 6 →