Hierarchical framework for interpretable and specialized deep reinforcement learning-based predictive maintenance

Cited by: 6

Authors:

Abbas, Ammar N. [1,2]

Chasparis, Georgios C. [1]

Kelleher, John D. [3]

Affiliations:

[1] Software Competence Ctr Hagenberg, Data Sci, Softwarepk 32a, A-4232 Hagenberg, Austria

[2] Technol Univ Dublin, Dept Comp Sci, Dublin D02HW71, Ireland

[3] Maynooth Univ, ADAPT Res Ctr, Maynooth W23 A3HY, Ireland

Source:

DATA & KNOWLEDGE ENGINEERING | 2024 / Vol. 149

Funding:

Science Foundation Ireland;

Keywords:

Deep reinforcement learning; Probabilistic modeling; Input-output hidden Markov model; Predictive maintenance; Industry 5.0; Interpretable reinforcement learning; GO;

DOI:

10.1016/j.datak.2023.102240

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Deep reinforcement learning holds significant potential for application in industrial decision-making, offering a promising alternative to traditional physical models. However, its black-box learning approach presents challenges for real-world and safety-critical systems, as it lacks interpretability and explanations for the derived actions. Moreover, a key research question in deep reinforcement learning is how to focus policy learning on critical decisions within sparse domains. This paper introduces a novel approach that combines probabilistic modeling and reinforcement learning, providing interpretability and addressing these challenges in the context of safety-critical predictive maintenance. The methodology is activated in specific situations identified through the input-output hidden Markov model, such as critical conditions or near-failure scenarios. To mitigate the challenges associated with deep reinforcement learning in safety-critical predictive maintenance, the approach is initialized with a baseline policy using behavioral cloning, requiring minimal interactions with the environment. The effectiveness of this framework is demonstrated through a case study on predictive maintenance for turbofan engines, outperforming previous approaches and baselines, while also providing the added benefit of interpretability. Importantly, while the framework is applied to a specific use case, this paper aims to present a general methodology that can be applied to diverse predictive maintenance applications.


Pages: 28

References: 54 in total

[51] Machine learning in manufacturing: advantages, challenges, and applications [J]. Wuest, Thorsten; Weimer, Daniel; Irgens, Christopher; Thoben, Klaus-Dieter. PRODUCTION AND MANUFACTURING RESEARCH-AN OPEN ACCESS JOURNAL, 2016, 4 (01): 23-45

[52] Yin M., 2017, IOHMM

[53] Hidden Markov Model Estimation-Based Q-learning for Partially Observable Markov Decision Process [J]. Yoon, Hyung-Jin; Lee, Donghwan; Hovakimyan, Naira. 2019 AMERICAN CONTROL CONFERENCE (ACC), 2019: 2366-2371

[54] Dynamic maintenance model for a repairable multi-component system using deep reinforcement learning [J]. Yousefi, Nooshin; Tsianikas, Stamatis; Coit, David W. QUALITY ENGINEERING, 2022, 34 (01): 16-35
