Hierarchical framework for interpretable and specialized deep reinforcement learning-based predictive maintenance

被引:6
作者
Abbas, Ammar N. [1 ,2 ]
Chasparis, Georgios C. [1 ]
Kelleher, John D. [3 ]
机构
[1] Software Competence Ctr Hagenberg, Data Sci, Softwarepk 32a, A-4232 Hagenberg, Austria
[2] Technol Univ Dublin, Dept Comp Sci, Dublin D02HW71, Ireland
[3] Maynooth Univ, ADAPT Res Ctr, Maynooth W23 A3HY, Ireland
基金
爱尔兰科学基金会;
关键词
Deep reinforcement learning; Probabilistic modeling; Input-output hidden Markov model; Predictive maintenance; Industry; 5.0; Interpretable reinforcement learning; GO;
D O I
10.1016/j.datak.2023.102240
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep reinforcement learning holds significant potential for application in industrial decision-making, offering a promising alternative to traditional physical models. However, its black-box learning approach presents challenges for real-world and safety-critical systems, as it lacks interpretability and explanations for the derived actions. Moreover, a key research question in deep reinforcement learning is how to focus policy learning on critical decisions within sparse domains. This paper introduces a novel approach that combines probabilistic modeling and reinforcement learning, providing interpretability and addressing these challenges in the context of safety-critical predictive maintenance. The methodology is activated in specific situations identified through the input-output hidden Markov model, such as critical conditions or near-failure scenarios. To mitigate the challenges associated with deep reinforcement learning in safety-critical predictive maintenance, the approach is initialized with a baseline policy using behavioral cloning, requiring minimal interactions with the environment. The effectiveness of this framework is demonstrated through a case study on predictive maintenance for turbofan engines, outperforming previous approaches and baselines, while also providing the added benefit of interpretability. Importantly, while the framework is applied to a specific use case, this paper aims to present a general methodology that can be applied to diverse predictive maintenance applications.
引用
收藏
页数:28
相关论文
共 54 条
[51]   Machine learning in manufacturing: advantages, challenges, and applications [J].
Wuest, Thorsten ;
Weimer, Daniel ;
Irgens, Christopher ;
Thoben, Klaus-Dieter .
PRODUCTION AND MANUFACTURING RESEARCH-AN OPEN ACCESS JOURNAL, 2016, 4 (01) :23-45
[52]  
Yin M., 2017, IOHMM
[53]   Hidden Markov Model Estimation-Based Q-learning for Partially Observable Markov Decision Process [J].
Yoon, Hyung-Jin ;
Lee, Donghwan ;
Hovakimyan, Naira .
2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, :2366-2371
[54]   Dynamic maintenance model for a repairable multi-component system using deep reinforcement learning [J].
Yousefi, Nooshin ;
Tsianikas, Stamatis ;
Coit, David W. .
QUALITY ENGINEERING, 2022, 34 (01) :16-35