Learning behavioral models by recurrent neural networks with discrete latent representations with application to a flexible industrial conveyor

被引:6
作者
Brusaferri, Alessandro [1 ,2 ]
Matteucci, Matteo [2 ]
Spinelli, Stefano [1 ,2 ]
Vitali, Andrea [1 ]
机构
[1] CNR, Inst Intelligent Ind Technol & Syst Adv Mfg, Via A Corti 12, Milan, Italy
[2] Politecn Milan, Dept Elect Informat & Bioengn, Via Ponzio 34-5, Milan, Italy
关键词
Deep learning; Recurrent neural network; Discrete representation; Finite state machine; Behavior cloning; Industrial cyber physical systems; CYBER-PHYSICAL SYSTEMS; EXTRACTION; SIMULATION;
D O I
10.1016/j.compind.2020.103263
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Recurrent neural networks (RNN) are being extensively exploited in industry to address complex predictive tasks by leveraging on the increased availability of data from processes. However, the rationale behind model response is encoded in an implicit way, which is difficult to be explained by practitioners. If revealed, such mechanisms could provide deeper insights into RNN execution, enhancing conventional performance evaluations. We propose a new approach based on the introduction of a model-based clustering layer, constraining the network to operate on a discrete latent state representation. By processing context-input conditioned transitions between clusters, a Moore Machine characterizing the RNN computations is extracted. The proposed approach is demonstrated on both synthetic experiments from an open benchmark problem and via the application to a pilot industrial plant, by the behavior cloning of the flexible conveyor of a Remanufacturing process. The finite-state RNN attains the prediction accuracy of RNN with continuous state, providing in addition a more interpretable structure. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:16
相关论文
共 41 条
[1]  
[Anonymous], 2006, Introduction to automata theory, languages, and computation
[2]   Stable behavior in a recurrent neural network for a finite state machine [J].
Arai, K ;
Nakano, R .
NEURAL NETWORKS, 2000, 13 (06) :667-680
[3]   Big Data and virtualization for manufacturing cyber-physical systems: A survey of the current status and future outlook [J].
Babiceanu, Radu F. ;
Seker, Remzi .
COMPUTERS IN INDUSTRY, 2016, 81 :128-137
[4]  
Ballarino A, 2015, SYSTEM LEVEL PERFORM, DOI [10.1109/ETFA.2014.7005350, DOI 10.1109/ETFA.2014.7005350]
[5]  
Bengio Y., 2014, P INT C LEARN REPR I
[6]  
Bishop C. M., 2006, Pattern Recognition and Machine Learning, DOI DOI 10.1117/1.2819119
[7]  
Brusaferri A., 2019, P 17 IEEE INT C IND
[8]  
Brusaferri A, 2019, IEEE INTL CONF IND I, P319, DOI [10.1109/INDIN41052.2019.8972113, 10.1109/indin41052.2019.8972113]
[9]  
Burke M., 2019, 3 INT C ROB LEARN CO
[10]   Review of digital twin applications in manufacturing [J].
Cimino, Chiara ;
Negri, Elisa ;
Fumagalli, Luca .
COMPUTERS IN INDUSTRY, 2019, 113