Data augmentation for forecasting industrial aging processes via conditional multimodal generative time-series models

被引:0
作者
Bogojeski, Mihail [1 ]
Yakut, Nataliya [2 ]
Nedelkoski, Sasho [3 ]
Nakajima, Shinichi [1 ,4 ,5 ]
Mueller, Klaus-Robert [1 ,4 ,6 ,7 ]
机构
[1] Tech Univ Berlin, Machine Learning Grp, D-10587 Berlin, Germany
[2] BASF SE, D-67056 Ludwigshafen, Germany
[3] Tech Univ Berlin, Distributed & Operating Syst Grp, D-10587 Berlin, Germany
[4] BIFOLD Berlin Inst Fdn Learning & Data, Berlin, Germany
[5] RIKEN AIP, 1-4-1 Nihonbashi,Chuo Ku, Tokyo, Japan
[6] Korea Univ, Dept Artificial Intelligence, Seoul 136713, South Korea
[7] Max Planck Inst Informat, Saarbrucken, Germany
关键词
Variational inference; Data augmentation; Industrial aging processes; Sequential data; Medical data; Generative models; BOOTSTRAP; RECURRENT;
D O I
10.1016/j.compchemeng.2025.109109
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Data augmentation has shown to be effective for improving generalization performance of deep neural networks, especially in the regime of high noise and scarce data. However, this approach has not been applied to industrial aging processes (IAP) forecasting, where observed data are multimodal time-series, and therefore existing augmentation methods are not suitable for data generation. In this paper, we propose Seq-MVAE, a generative architecture that can generate complex time-series data consisting of multiple heterogeneous modalities. Seq-MVAE is capable of conditional generation, i.e., Seq-MVAE learns the joint distribution across the modalities, and allows users to generate a part of the modalities that are coherent with the other (given) modalities. This enables not only missing value imputation but also conditional generation, which is known to be crucial for data augmentation. We evaluate the generative performance and other aspects of Seq-MVAE on an artificial dataset generated based designed to simulate an industrial aging process, and show the effectiveness of data augmentation by Seq-MVAE on a real-world dataset acquired from an industrial plant.
引用
收藏
页数:15
相关论文
共 74 条
[1]  
Aggarwal C.C, 2018, Neural Netw. Deep. Learn.: Textb., P271, DOI DOI 10.1007/978-3-319-94463-0
[2]  
Alcaraz J. M. L., 2023, Trans. Mach. Learn. Res., P1
[3]  
Antoniou A, 2018, Arxiv, DOI arXiv:1711.04340
[4]   Imputation of Missing Data with Ordinary Kriging for Enhancing Fault Detection and Diagnosis [J].
Ardakani, Mohammadhamed ;
Shokry, Ahmed ;
Saki, Ghazal ;
Escudero, Gerard ;
Graells, Moises ;
Espuna, Antonio .
26TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING (ESCAPE), PT B, 2016, 38B :1377-1382
[5]   Fault diagnosis of chemical processes with incomplete observations: A comparative study [J].
Askarian, M. ;
Escudero, G. ;
Graells, M. ;
Zarghami, R. ;
Jalali-Farahani, F. ;
Mostoufi, N. .
COMPUTERS & CHEMICAL ENGINEERING, 2016, 84 :104-116
[6]  
Balhorn L.S., 2023, Computer Aided Chemical Engineering, V52, P2011
[7]   Improving the accuracy of global forecasting models using time series data augmentation [J].
Bandara, Kasun ;
Hewamalage, Hansika ;
Liu, Yuan-Hao ;
Kang, Yanfei ;
Bergmeir, Christoph .
PATTERN RECOGNITION, 2021, 120
[8]  
Bengio Y, 2013, INT CONF ACOUST SPEE, P8624, DOI 10.1109/ICASSP.2013.6639349
[9]   Forecasting industrial aging processes with machine learning methods [J].
Bogojeski, Mihail ;
Sauer, Simeon ;
Horn, Franziska ;
Mueller, Klaus-Robert .
COMPUTERS & CHEMICAL ENGINEERING, 2021, 144 (144)
[10]  
Brophy Eoin, 2019, arXiv, DOI [DOI 10.48550/ARXIV.1902.05624, 10.48550/ARXIV.1902.05624, 10.48550/arXiv.1902.05624]