Augmenting time series data: An interpretable approach with metric learning and variational autoencoders

被引:0
作者
Zhang, Chunfeng [1 ]
Qin, Hao [1 ]
Zhang, Yongjun [1 ]
Jiang, Chongying [1 ]
Zhang, Di [1 ]
Deng, Wenyang [1 ]
机构
[1] South China Univ Technol, Sch Elect Power, Guangzhou 510665, Peoples R China
基金
中国国家自然科学基金;
关键词
Time series data; Data augmentation; Variational autoencoder; Metric learning; Time series decomposition; DATA AUGMENTATION;
D O I
10.1016/j.ijepes.2024.110190
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the field of time series classification, deep learning techniques have shown remarkable performance; however, their effectiveness is often compromised when confronted with challenges of insufficient data and class imbalance. To address this challenge, we propose an interpretable time series data augmentation algorithm integrating variational autoencoders (VAE) and metric learning. The core contribution of this algorithm is manifested in three aspects: First, it eliminates the heteroscedasticity and non-stationarity of the data, ensuring that the data satisfies the hypothesis of normal distribution in the potential space of the encoder, and effectively avoids the approximation error of the real data distribution; Secondly, the algorithm constructs a discriminant VAE potential space, suitable for data augmentation, with metric learning, ensuring that the hidden variable distribution accurately reflects the characteristics of the original data. Finally, this paper explores the multi-seasonal decomposition algorithm of time series to seamlessly integrate the structural features of the original time series in the generated data, thereby enhancing the interpretability of data generation. Through experimental verification on four multivariate time series data sets, including the electrical energy data set, the results demonstrate that the proposed algorithm outperforms existing methods in fidelity and prediction performance, exhibiting high stability and generalization ability, particularly in cases of limited data volume. The introduction of this algorithm not only contributes to enhancing the overall performance of time series classification models but also substantially reduces the cost of data collection and labeling, thereby demonstrating its significant value in practical applications.
引用
收藏
页数:14
相关论文
共 36 条
[1]   Bagging exponential smoothing methods using STL decomposition and Box-Cox transformation [J].
Bergmeir, Christoph ;
Hyndman, Rob J. ;
Benitez, Jose M. .
INTERNATIONAL JOURNAL OF FORECASTING, 2016, 32 (02) :303-312
[2]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[3]   TrajVAE: A Variational AutoEncoder model for trajectory generation [J].
Chen, Xinyu ;
Xu, Jiajie ;
Zhou, Rui ;
Chen, Wei ;
Fang, Junhua ;
Liu, Chengfei .
NEUROCOMPUTING, 2021, 428 :332-339
[4]   Learning hierarchical time series data augmentation invariances via contrastive supervision for human activity recognition [J].
Cheng, Dongzhou ;
Zhang, Lei ;
Bu, Can ;
Wu, Hao ;
Song, Aiguo .
KNOWLEDGE-BASED SYSTEMS, 2023, 276
[5]   Data augmentation for time series regression: Applying transformations, autoencoders and adversarial networks to electricity price forecasting [J].
Demir, Sumeyra ;
Mincev, Krystof ;
Kok, Koen ;
Paterakis, Nikolaos G. .
APPLIED ENERGY, 2021, 304
[6]   Research on edge intelligent recognition method oriented to transmission line insulator fault detection [J].
Deng, Fangming ;
Xie, Zhongxin ;
Mao, Wei ;
Li, Bing ;
Shan, Yun ;
Wei, Baoquan ;
Zeng, Han .
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2022, 139
[7]  
Desai A, 2021, Arxiv, DOI [arXiv:2111.08095, 10.48550/arXiv.2111.08095, DOI 10.48550/ARXIV.2111.08095]
[8]  
Esteban C, 2017, Arxiv, DOI arXiv:1706.02633
[9]   An Autuencoder-based Data Augmentation Strategy for Generalization Improvement of DCNNs [J].
Feng, Xiexing ;
Wu, Q. M. Jonathan ;
Yang, Yimin ;
Cao, Libo .
NEUROCOMPUTING, 2020, 402 :283-297
[10]  
Goubeaud M, 2021, 7 INT C OPT APPL ICO, V2021, P1