Self-Supervised Time Series Representation Learning via Cross Reconstruction Transformer

Cited by: 24

Authors:

Zhang, Wenrui [1,2,3]

Yang, Ling [1,2]

Geng, Shijia [4]

Hong, Shenda [1,2]

Affiliations:

[1] Peking Univ, Natl Inst Hlth Data Sci, Beijing 100871, Peoples R China

[2] Peking Univ, Hlth Sci Ctr, Inst Med Technol, Beijing 100191, Peoples R China

[3] Natl Univ Singapore, Dept Math, Singapore 119077, Singapore

[4] HeartVoice Med Technol, Hefei 230027, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024 / Vol. 35 / No. 11

Funding:

National Natural Science Foundation of China;

Keywords:

Time series analysis; Transformers; Cathode ray tubes; Task analysis; Self-supervised learning; Image reconstruction; Representation learning; Cross domain; self-supervised learning; time series; transformer; VEHICLE-ROUTING PROBLEM;

DOI:

10.1109/TNNLS.2023.3292066

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Since labeled samples are typically scarce in real-world scenarios, self-supervised representation learning in time series is critical. Existing approaches mainly employ the contrastive learning framework, which automatically learns to understand similar and dissimilar data pairs. However, they are constrained by the request for cumbersome sampling policies and prior knowledge of constructing pairs. Also, few works have focused on effectively modeling temporal-spectral correlations to improve the capacity of representations. In this article, we propose the cross reconstruction transformer (CRT) to solve the aforementioned issues. CRT achieves time series representation learning through a cross-domain dropping-reconstruction task. Specifically, we obtain the frequency domain of the time series via the fast Fourier transform (FFT) and randomly drop certain patches in both time and frequency domains. Dropping is employed to maximally preserve the global context while masking leads to the distribution shift. Then a Transformer architecture is utilized to adequately discover the cross-domain correlations between temporal and spectral information through reconstructing data in both domains, which is called Dropped Temporal-Spectral Modeling. To discriminate the representations in global latent space, we propose instance discrimination constraint (IDC) to reduce the mutual information between different time series samples and sharpen the decision boundaries. Additionally, a specified curriculum learning (CL) strategy is employed to improve the robustness during the pretraining phase, which progressively increases the dropping ratio in the training process. We conduct extensive experiments to evaluate the effectiveness of the proposed method on multiple real-world datasets. Results show that CRT consistently achieves the best performance over existing methods by 2%-9%. The code is publicly available at https://github.com/BobZwr/Cross-Reconstruction-Transformer.
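The dropping step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation (see their repository for that); it only shows the data preparation: taking the FFT of a series, splitting both domains into patches, dropping a random subset of patches, and raising the dropping ratio over training. The patch length, the use of the magnitude spectrum, the linear ratio schedule, and all function names are assumptions made for illustration. The Transformer, the reconstruction loss, and IDC are omitted.

```python
import cmath
import math
import random


def fft(x):
    """Recursive radix-2 Cooley-Tukey FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    even, odd = fft(x[0::2]), fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        tw = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + tw
        out[k + n // 2] = even[k] - tw
    return out


def to_patches(seq, patch_len):
    """Split a sequence into non-overlapping patches; a tail that does not fit is discarded."""
    n = len(seq) // patch_len
    return [seq[i * patch_len:(i + 1) * patch_len] for i in range(n)]


def drop_patches(patches, ratio, rng):
    """Remove a random fraction `ratio` of patches (rather than replacing them
    with a mask value); return the kept patches and their original indices."""
    n_keep = max(1, round(len(patches) * (1.0 - ratio)))
    keep = sorted(rng.sample(range(len(patches)), n_keep))
    return [patches[i] for i in keep], keep


def curriculum_ratio(epoch, n_epochs, start=0.1, end=0.7):
    """Hypothetical linear schedule: the dropping ratio grows from `start` to `end`."""
    t = epoch / max(1, n_epochs - 1)
    return start + t * (end - start)


rng = random.Random(0)

# Toy series: a sine wave with 4 cycles over 256 samples.
signal = [math.sin(2 * math.pi * 4 * i / 256) for i in range(256)]

# Frequency domain: magnitudes of the first half of the spectrum (assumed choice).
spectrum = [abs(c) for c in fft(signal)[:128]]

time_patches = to_patches(signal, 16)    # 16 patches of length 16
freq_patches = to_patches(spectrum, 16)  # 8 patches of length 16

ratio = curriculum_ratio(epoch=5, n_epochs=10)
kept_time, idx_time = drop_patches(time_patches, ratio, rng)
kept_freq, idx_freq = drop_patches(freq_patches, ratio, rng)

# Patches from both domains form one token sequence for a downstream encoder.
tokens = kept_time + kept_freq
```

The kept indices (`idx_time`, `idx_freq`) are returned so that a model could attach positional information to the surviving patches and know which positions to reconstruct.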


Pages: 16129-16138

Page count: 10

