Towards Generating Real-World Time Series Data

被引:29
作者
Pei, Hengzhi [1 ,2 ]
Ren, Kan [2 ]
Yang, Yuqing [2 ]
Liu, Chang [3 ]
Qin, Tao [3 ]
Li, Dongsheng [2 ]
机构
[1] Univ Illinois, Urbana, IL USA
[2] Microsoft Res Asia, Shanghai, Peoples R China
[3] Microsoft Res Asia, Beijing, Peoples R China
来源
2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021) | 2021年
关键词
Time series; data generation; missing values;
D O I
10.1109/ICDM51629.2021.00058
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time series data generation has drawn increasing attention in recent years. Several generative adversarial network (GAN) based methods have been proposed to tackle the problem usually with the assumption that the targeted time series data are well-formatted and complete. However, real-world time series (RTS) data are far away from this utopia, e.g., long sequences with variable lengths and informative missing data raise intractable challenges for designing powerful generation algorithms. In this paper, we propose a novel generative framework for RTS data - RTSGAN to tackle the aforementioned challenges. RTSGAN first learns an encoder-decoder module which provides a mapping between a time series instance and a fixed-dimension latent vector and then learns a generation module to generate vectors in the same latent space. By combining the generator and the decoder, RTSGAN is able to generate RTS which respect the original feature distributions and the temporal dynamics. To generate time series with missing values, we further equip RTSGAN with an observation embedding layer and a decide-and-generate decoder to better utilize the informative missing patterns. Experiments on the four RTS datasets show that the proposed framework outperforms the previous generation methods in terms of synthetic data utility for downstream classification and prediction tasks. Our code is available at https://seqml.github.io/rtsgan.
引用
收藏
页码:469 / 478
页数:10
相关论文
共 50 条
[31]   NEAR-REAL TIME ESTIMATES OF LEAF AREA INDEX FROM AVHRR TIME SERIES DATA [J].
Kandasamy, S. ;
Verger, A. ;
Baret, F. ;
Weiss, M. ;
Buis, S. .
2012 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2012, :6475-6478
[32]   Real-world evidence for coverage decisions: opportunities and challenges [J].
Hampson, Grace ;
Towse, Adrian ;
Dreitlein, William B. ;
Henshall, Chris ;
Pearson, Steven D. .
JOURNAL OF COMPARATIVE EFFECTIVENESS RESEARCH, 2018, 7 (12) :1133-1143
[33]   Using Wearable Skin Temperature Data to Advance Tracking and Characterization of the Menstrual Cycle in a Real-World Setting [J].
Gombert-Labedens, Marie ;
Alzueta, Elisabet ;
Perez-Amparan, Evelyn ;
Yuksel, Dilara ;
Kiss, Orsolya ;
de Zambotti, Massimiliano ;
Simon, Katharine ;
Zhang, Jing ;
Shuster, Alessandra ;
Morehouse, Allison ;
Pena, Andres Alessandro ;
Mednick, Sara ;
Baker, Fiona C. .
JOURNAL OF BIOLOGICAL RHYTHMS, 2024, 39 (04) :331-350
[34]   A method for generating high resolution satellite image time series [J].
Guo, Tao .
IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XX, 2014, 9244
[35]   Generating Load Profiles Using Smart Metering Time Series [J].
Bock, Christian .
ADVANCES IN FUZZY LOGIC AND TECHNOLOGY 2017, VOL 1, 2018, 641 :211-223
[36]   An Find to Find Real Time Architecture for Analyzing and Clustering Time Series Data: Case of an Fnergy Management System [J].
Talei, Hanaa ;
Essaaidi, Mohamed ;
Benhaddou, Driss .
2018 6TH INTERNATIONAL RENEWABLE AND SUSTAINABLE ENERGY CONFERENCE (IRSEC), 2018, :1153-1159
[37]   Combined Use of SAR and Optical Time Series Data for Near Real-Time Forest Disturbance Mapping [J].
Hirschmugl, Manuela ;
Deutscher, Janik ;
Gutjahr, Karl-Heinz ;
Sobe, Carina ;
Schardt, Mathias .
2017 9TH INTERNATIONAL WORKSHOP ON THE ANALYSIS OF MULTITEMPORAL REMOTE SENSING IMAGES (MULTITEMP), 2017,
[38]   Reconstructing missing data sequences in multivariate time series: an application to environmental data [J].
Parrella, Maria Lucia ;
Albano, Giuseppina ;
La Rocca, Michele ;
Perna, Cira .
STATISTICAL METHODS AND APPLICATIONS, 2019, 28 (02) :359-383
[39]   Reconstructing missing data sequences in multivariate time series: an application to environmental data [J].
Maria Lucia Parrella ;
Giuseppina Albano ;
Michele La Rocca ;
Cira Perna .
Statistical Methods & Applications, 2019, 28 :359-383
[40]   Time trends in antithrombotic therapy prescription patterns: Real-world monocentric study in hospitalized patients with atrial fibrillation [J].
Abrignani, Maurizio Giuseppe ;
Lombardo, Alberto ;
Braschi, Annabella ;
Renda, Nicolo ;
Abrignani, Vincenzo ;
Lombardo, Renzo M. .
WORLD JOURNAL OF CARDIOLOGY, 2022, 14 (11) :576-598