Doubly Structured Data Synthesis for Time-Series Energy-Use Data

被引:0
作者
Kim, Jiwoo [1 ]
Lee, Changhoon [2 ]
Jeon, Jehoon [3 ]
Choi, Jungwoong [2 ]
Kim, Joseph H. T. [1 ,3 ]
机构
[1] Yonsei Univ, Dept Stat & Data Sci, 50 Yonsei Ro, Seoul 03722, South Korea
[2] Korea Smart Grid Inst, 3F,Samwoo Bldg,32 Nonhyeon ro 86 Gil, Seoul 06223, South Korea
[3] Yonsei Univ, Dept Appl Stat, 50 Yonsei Ro, Seoul 03722, South Korea
关键词
data augmentation; energy data; energy management; electronic energy use; data privacy; synthetic data;
D O I
10.3390/s24248033
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
As the demand for efficient energy management increases, the need for extensive, high-quality energy data becomes critical. However, privacy concerns and insufficient data volume pose significant challenges. To address these issues, data synthesis techniques are employed to augment and replace real data. This paper introduces Doubly Structured Data Synthesis (DS2), a novel method to tackle privacy concerns in time-series energy-use data. DS2 synthesizes rate changes to maintain longitudinal information and uses calibration techniques to preserve the cross-sectional mean structure at each time point. Numerical analyses reveal that DS2 surpasses existing methods, such as Conditional Tabular GAN (CTGAN) and Transformer-based Time-Series Generative Adversarial Network (TTS-GAN), in capturing both time-series and cross-sectional characteristics. We evaluated our proposed method using metrics for data similarity, utility, and privacy. The results indicate that DS2 effectively retains the underlying characteristics of real datasets while ensuring adequate privacy protection. DS2 is a valuable tool for sharing and utilizing energy data, significantly enhancing energy demand prediction and management.
引用
收藏
页数:16
相关论文
共 50 条
[31]   MOG: A Background Extraction Approach For Data Augmentation of Time-series Images in Deep Learning Segmentation [J].
Borgersen, Jonas Nagell ;
Saad, Aya ;
Stahl, Annette .
FOURTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2021), 2022, 12084
[32]   A Financial Data Analysis Method Based on Time-Series Generative Adversarial Network and Decomposition Learning [J].
Wang, Wei ;
Li, Bo .
IEEE ACCESS, 2025, 13 :118354-118368
[33]   OblivTime: Oblivious and Efficient Interval Skyline Query Processing Over Encrypted Time-Series Data [J].
Ouyang, Huajie ;
Zheng, Yifeng ;
Wang, Songlei ;
Hua, Zhongyun .
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2025, 18 (03) :1602-1617
[34]   Stock market forecasting with super-high dimensional time-series data using ConvLSTM, trend sampling, and specialized data augmentation [J].
Lee, Si Woon ;
Kim, Ha Young .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 161
[35]   Data augmentation for forecasting industrial aging processes via conditional multimodal generative time-series models [J].
Bogojeski, Mihail ;
Yakut, Nataliya ;
Nedelkoski, Sasho ;
Nakajima, Shinichi ;
Mueller, Klaus-Robert .
COMPUTERS & CHEMICAL ENGINEERING, 2025, 199
[36]   A Novel Model to Generate Heterogeneous and Realistic Time-Series Data for Post-Stroke Rehabilitation Assessment [J].
Boukhennoufa, Issam ;
Jarchi, Delaram ;
Zhai, Xiaojun ;
Utti, Victor ;
Sanei, Saeid ;
Lee, Tracey K. M. ;
Jackson, Jo ;
McDonald-Maier, Klaus D. .
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 :2676-2687
[37]   Making Big Money from Small Sensors: Trading Time-Series Data under Pufferfish Privacy [J].
Niu, Chaoyue ;
Zheng, Zhenzhe ;
Tang, Shaojie ;
Gao, Xiaofeng ;
Wu, Fan .
IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2019), 2019, :568-576
[38]   GAN-based synthetic time-series data generation for improving prediction of demand for electric vehicles [J].
Chatterjee, Subhajit ;
Hazra, Debapriya ;
Byun, Yung-Cheol .
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
[39]   Evaluating discrepancies in dimensionality reduction for time-series single-cell RNA-sequencing data [J].
Hackenberg, Maren ;
Guitart, Laia Canal ;
Backofen, Rolf ;
Binder, Harald .
BRIEFINGS IN BIOINFORMATICS, 2025, 26 (03)
[40]   A Pilot Study on the Use of Generative Adversarial Networks for Data Augmentation of Time Series [J].
Morizet, Nicolas ;
Rizzato, Matteo ;
Grimbert, David ;
Luta, George .
AI, 2022, 3 (04) :789-795