Doubly Structured Data Synthesis for Time-Series Energy-Use Data

被引:0
作者
Kim, Jiwoo [1 ]
Lee, Changhoon [2 ]
Jeon, Jehoon [3 ]
Choi, Jungwoong [2 ]
Kim, Joseph H. T. [1 ,3 ]
机构
[1] Department of Statistics and Data Science, Yonsei University, 50, Yonsei-ro, Seodaemun-gu, Seoul
[2] Korea Smart Grid Institute, 3F, Samwoo Bldg, 32, Nonhyeon-ro 86 gil, Gangnam-gu, Seoul
[3] Department of Applied Statistics, Yonsei University, 50, Yonsei-ro, Seodaemun-gu, Seoul
关键词
data augmentation; data privacy; electronic energy use; energy data; energy management; synthetic data;
D O I
10.3390/s24248033
中图分类号
学科分类号
摘要
As the demand for efficient energy management increases, the need for extensive, high-quality energy data becomes critical. However, privacy concerns and insufficient data volume pose significant challenges. To address these issues, data synthesis techniques are employed to augment and replace real data. This paper introduces Doubly Structured Data Synthesis ((Formula presented.)), a novel method to tackle privacy concerns in time-series energy-use data. (Formula presented.) synthesizes rate changes to maintain longitudinal information and uses calibration techniques to preserve the cross-sectional mean structure at each time point. Numerical analyses reveal that (Formula presented.) surpasses existing methods, such as Conditional Tabular GAN (CTGAN) and Transformer-based Time-Series Generative Adversarial Network (TTS-GAN), in capturing both time-series and cross-sectional characteristics. We evaluated our proposed method using metrics for data similarity, utility, and privacy. The results indicate that (Formula presented.) effectively retains the underlying characteristics of real datasets while ensuring adequate privacy protection. (Formula presented.) is a valuable tool for sharing and utilizing energy data, significantly enhancing energy demand prediction and management. © 2024 by the authors.
引用
收藏
相关论文
共 50 条
  • [21] A Symmetric Block Resampling Method to Generate Energy Time Series Data
    Kimbrough, Steven O.
    Yilmaz, Hasan Uemitcan
    2021 22ND IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2021, : 546 - 551
  • [22] Pointwise vortex-induced vibration detection: Learning from synthetic time-series data
    Lee, Sunho
    Kim, Sunjoong
    ENGINEERING STRUCTURES, 2025, 326
  • [23] Dimensional Expansion and Time-Series Data Augmentation Policy for Skeleton-Based Pose Estimation
    Park, Sung-Soo
    Kwon, Hye-Jeong
    Baek, Ji-Won
    Chung, Kyungyong
    IEEE ACCESS, 2022, 10 : 112261 - 112272
  • [24] TS-GAN: Time-series GAN for Sensor-based Health Data Augmentation
    Yang Z.
    Li Y.
    Zhou G.
    ACM Transactions on Computing for Healthcare, 2023, 4 (02):
  • [25] MOG: A Background Extraction Approach For Data Augmentation of Time-series Images in Deep Learning Segmentation
    Borgersen, Jonas Nagell
    Saad, Aya
    Stahl, Annette
    FOURTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2021), 2022, 12084
  • [26] Stock market forecasting with super-high dimensional time-series data using ConvLSTM, trend sampling, and specialized data augmentation
    Lee, Si Woon
    Kim, Ha Young
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 161
  • [27] A Novel Model to Generate Heterogeneous and Realistic Time-Series Data for Post-Stroke Rehabilitation Assessment
    Boukhennoufa, Issam
    Jarchi, Delaram
    Zhai, Xiaojun
    Utti, Victor
    Sanei, Saeid
    Lee, Tracey K. M.
    Jackson, Jo
    McDonald-Maier, Klaus D.
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 2676 - 2687
  • [28] Making Big Money from Small Sensors: Trading Time-Series Data under Pufferfish Privacy
    Niu, Chaoyue
    Zheng, Zhenzhe
    Tang, Shaojie
    Gao, Xiaofeng
    Wu, Fan
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2019), 2019, : 568 - 576
  • [29] GAN-based synthetic time-series data generation for improving prediction of demand for electric vehicles
    Chatterjee, Subhajit
    Hazra, Debapriya
    Byun, Yung-Cheol
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
  • [30] A Pilot Study on the Use of Generative Adversarial Networks for Data Augmentation of Time Series
    Morizet, Nicolas
    Rizzato, Matteo
    Grimbert, David
    Luta, George
    AI, 2022, 3 (04) : 789 - 795