Deep Regression Modeling for Imbalanced and Incomplete Time-Series Data

被引:0
|
作者
Hssayeni, Murtadha D. [1 ]
Ghoraani, Behnaz [2 ]
机构
[1] Univ Technol Baghdad, Baghdad 10066, Iraq
[2] Florida Atlantic Univ, Dept Elect Engn & Comp Sci, Boca Raton, FL 33431 USA
来源
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2024年 / 8卷 / 06期
基金
美国国家科学基金会;
关键词
Deep regression modeling; time-series data; generative adversarial networks; imbalanced and incomplete data; extrapolation; DYSKINESIA;
D O I
10.1109/TETCI.2024.3372435
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
During the collection of time-series data, many reasons lead to imbalanced and incomplete datasets. Consequently, it becomes challenging to develop deep convolutional models without suffering from overfitting. Our objective in this paper was to investigate an emerging but rather underutilized framework of Conditional Generative Adversarial Networks (cGANs) for improving deep regression models for time-series data with an imbalanced and incomplete distribution. First, we investigated the potential of using a vanilla cGAN as a data imputation to improve the generalizability of the developed models to unseen data in such datasets. Next, we proposed a modified cGAN architecture with improved extrapolation and generalizability of the regression models. Our investigations used an imbalanced synthetic non-stationary dataset, a real-world dataset in Parkinson's disease (PD) application domain, and one publicly-available dataset for Negative Affect (NA) estimation. We found that vanilla cGAN failed to generate realistic time-series data due to severe mode collapse, limiting its application as a data imputation for imbalanced and incomplete data. Importantly, the proposed cGAN framework significantly improved extrapolation and generalizability for the prediction of regression scores with an average improvement of 56%, 34%, and 18%, respectively, in mean absolute error for the synthetic, PD, and NA datasets when compared with traditional Convolutional Neural Networks. The codes are publicly available on Github.
引用
收藏
页码:3767 / 3778
页数:12
相关论文
共 50 条
  • [21] Using Property Graphs to Segment Time-Series Data
    Karetnikov, Aleksei
    Rehberger, Tobias
    Lettner, Christian
    Himmelbauer, Johannes
    Nikzad-Langerodi, Ramin
    Gsellmann, Guenter
    Nestelberger, Susanne
    Schutzeneder, Stefan
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2022 WORKSHOPS, 2022, 1633 : 416 - 423
  • [22] Parallel Dimensionality Reduction Transformation for Time-Series Data
    Hoang Chi Thanh
    2009 FIRST ASIAN CONFERENCE ON INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2009, : 104 - 108
  • [23] Controlled-Sized Clustering for Time-Series Data
    Tsuda, Nobuhiko
    Hamasuna, Yukihiro
    2020 JOINT 11TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS AND 21ST INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (SCIS-ISIS), 2020, : 245 - 249
  • [24] On Some Fuzzy Clustering Algorithms for Time-Series Data
    Fujita, Mizuki
    Kanzawa, Yuchi
    INTEGRATED UNCERTAINTY IN KNOWLEDGE MODELLING AND DECISION MAKING (IUKM 2022), 2022, 13199 : 169 - 181
  • [25] Rumor Detection on Time-Series of Tweets via Deep Learning
    Kotteti, Chandra Mouli Madhav
    Dong, Xishuang
    Qian, Lijun
    MILCOM 2019 - 2019 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM), 2019,
  • [26] Least Information Spectral GAN With Time-Series Data Augmentation for Industrial IoT
    Seon, Joonho
    Lee, Seongwoo
    Sun, Young Ghyu
    Kim, Soo Hyun
    Kim, Dong In
    Kim, Jin Young
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025, 9 (01): : 757 - 769
  • [27] Data mining of time-series medical data by formal concept analysis
    Sato, Kenji
    Okubo, Yoshiaki
    Haraguchi, Iakoto
    Kunifuji, Susumu
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS: KES 2007 - WIRN 2007, PT II, PROCEEDINGS, 2007, 4693 : 1214 - 1221
  • [28] Clustering Time-Series Gene Expression Data with Unequal Time Intervals
    Rueda, Luis
    Bari, Ataul
    Ngom, Alioune
    TRANSACTIONS ON COMPUTATIONAL SYSTEMS BIOLOGY X, 2008, 5410 : 100 - 123
  • [29] Missing Value Imputation of Time-Series Air-Quality Data via Deep Neural Networks
    Kim, Taesung
    Kim, Jinhee
    Yang, Wonho
    Lee, Hunjoo
    Choo, Jaegul
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (22)
  • [30] Mapping Time-Series Data on Process Patterns to Generate Synthetic Data
    Fonger, Frederik
    Aleknonyte-Resch, Milda
    Koschmider, Agnes
    ADVANCED INFORMATION SYSTEMS ENGINEERING WORKSHOPS, CAISE 2023 INTERNATIONAL WORKSHOPS, 2023, 482 : 50 - 61