Deep Regression Modeling for Imbalanced and Incomplete Time-Series Data

Cited by: 0
Authors
Hssayeni, Murtadha D. [1 ]
Ghoraani, Behnaz [2 ]
Affiliations
[1] Univ Technol Baghdad, Baghdad 10066, Iraq
[2] Florida Atlantic Univ, Dept Elect Engn & Comp Sci, Boca Raton, FL 33431 USA
Source
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2024 / Vol. 8 / Issue 06
Funding
National Science Foundation (USA);
Keywords
Deep regression modeling; time-series data; generative adversarial networks; imbalanced and incomplete data; extrapolation; DYSKINESIA;
DOI
10.1109/TETCI.2024.3372435
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
During the collection of time-series data, many factors can lead to imbalanced and incomplete datasets. Consequently, it becomes challenging to develop deep convolutional models without suffering from overfitting. Our objective in this paper was to investigate an emerging but rather underutilized framework, Conditional Generative Adversarial Networks (cGANs), for improving deep regression models for time-series data with an imbalanced and incomplete distribution. First, we investigated the potential of using a vanilla cGAN as a data imputation method to improve the generalizability of the developed models to unseen data in such datasets. Next, we proposed a modified cGAN architecture with improved extrapolation and generalizability of the regression models. Our investigations used an imbalanced synthetic non-stationary dataset, a real-world dataset in the Parkinson's disease (PD) application domain, and one publicly available dataset for Negative Affect (NA) estimation. We found that the vanilla cGAN failed to generate realistic time-series data due to severe mode collapse, limiting its application as a data imputation method for imbalanced and incomplete data. Importantly, the proposed cGAN framework significantly improved extrapolation and generalizability for the prediction of regression scores, with average improvements in mean absolute error of 56%, 34%, and 18% for the synthetic, PD, and NA datasets, respectively, when compared with traditional Convolutional Neural Networks. The code is publicly available on GitHub.
Pages: 3767 - 3778
Page count: 12
Related papers
50 in total
  • [1] Time-Series Data Regression Modeling Method for Efficient Operation of Virtual Environments
    Takahashi, Yuriko
    Suzuki, Shigeto
    Yamamoto, Takuji
    Fukuda, Hiroyuki
    Oguchi, Masato
    PROCEEDINGS OF THE 2021 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2021), 2021,
  • [2] Deep Time-Series Clustering: A Review
    Alqahtani, Ali
    Ali, Mohammed
    Xie, Xianghua
    Jones, Mark W.
    ELECTRONICS, 2021, 10 (23)
  • [3] Stochastic dynamic modeling of short gene expression time-series data
    Wang, Zidong
    Yang, Fuwen
    Ho, Daniel W. C.
    Swift, Stephen
    Tucker, Allan
    Liu, Xiaohui
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2008, 7 (01) : 44 - 55
  • [4] Representation and analysis of time-series data via deep embedding and visual exploration
    Yixuan Zhou
    Runfeng Jiang
    Hongxing Qin
    Haibo Hu
    Journal of Visualization, 2023, 26 : 593 - 610
  • [5] Representation and analysis of time-series data via deep embedding and visual exploration
    Zhou, Yixuan
    Jiang, Runfeng
    Qin, Hongxing
    Hu, Haibo
    JOURNAL OF VISUALIZATION, 2023, 26 (03) : 593 - 610
  • [6] Modeling time series data with deep Fourier neural networks
    Gashler, Michael S.
    Ashmore, Stephen C.
    NEUROCOMPUTING, 2016, 188 : 3 - 11
  • [7] Topological Data Analysis of Time-Series as an Input Embedding for Deep Learning Models
    Byers, Morgan
    Hinkle, Lee B.
    Metsis, Vangelis
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2022, PART II, 2022, 647 : 402 - 413
  • [8] Illustrating Changes in Time-Series Data With Data Video
    Lu, Junhua
    Wang, Jie
    Ye, Hui
    Gu, Yuhui
    Ding, Zhiyu
    Xu, Mingliang
    Chen, Wei
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2020, 40 (02) : 18 - 31
  • [9] Analysis techniques for microarray time-series data
    Filkov, V
    Skiena, S
    Zhi, JZ
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (02) : 317 - 330
  • [10] Time-series data dynamic density clustering
    Chen, Hao
    Xia, Yu
    Pan, Yuekai
    Yang, Qing
    INTELLIGENT DATA ANALYSIS, 2021, 25 (06) : 1487 - 1506