Deep Regression Modeling for Imbalanced and Incomplete Time-Series Data

Authors
Hssayeni, Murtadha D. [1]
Ghoraani, Behnaz [2]
Affiliations
[1] Univ Technol Baghdad, Baghdad 10066, Iraq
[2] Florida Atlantic Univ, Dept Elect Engn & Comp Sci, Boca Raton, FL 33431 USA
Source
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2024 / Vol. 8 / No. 6
Funding
National Science Foundation (USA);
Keywords
Deep regression modeling; time-series data; generative adversarial networks; imbalanced and incomplete data; extrapolation; DYSKINESIA;
DOI
10.1109/TETCI.2024.3372435
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
During the collection of time-series data, many factors can produce imbalanced and incomplete datasets. Consequently, it becomes challenging to develop deep convolutional models without overfitting. Our objective in this paper was to investigate an emerging but rather underutilized framework, Conditional Generative Adversarial Networks (cGANs), for improving deep regression models for time-series data with an imbalanced and incomplete distribution. First, we investigated the potential of using a vanilla cGAN for data imputation to improve the generalizability of the developed models to unseen data in such datasets. Next, we proposed a modified cGAN architecture with improved extrapolation and generalizability of the regression models. Our investigations used an imbalanced synthetic non-stationary dataset, a real-world dataset in the Parkinson's disease (PD) application domain, and one publicly available dataset for Negative Affect (NA) estimation. We found that the vanilla cGAN failed to generate realistic time-series data due to severe mode collapse, limiting its use for data imputation on imbalanced and incomplete data. Importantly, the proposed cGAN framework significantly improved extrapolation and generalizability for the prediction of regression scores, with average improvements in mean absolute error of 56%, 34%, and 18% for the synthetic, PD, and NA datasets, respectively, when compared with traditional Convolutional Neural Networks. The code is publicly available on GitHub.
Pages: 3767-3778
Number of pages: 12