Missing data imputation and classification of small sample missing time series data based on gradient penalized adversarial multi-task learning

被引:0
作者
Jing-Jing Liu
Jie-Peng Yao
Jin-Hang Liu
Zhong-Yi Wang
Lan Huang
机构
[1] China Agricultural University,College of Information and Electrical Engineering
[2] Key Laboratory of Agricultural Information Acquisition Technology (Beijing),Institute of Automation
[3] Ministry of Agriculture,undefined
[4] Chinese Academy of Sciences,undefined
[5] Key Laboratory of Modern Precision Agricultural System Integration (Beijing),undefined
[6] Ministry of Education,undefined
来源
Applied Intelligence | 2024年 / 54卷
关键词
Missing time series data; Small samples; Imputation; Classification; Gradient penalized adversarial Multitasking;
D O I
暂无
中图分类号
学科分类号
摘要
In practice, time series data obtained is usually small and missing, which poses a great challenge to data analysis in different domains, such as increasing the bias of model predictions, reducing the accuracy of model classification, and affecting the analysis data. This paper aims to address the problem of missing data imputation and classification of small sample time series data. By exploring and implementing efficient data interpolation strategies to improve classification accuracy, the robustness and accuracy of classification models in the face of incomplete data. To achieve this, we propose a new model that can effectively classify time series data with missing values. Our model utilizes a bi-directional long short-term memory network combined with an extreme learning machine for the imputation task, which can recover the missing time series values. For the classification task, we employ a self-attentional Inception Time network, which is regularized by a classification loss to effectively mitigate network overfitting. To improve the performance of the model on small sample time series datasets, we use a gradient penalty adversarial training approach. Our model integrates the advantages of multiple network modules, the gradient penalty adversarial multi-task model achieves optimal imputation and robust classification of missing small sample time series data. To evaluate the overall performance of our model, we selected forty datasets from the UCR time series datasets, and selected the German emotional speech datasets and the EEG epilepsy datasets, with the plant electrical signal datasets obtained from real measurements. A series of experiments were conducted to evaluate the effectiveness of our method compared to other methods, the datasets were set up with multiple missing rates, with root mean square error and coefficient of determination to assess the accuracy of imputation, and with accuracy to assess the performance of the classification task. The results show that our proposed method outperforms existing methods in terms of imputation accuracy and classification performance. To better understand the deep learning model, we used the Grad-CAM +  + method to enhance the reliability and credibility of the model by visualizing the important features of the temporal data when the plant electrical signal datasets was tested. In summary, this paper presents a model framework for the imputation and classification of missing small sample time series data, and the experimental results show that our model provides an effective solution for dealing with the analysis of missing small sample time series data.
引用
收藏
页码:2528 / 2550
页数:22
相关论文
共 238 条
[1]  
Afrin T(2022)A Long Short-Term Memory-based correlated traffic data prediction framework Knowl-Based Syst 237 107755-404
[2]  
Yodo N(2022)Short-term traffic flow prediction based on a hybrid optimization algorithm Appl Math Model 102 385-285
[3]  
Yan H(2019)CNNpred: CNN-based stock market prediction using a diverse set of variables Expert Syst Appl 129 273-62
[4]  
Zhang TA(2022)S_I_LSTM: stock price prediction based on multiple data sources and sentiment analysis Connect Sci 34 44-243
[5]  
Qi Y(2022)A generalized Rényi divergence for multi-source information fusion with its application in EEG data analysis Inf Sci 605 225-748
[6]  
Yu D-J(2022)Conditional generative adversarial networks applied to EEG data can inform about the inter-relation of antagonistic behaviors on a neural level Commun Biol 5 148-10683
[7]  
Hoseinzade E(2020)Internet of things for smart farming and frost intelligent control in greenhouses Comput Electron Agric 176 105614-316
[8]  
Haratizadeh S(2022)Impact of duration and missing data on the long-term photovoltaic degradation rate estimation Renew Energy 181 738-178
[9]  
Wu S(2022)Well performance prediction based on Long Short-Term Memory (LSTM) neural network J Petrol Sci Eng 208 109686-10141
[10]  
Liu Y(2023)Data Augmentation techniques in time series domain: a survey and taxonomy Neural Comput Appl 218 108261-72