共 16 条
Research on data augmentation algorithm for time series based on deep learning
被引:2
作者:
Liu, Shiyu
[1
,2
]
Qiao, Hongyan
[3
]
Yuan, Lianhong
[4
]
Yuan, Yuan
[5
]
Liu, Jun
[6
]
机构:
[1] Wuxi Univ, Sch Digital Econ & Management, Wuxi 214105, Peoples R China
[2] Wuxi Univ, Inst China Wuxi Cross Border Elect Commerce Compre, Wuxi 214105, Peoples R China
[3] Wuxi Yunyin Technol Grp Co LTD, Wuxi 214105, Peoples R China
[4] Hangzhou Polytech, Wuxi 214105, Peoples R China
[5] Zhejiang Guozi Robot Technol Co LTD, Wuxi 214105, Peoples R China
[6] Zhejiang Tuofeng Intelligent Equipment Co LTD, Wuxi 214105, Peoples R China
来源:
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS
|
2023年
/
17卷
/
06期
关键词:
Data Augmentation;
Time Series;
GAN;
Deep Learning;
Neural Network;
NETWORKS;
D O I:
10.3837/tiis.2023.06.002
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Data monitoring is an important foundation of modern science. In most cases, the monitoring data is time-series data, which has high application value. The deep learning algorithm has a strong nonlinear fitting capability, which enables the recognition of time series by capturing anomalous information in time series. At present, the research of time series recognition based on deep learning is especially important for data monitoring. Deep learning algorithms require a large amount of data for training. However, abnormal sample is a small sample in time series, which means the number of abnormal time series can seriously affect the accuracy of recognition algorithm because of class imbalance. In order to increase the number of abnormal sample, a data augmentation method called GANBATS (GAN-based Bi-LSTM and Attention for Time Series) is proposed. In GANBATS, Bi-LSTM is introduced to extract the timing features and then transfer features to the generator network of GANBATS.GANBATS also modifies the discriminator network by adding an attention mechanism to achieve global attention for time series. At the end of discriminator, GANBATS is adding averagepooling layer, which merges temporal features to boost the operational efficiency. In this paper, four time series datasets and five data augmentation algorithms are used for comparison experiments. The generated data are measured by PRD(Percent Root Mean Square Difference) and DTW(Dynamic Time Warping). The experimental results show that GANBATS reduces up to 26.22 in PRD metric and 9.45 in DTW metric. In addition, this paper uses different algorithms to reconstruct the datasets and compare them by classification accuracy. The classification accuracy is improved by 6.44%-12.96% on four time series datasets.
引用
收藏
页码:1530 / 1544
页数:15
相关论文