Research on data augmentation algorithm for time series based on deep learning

被引:2
作者
Liu, Shiyu [1 ,2 ]
Qiao, Hongyan [3 ]
Yuan, Lianhong [4 ]
Yuan, Yuan [5 ]
Liu, Jun [6 ]
机构
[1] Wuxi Univ, Sch Digital Econ & Management, Wuxi 214105, Peoples R China
[2] Wuxi Univ, Inst China Wuxi Cross Border Elect Commerce Compre, Wuxi 214105, Peoples R China
[3] Wuxi Yunyin Technol Grp Co LTD, Wuxi 214105, Peoples R China
[4] Hangzhou Polytech, Wuxi 214105, Peoples R China
[5] Zhejiang Guozi Robot Technol Co LTD, Wuxi 214105, Peoples R China
[6] Zhejiang Tuofeng Intelligent Equipment Co LTD, Wuxi 214105, Peoples R China
来源
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS | 2023年 / 17卷 / 06期
关键词
Data Augmentation; Time Series; GAN; Deep Learning; Neural Network; NETWORKS;
D O I
10.3837/tiis.2023.06.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data monitoring is an important foundation of modern science. In most cases, the monitoring data is time-series data, which has high application value. The deep learning algorithm has a strong nonlinear fitting capability, which enables the recognition of time series by capturing anomalous information in time series. At present, the research of time series recognition based on deep learning is especially important for data monitoring. Deep learning algorithms require a large amount of data for training. However, abnormal sample is a small sample in time series, which means the number of abnormal time series can seriously affect the accuracy of recognition algorithm because of class imbalance. In order to increase the number of abnormal sample, a data augmentation method called GANBATS (GAN-based Bi-LSTM and Attention for Time Series) is proposed. In GANBATS, Bi-LSTM is introduced to extract the timing features and then transfer features to the generator network of GANBATS.GANBATS also modifies the discriminator network by adding an attention mechanism to achieve global attention for time series. At the end of discriminator, GANBATS is adding averagepooling layer, which merges temporal features to boost the operational efficiency. In this paper, four time series datasets and five data augmentation algorithms are used for comparison experiments. The generated data are measured by PRD(Percent Root Mean Square Difference) and DTW(Dynamic Time Warping). The experimental results show that GANBATS reduces up to 26.22 in PRD metric and 9.45 in DTW metric. In addition, this paper uses different algorithms to reconstruct the datasets and compare them by classification accuracy. The classification accuracy is improved by 6.44%-12.96% on four time series datasets.
引用
收藏
页码:1530 / 1544
页数:15
相关论文
共 16 条
  • [1] SMOTE: Synthetic minority over-sampling technique
    Chawla, Nitesh V.
    Bowyer, Kevin W.
    Hall, Lawrence O.
    Kegelmeyer, W. Philip
    [J]. 2002, American Association for Artificial Intelligence (16)
  • [2] Generative Adversarial Networks An overview
    Creswell, Antonia
    White, Tom
    Dumoulin, Vincent
    Arulkumaran, Kai
    Sengupta, Biswa
    Bharath, Anil A.
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2018, 35 (01) : 53 - 65
  • [3] Synthetic Data generation using DCGAN for improved traffic sign recognition
    Dewi, Christine
    Chen, Rung-Ching
    Liu, Yan-Ting
    Tai, Shao-Kuo
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (24) : 21465 - 21480
  • [4] Improving Action Recognition via Temporal and Complementary Learning
    Elmadany, Nour Eldin
    He, Yifeng
    Guan, Ling
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (03)
  • [5] Deep learning for time series classification: a review
    Fawaz, Hassan Ismail
    Forestier, Germain
    Weber, Jonathan
    Idoumghar, Lhassane
    Muller, Pierre-Alain
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (04) : 917 - 963
  • [6] Improving SSH detection model using IPA time and WGAN-GP
    Lee, Junwon
    Lee, Heejo
    [J]. COMPUTERS & SECURITY, 2022, 116
  • [7] Spatio-Temporal Attention Networks for Action Recognition and Detection
    Li, Jun
    Liu, Xianglong
    Zhang, Wenxuan
    Zhang, Mingyuan
    Song, Jingkuan
    Sebe, Nicu
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (11) : 2990 - 3001
  • [8] A Review of Sensor Technologies for Perception in Automated Driving
    Marti, Enrique
    Perez, Joshue
    Angel de Miguel, Miguel
    Garcia, Fernando
    [J]. IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2019, 11 (04) : 94 - 108
  • [9] A multi-sensor data fusion enabled ensemble approach for medical data from body sensor networks
    Muzammal, Muhammad
    Talat, Romana
    Sodhro, Ali Hassan
    Pirbhulal, Sandeep
    [J]. INFORMATION FUSION, 2020, 53 : 155 - 164
  • [10] A Survey on Urban Traffic Management System Using Wireless Sensor Networks
    Nellore, Kapileswar
    Hancke, Gerhard P.
    [J]. SENSORS, 2016, 16 (02)