Multi-Attn BLS: Multi-head attention mechanism with broad learning system for chaotic time series prediction

Cited by: 34
Authors
Su, Liyun [1]
Xiong, Lang [1]
Yang, Jialing [1]
Affiliations
[1] Chongqing Univ Technol, Sch Sci, Chongqing 40054, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Broad learning system (BLS); Multi-head attention mechanism; Chaotic time series; Prediction;
DOI
10.1016/j.asoc.2022.109831
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The observational 1-D signals available for realizing the highly accurate intrinsic attractor fitting of deep learning network approaches are often insufficient because of the complexity and nonlinearity of chaotic time series. Unlike deep models, a broad learning system (BLS) with the attention mechanism exhibits a unique and preeminent pattern prediction ability. Thus, this system has been applied as a practical trend in many fields. However, the application of multi-head attention fused manifold broad learning architecture to chaotic time series prediction remains inadequate. Thus, a multi-head attentional BLS (Multi-Attn BLS) for chaotic time series prediction is proposed in this study to improve the prediction accuracy of chaotic time series further. Our model develops a novel framework that combines the high computational efficiency of broad learning with the multi-head attention mechanism. First, the received data are reconstructed into fixed-size tuples. The multidimensional arrays with embedding dimensions and time delay are used as the input to a broad learning network. Subsequently, a robust BLS with a spatiotemporal multi-head attention mechanism is developed to depict the internal dynamic evolution. The Multi-Attn BLS model can capture key spatiotemporal feature information and achieve high predictive performance. It also has a good generalization ability in practical nonlinear complex systems. Comparative experiments with the traditional long short-term memory (LSTM) network and the primitive BLS show that its computing speed and generalization ability are improved. Furthermore, the network is good at capturing the spatiotemporal features of the sequence because of the multi-head attention mechanism. The experimental results show that our model outperforms BLS, ridge regression, and LSTM on the four main evaluation indicators (root mean square error, root mean square percentage error, mean absolute error, and mean absolute percentage error) in predicting classical systems (Lorenz and Rossler systems). Moreover, the model has an excellent prediction effect in the real-world chaotic system of sea clutter. (C) 2022 Elsevier B.V. All rights reserved.
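The abstract mentions reconstructing the 1-D signal into fixed-size tuples using an embedding dimension and a time delay, and scoring predictions with four error indicators. The following is a minimal illustrative sketch of those two steps, not the authors' implementation. The function names `delay_embed` and `error_metrics`, the forward-delay convention, and the choice to report percentage errors as fractions are all assumptions; the paper's exact definitions may differ.

```python
import math


def delay_embed(series, dim, tau):
    """Build delay vectors [x(t), x(t+tau), ..., x(t+(dim-1)*tau)].

    Assumed convention: forward delays. `dim` is the embedding
    dimension and `tau` the time delay, both positive integers.
    """
    if dim < 1 or tau < 1:
        raise ValueError("dim and tau must be positive integers")
    n_vectors = len(series) - (dim - 1) * tau
    if n_vectors < 1:
        raise ValueError("series too short for this dim and tau")
    return [[series[t + k * tau] for k in range(dim)] for t in range(n_vectors)]


def error_metrics(y_true, y_pred):
    """Return RMSE, RMSPE, MAE and MAPE under common textbook definitions.

    Assumption: percentage errors are returned as fractions (not x100),
    and every value in y_true is nonzero.
    """
    n = len(y_true)
    errs = [p - t for t, p in zip(y_true, y_pred)]
    rel = [e / t for e, t in zip(errs, y_true)]
    return {
        "rmse": math.sqrt(sum(e * e for e in errs) / n),
        "rmspe": math.sqrt(sum(r * r for r in rel) / n),
        "mae": sum(abs(e) for e in errs) / n,
        "mape": sum(abs(r) for r in rel) / n,
    }


# Toy data, purely illustrative (not from the paper).
vectors = delay_embed([1, 2, 3, 4, 5, 6], dim=3, tau=2)
scores = error_metrics([1.0, 2.0, 4.0], [1.0, 1.0, 5.0])
print(vectors)  # [[1, 3, 5], [2, 4, 6]]
print(scores)
```

In a pipeline of the kind the abstract describes, each delay vector would serve as one input row to the learning network, with a later sample of the series as its prediction target.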
Pages: 11