Fusion k-means clustering and multi-head self-attention mechanism for a multivariate time prediction model with feature selection

被引:0
作者
Cai, Mingwei [1 ]
Zhan, Jianming [1 ]
Zhang, Chao [2 ]
Liu, Qi [1 ]
机构
[1] Hubei Minzu Univ, Sch Math & Stat, Enshi 445000, Hubei, Peoples R China
[2] Shanxi Univ, Sch Comp & Informat Technol, Key Lab Computat Intelligence & Chinese Informat P, Taiyuan 030006, Shanxi, Peoples R China
关键词
<italic>k</italic>-means clustering; Multi-head self-attention mechanism; Feature Selection; LSTM; TRANSFORMER;
D O I
10.1007/s13042-024-02490-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the demand for precise predictions grows across various industries due to advancements in sensor technology and computer hardware, multi-feature time series prediction shows significant promise in fields such as information fusion, finance, energy, and meteorology. However, traditional machine learning methods often struggle to forecast future events given the increasing complexity of the data. To address this challenge, the paper introduces an innovative approach that combines an improved k-means clustering with a multi-head self-attention mechanism. This method utilizes long and short-term memory (LSTM) neural networks to filter and identify the most effective feature subset for prediction. In the enhanced k-means clustering algorithm, a novel similarity formula named Feature Vector Similarity (FVS) and a method for automatically determining the number of clustering centers are proposed. This advancement improves the rationality of cluster center selection and enhances overall clustering performance. The multi-head self-attention mechanism calculates the clustering centers and attention weights of objects within the cluster partitions, optimizing feature selection and enhancing computational efficiency. The fusion of k-means clustering, the multi-head self-attention mechanism, and LSTM networks results in a new feature selection method, referred to as KMAL. To further refine the prediction process, we integrate KMAL with LSTM, known for its strong performance in predicting long-term time series, to develop a novel prediction model: KMAL-LSTM. In the subsequent comparative experiments, the prediction performance of the models is assessed using mean absolute error (MAE), mean bias error (MBE), and root mean square error (RMSE). The proposed KMAL-LSTM model consistently exhibits superior validity, stability, and performance when compared to seven other prediction models across six distinct datasets.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] A hybrid Harris Hawks optimization algorithm with simulated annealing for feature selection
    Abdel-Basset, Mohamed
    Ding, Weiping
    El-Shahat, Doaa
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (01) : 593 - 637
  • [2] Improved EMD-Based Complex Prediction Model for Wind Power Forecasting
    Abedinia, Oveis
    Lotfi, Mohamed
    Bagheri, Mehdi
    Sobhani, Behrouz
    Shafie-khah, Miadreza
    Catalao, Joao P. S.
    [J]. IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2020, 11 (04) : 2790 - 2802
  • [3] MixMamba: Time series modeling with adaptive expertise
    Alkilane, Khaled
    He, Yihang
    Lee, Der-Horng
    [J]. INFORMATION FUSION, 2024, 112
  • [4] Improved fault detection based on kernel PCA for monitoring industrial applications
    Attouri, Khadija
    Mansouri, Majdi
    Hajji, Mansour
    Kouadri, Abdelmalek
    Bensmail, Abderrazak
    Bouzrara, Kais
    Nounou, Hazem
    [J]. JOURNAL OF PROCESS CONTROL, 2024, 133
  • [5] CNN based feature extraction and classification for sign language
    Barbhuiya, Abul Abbas
    Karsh, Ram Kumar
    Jain, Rahul
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (02) : 3051 - 3069
  • [6] Healthcare diagnostics with an adaptive deep learning model integrated with the Internet of medical Things (IoMT) for predicting heart disease
    Baseer, K. K.
    Sivakumar, K.
    Veeraiah, Duggineni
    Chhabra, Gunjan
    Lakineni, Prasanna Kumar
    Pasha, M. Jahir
    Gandikota, Ramu
    Harikrishnan, Gopakumar
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 92
  • [7] A review on semi-supervised clustering
    Cai, Jianghui
    Hao, Jing
    Yang, Haifeng
    Zhao, Xujun
    Yang, Yuqing
    [J]. INFORMATION SCIENCES, 2023, 632 : 164 - 200
  • [8] Fast density estimation for density-based clustering methods
    Cheng, Difei
    Xu, Ruihang
    Zhang, Bo
    Jin, Ruinan
    [J]. NEUROCOMPUTING, 2023, 532 : 170 - 182
  • [9] Revisiting linear regression to test agreement in continuous predicted-observed datasets
    Correndo, Adrian A.
    Hefley, Trevor J.
    Holzworth, Dean P.
    Ciampitti, Ignacio A.
    [J]. AGRICULTURAL SYSTEMS, 2021, 192
  • [10] Deep CNN based binary hash video representations for face retrieval
    Dong, Zhen
    Jing, Chenchen
    Pei, Mingtao
    Jia, Yunde
    [J]. PATTERN RECOGNITION, 2018, 81 : 357 - 369