Fusion k-means clustering and multi-head self-attention mechanism for a multivariate time prediction model with feature selection

被引：0

作者：

Cai, Mingwei ^{[1
]}

Zhan, Jianming ^{[1
]}

Zhang, Chao ^{[2
]}

Liu, Qi ^{[1
]}

机构：

[1] Hubei Minzu Univ, Sch Math & Stat, Enshi 445000, Hubei, Peoples R China

[2] Shanxi Univ, Sch Comp & Informat Technol, Key Lab Computat Intelligence & Chinese Informat P, Taiyuan 030006, Shanxi, Peoples R China

来源：

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS | 2024年

关键词：

<italic>k</italic>-means clustering; Multi-head self-attention mechanism; Feature Selection; LSTM; TRANSFORMER;

D O I：

10.1007/s13042-024-02490-z

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As the demand for precise predictions grows across various industries due to advancements in sensor technology and computer hardware, multi-feature time series prediction shows significant promise in fields such as information fusion, finance, energy, and meteorology. However, traditional machine learning methods often struggle to forecast future events given the increasing complexity of the data. To address this challenge, the paper introduces an innovative approach that combines an improved k-means clustering with a multi-head self-attention mechanism. This method utilizes long and short-term memory (LSTM) neural networks to filter and identify the most effective feature subset for prediction. In the enhanced k-means clustering algorithm, a novel similarity formula named Feature Vector Similarity (FVS) and a method for automatically determining the number of clustering centers are proposed. This advancement improves the rationality of cluster center selection and enhances overall clustering performance. The multi-head self-attention mechanism calculates the clustering centers and attention weights of objects within the cluster partitions, optimizing feature selection and enhancing computational efficiency. The fusion of k-means clustering, the multi-head self-attention mechanism, and LSTM networks results in a new feature selection method, referred to as KMAL. To further refine the prediction process, we integrate KMAL with LSTM, known for its strong performance in predicting long-term time series, to develop a novel prediction model: KMAL-LSTM. In the subsequent comparative experiments, the prediction performance of the models is assessed using mean absolute error (MAE), mean bias error (MBE), and root mean square error (RMSE). The proposed KMAL-LSTM model consistently exhibits superior validity, stability, and performance when compared to seven other prediction models across six distinct datasets.

引用

页数：19

共 50 条

[1] A hybrid Harris Hawks optimization algorithm with simulated annealing for feature selection
Abdel-Basset, Mohamed
Ding, Weiping
El-Shahat, Doaa
[J]. ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (01) : 593 - 637
[2] Improved EMD-Based Complex Prediction Model for Wind Power Forecasting
Abedinia, Oveis
Lotfi, Mohamed
Bagheri, Mehdi
Sobhani, Behrouz
Shafie-khah, Miadreza
Catalao, Joao P. S.
[J]. IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2020, 11 (04) : 2790 - 2802
[3] MixMamba: Time series modeling with adaptive expertise
Alkilane, Khaled
He, Yihang
Lee, Der-Horng
[J]. INFORMATION FUSION, 2024, 112
[4] Improved fault detection based on kernel PCA for monitoring industrial applications
Attouri, Khadija
Mansouri, Majdi
Hajji, Mansour
Kouadri, Abdelmalek
Bensmail, Abderrazak
Bouzrara, Kais
Nounou, Hazem
[J]. JOURNAL OF PROCESS CONTROL, 2024, 133
[5] CNN based feature extraction and classification for sign language
Barbhuiya, Abul Abbas
Karsh, Ram Kumar
Jain, Rahul
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (02) : 3051 - 3069
[6] Healthcare diagnostics with an adaptive deep learning model integrated with the Internet of medical Things (IoMT) for predicting heart disease
Baseer, K. K.
Sivakumar, K.
Veeraiah, Duggineni
Chhabra, Gunjan
Lakineni, Prasanna Kumar
Pasha, M. Jahir
Gandikota, Ramu
Harikrishnan, Gopakumar
[J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 92
[7] A review on semi-supervised clustering
Cai, Jianghui
Hao, Jing
Yang, Haifeng
Zhao, Xujun
Yang, Yuqing
[J]. INFORMATION SCIENCES, 2023, 632 : 164 - 200
[8] Fast density estimation for density-based clustering methods
Cheng, Difei
Xu, Ruihang
Zhang, Bo
Jin, Ruinan
[J]. NEUROCOMPUTING, 2023, 532 : 170 - 182
[9] Revisiting linear regression to test agreement in continuous predicted-observed datasets
Correndo, Adrian A.
Hefley, Trevor J.
Holzworth, Dean P.
Ciampitti, Ignacio A.
[J]. AGRICULTURAL SYSTEMS, 2021, 192
[10] Deep CNN based binary hash video representations for face retrieval
Dong, Zhen
Jing, Chenchen
Pei, Mingtao
Jia, Yunde
[J]. PATTERN RECOGNITION, 2018, 81 : 357 - 369

← 1 2 3 4 5 →