Ensemble CNN-ViT Using Feature-Level Fusion for Gait Recognition

被引：0

作者：

Mogan, Jashila Nair ^{[1
]}

Lee, Chin Poo ^{[1
]}

Lim, Kian Ming ^{[2
]}

机构：

[1] Multimedia Univ, Fac Informat Sci & Technol, Melaka 75450, Malaysia

[2] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo 315100, Zhejiang, Peoples R China

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Feature extraction; Computational modeling; Hidden Markov models; Convolutional neural networks; Transformers; Deep learning; Biological system modeling; ensemble; fusion; feature-fusion; gait; gait recognition; IMAGE; MODEL;

D O I：

10.1109/ACCESS.2024.3439602

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Individual deep learning models showcase impressive performance; however, the capacity of a single model might fall short in capturing the full spectrum of intricate patterns present in the input data. Thus, relying solely on a single model may hamper the attainment of optimal results and broader generalization. In light of this, the paper presents an ensemble method that leverages the strengths of multiple Convolutional Neural Networks (CNNs) and Transformer models to elevate gait recognition performance. Additionally, a novel gait representation named windowed Gait Energy Image (GEI) is introduced, obtained by averaging gait frames irrespective of gait cycles. Firstly, the windowed GEI is input to the Convolutional Neural Networks and Transformer models to learn significant gait features. Each model is followed by a Multilayer Perceptron (MLP) to encode the relationship between the extracted features and corresponding class labels. Subsequently, the extracted gait features from each model are flattened and concatenated into a cohesive feature representation before passing through another MLP for subject classification. The performance of the proposed method was assessed on three datasets: OU-ISIR dataset D, CASIA-B, and OU-LP dataset. Experimental results demonstrated remarkable improvements compared to existing methods across all three datasets. The proposed method achieved accuracy rates of 100% on OU-ISIR D, 99.93% on CASIA-B, and 99.94% on OU-LP, showcasing the superior performance of the Ensemble CNN-ViT model using feature-level fusion compared to state-of-the-art methods.

引用

页码：108573 / 108583

页数：11

共 50 条

[1] Gait-CNN-ViT: Multi-Model Gait Recognition with Convolutional Neural Networks and Vision Transformer
Mogan, Jashila Nair
Lee, Chin Poo
Lim, Kian Ming
Ali, Mohammed
Alqahtani, Ali
SENSORS, 2023, 23 (08)
[2] A novel feature-level fusion scheme with multimodal attention CNN for heart sound classification
Ranipa, Kalpeshkumar
Zhu, Wei -Ping
Swamy, M. N. S.
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 248
[3] Feature-Level Fusion Recognition of Space Targets With Composite Micromotion
Zhang, Yuanpeng
Xie, Yan
Kang, Le
Li, Kaiming
Luo, Ying
Zhang, Qun
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (01) : 934 - 951
[4] Feature-level Fusion of Deep Convolutional Neural Networks for Sketch Recognition on Smartphones
Boyaci, Emel
Sert, Mustafa
2017 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2017,
[5] Palmprint identification using feature-level fusion
Kong, A
Zhang, D
Kamel, M
PATTERN RECOGNITION, 2006, 39 (03) : 478 - 487
[6] Feature-level Fusion for Depression Recognition Based on fNIRS Data
Zheng, Shuzhen
Lei, Chang
Wang, Tao
Wu, Chunyun
Sun, Jieqiong
Peng, Hong
2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2898 - 2905
[7] Feature-level Fusion for Depression Recognition Based on fNIRS Data
Zheng, Shuzhen
Lei, Chang
Wang, Tao
Wu, Chunyun
Sun, Jieqiong
Peng, Hong
2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2906 - 2913
[8] An Investigation of a Feature-Level Fusion for Noisy Speech Emotion Recognition
Sekkate, Sara
Khalil, Mohammed
Adib, Abdellah
Ben Jebara, Sofia
COMPUTERS, 2019, 8 (04)
[9] Alternative Deep Learning Architectures for Feature-Level Fusion in Human Activity Recognition
Maitre, Julien
Bouchard, Kevin
Gaboury, Sebastien
MOBILE NETWORKS & APPLICATIONS, 2021, 26 (05) : 2076 - 2086
[10] Alternative Deep Learning Architectures for Feature-Level Fusion in Human Activity Recognition
Julien Maitre
Kevin Bouchard
Sébastien Gaboury
Mobile Networks and Applications, 2021, 26 : 2076 - 2086

← 1 2 3 4 5 →