Ensemble CNN-ViT Using Feature-Level Fusion for Gait Recognition

Cited by: 0

Authors:

Mogan, Jashila Nair [1]

Lee, Chin Poo [1]

Lim, Kian Ming [2]

Affiliations:

[1] Multimedia Univ, Fac Informat Sci & Technol, Melaka 75450, Malaysia

[2] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo 315100, Zhejiang, Peoples R China

Source:

IEEE ACCESS | 2024 / Vol. 12

Keywords:

Feature extraction; Computational modeling; Hidden Markov models; Convolutional neural networks; Transformers; Deep learning; Biological system modeling; ensemble; fusion; feature-fusion; gait; gait recognition; IMAGE; MODEL;

DOI:

10.1109/ACCESS.2024.3439602

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Individual deep learning models showcase impressive performance; however, the capacity of a single model might fall short in capturing the full spectrum of intricate patterns present in the input data. Thus, relying solely on a single model may hamper the attainment of optimal results and broader generalization. In light of this, the paper presents an ensemble method that leverages the strengths of multiple Convolutional Neural Networks (CNNs) and Transformer models to elevate gait recognition performance. Additionally, a novel gait representation named windowed Gait Energy Image (GEI) is introduced, obtained by averaging gait frames irrespective of gait cycles. Firstly, the windowed GEI is input to the Convolutional Neural Networks and Transformer models to learn significant gait features. Each model is followed by a Multilayer Perceptron (MLP) to encode the relationship between the extracted features and corresponding class labels. Subsequently, the extracted gait features from each model are flattened and concatenated into a cohesive feature representation before passing through another MLP for subject classification. The performance of the proposed method was assessed on three datasets: OU-ISIR dataset D, CASIA-B, and OU-LP dataset. Experimental results demonstrated remarkable improvements compared to existing methods across all three datasets. The proposed method achieved accuracy rates of 100% on OU-ISIR D, 99.93% on CASIA-B, and 99.94% on OU-LP, showcasing the superior performance of the Ensemble CNN-ViT model using feature-level fusion compared to state-of-the-art methods.
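The two steps the abstract describes, averaging gait frames without regard to gait cycles and concatenating flattened per-model features, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `windowed_gei` and `fuse_features`, the fixed `window` and `stride` parameters, and the placeholder feature arrays are all assumptions made for illustration, since the abstract does not state how windows are chosen or what the backbone outputs look like.

```python
import numpy as np


def windowed_gei(frames: np.ndarray, window: int, stride: int) -> np.ndarray:
    """Average each run of `window` consecutive silhouette frames.

    frames: array of shape (T, H, W). Windows are picked by position only;
    no gait-cycle detection is attempted. The window length and stride are
    assumed parameters, not values taken from the paper.
    """
    if frames.ndim != 3:
        raise ValueError("frames must have shape (T, H, W)")
    total = frames.shape[0]
    if window < 1 or stride < 1 or window > total:
        raise ValueError("need 1 <= window <= T and stride >= 1")
    starts = range(0, total - window + 1, stride)
    return np.stack([frames[s:s + window].mean(axis=0) for s in starts])


def fuse_features(feature_maps: list) -> np.ndarray:
    """Flatten each model's per-sample features and concatenate them."""
    flat = [f.reshape(f.shape[0], -1) for f in feature_maps]
    return np.concatenate(flat, axis=1)


# Toy sequence: 10 frames of 4x4 silhouettes, blank for the first five frames.
frames = np.zeros((10, 4, 4))
frames[5:] = 1.0
geis = windowed_gei(frames, window=4, stride=2)

# Placeholder outputs standing in for a CNN and a Transformer backbone.
cnn_features = np.ones((geis.shape[0], 8, 2, 2))
vit_features = np.ones((geis.shape[0], 5, 16))
fused = fuse_features([cnn_features, vit_features])
```

In the described method the fused vector would then be passed to an MLP for subject classification; that stage is omitted here to keep the sketch short.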


Pages: 108573 - 108583

Page count: 11

50 items in total

[41] MFCF-Gait: Small Silhouette-Sensitive Gait Recognition Algorithm Based on Multi-Scale Feature Cross-Fusion
Song, Chenyang
Yun, Lijun
Li, Ruoyu
SENSORS, 2024, 24 (17)
[42] DHERF: A Deep Learning Ensemble Feature Extraction Framework for Emotion Recognition Using Enhanced-CNN
Basha, Shaik Abdul Khalandar
Vincent, P. M. Durai Raj
JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (07) : 853 - 861
[43] Deep learning gait recognition based on two branch spatiotemporal gait feature fusion
Zhang Y.-Z.
Dong X.
Zhang, Yun-Zuo (zhangyunzuo888@sina.com), 1600, Northeast University (39) : 1403 - 1408
[44] SAFLFusionGait: Gait recognition network with separate attention and different granularity feature learnability fusion
Hu, Yuchen
Chen, Zhenxue
Liu, Chengyun
Liang, Tian
Lu, Dan
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
[45] Plant Leaf Identification Using Feature Fusion of Wavelet Scattering Network and CNN With PCA Classifier
Gowthaman, S.
Das, Abhishek
IEEE ACCESS, 2025, 13 : 11594 - 11608
[46] A lightweight neural network with feature-level fusion and attention mechanisms for brain tumor classification
Omair Bilal
Sohaib Asif
Multiscale and Multidisciplinary Modeling, Experiments and Design, 2025, 8 (6)
[47] Ensemble Learning Using Pressure Sensor for Gait Recognition
Jung, Jinwon
Choi, Young Chan
Choi, Sang-Il
2021 IEEE REGION 10 SYMPOSIUM (TENSYMP), 2021
[48] Enhancing medical image analysis: A fusion of fully connected neural network classifier with CNN-VIT for improved retinal disease detection
Mannanuddin, Khaja
Vimal, V. R.
Srinivas, Angalkuditi
Mageswari, S. D. Uma
Mahendran, G.
Ramya, J.
Kumar, Ashok
Das, Pranjal
Vidhya, R. G.
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 12313 - 12328
[49] VTCNet: A Feature Fusion DL Model Based on CNN and ViT for the Classification of Cervical Cells
Li, Mingzhe
Que, Ningfeng
Zhang, Juanhua
Du, Pingfang
Dai, Yin
INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (05)
[50] Oral cancer detection using feature-level fusion and novel self-attention mechanisms
Khan, Saif Ur Rehman
Asif, Sohaib
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 95
