Ensemble CNN-ViT Using Feature-Level Fusion for Gait Recognition

被引:0
|
作者
Mogan, Jashila Nair [1 ]
Lee, Chin Poo [1 ]
Lim, Kian Ming [2 ]
机构
[1] Multimedia Univ, Fac Informat Sci & Technol, Melaka 75450, Malaysia
[2] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo 315100, Zhejiang, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Computational modeling; Hidden Markov models; Convolutional neural networks; Transformers; Deep learning; Biological system modeling; ensemble; fusion; feature-fusion; gait; gait recognition; IMAGE; MODEL;
D O I
10.1109/ACCESS.2024.3439602
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Individual deep learning models showcase impressive performance; however, the capacity of a single model might fall short in capturing the full spectrum of intricate patterns present in the input data. Thus, relying solely on a single model may hamper the attainment of optimal results and broader generalization. In light of this, the paper presents an ensemble method that leverages the strengths of multiple Convolutional Neural Networks (CNNs) and Transformer models to elevate gait recognition performance. Additionally, a novel gait representation named windowed Gait Energy Image (GEI) is introduced, obtained by averaging gait frames irrespective of gait cycles. Firstly, the windowed GEI is input to the Convolutional Neural Networks and Transformer models to learn significant gait features. Each model is followed by a Multilayer Perceptron (MLP) to encode the relationship between the extracted features and corresponding class labels. Subsequently, the extracted gait features from each model are flattened and concatenated into a cohesive feature representation before passing through another MLP for subject classification. The performance of the proposed method was assessed on three datasets: OU-ISIR dataset D, CASIA-B, and OU-LP dataset. Experimental results demonstrated remarkable improvements compared to existing methods across all three datasets. The proposed method achieved accuracy rates of 100% on OU-ISIR D, 99.93% on CASIA-B, and 99.94% on OU-LP, showcasing the superior performance of the Ensemble CNN-ViT model using feature-level fusion compared to state-of-the-art methods.
引用
收藏
页码:108573 / 108583
页数:11
相关论文
共 50 条
  • [41] MFCF-Gait: Small Silhouette-Sensitive Gait Recognition Algorithm Based on Multi-Scale Feature Cross-Fusion
    Song, Chenyang
    Yun, Lijun
    Li, Ruoyu
    SENSORS, 2024, 24 (17)
  • [42] DHERF: A Deep Learning Ensemble Feature Extraction Framework for Emotion Recognition Using Enhanced-CNN
    Basha, Shaik Abdul Khalandar
    Vincent, P. M. Durai Raj
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (07) : 853 - 861
  • [43] Deep learning gait recognition based on two branch spatiotemporal gait feature fusion
    Zhang Y.-Z.
    Dong X.
    Zhang, Yun-Zuo (zhangyunzuo888@sina.com), 1600, Northeast University (39): : 1403 - 1408
  • [44] SAFLFusionGait: Gait recognition network with separate attention and different granularity feature learnability fusion
    Hu, Yuchen
    Chen, Zhenxue
    Liu, Chengyun
    Liang, Tian
    Lu, Dan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
  • [45] Plant Leaf Identification Using Feature Fusion of Wavelet Scattering Network and CNN With PCA Classifier
    Gowthaman, S.
    Das, Abhishek
    IEEE ACCESS, 2025, 13 : 11594 - 11608
  • [46] A lightweight neural network with feature-level fusion and attention mechanisms for brain tumor classification
    Omair Bilal
    Sohaib Asif
    Multiscale and Multidisciplinary Modeling, Experiments and Design, 2025, 8 (6)
  • [47] Ensemble Learning Using Pressure Sensor for Gait Recognition
    Jung, Jinwon
    Choi, Young Chan
    Choi, Sang-Il
    2021 IEEE REGION 10 SYMPOSIUM (TENSYMP), 2021,
  • [48] Enhancing medical image analysis: A fusion of fully connected neural network classifier with CNN-VIT for improved retinal disease detection
    Mannanuddin, Khaja
    Vimal, V. R.
    Srinivas, Angalkuditi
    Mageswari, S. D. Uma
    Mahendran, G.
    Ramya, J.
    Kumar, Ashok
    Das, Pranjal
    Vidhya, R. G.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 12313 - 12328
  • [49] VTCNet: A Feature Fusion DL Model Based on CNN and ViT for the Classification of Cervical Cells
    Li, Mingzhe
    Que, Ningfeng
    Zhang, Juanhua
    Du, Pingfang
    Dai, Yin
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (05)
  • [50] Oral cancer detection using feature-level fusion and novel self-attention mechanisms
    Khan, Saif Ur Rehman
    Asif, Sohaib
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 95