Multiscaled Multi-Head Attention-Based Video Transformer Network for Hand Gesture Recognition

被引:13
|
作者
Garg, Mallika [1 ]
Ghosh, Debashis [1 ]
Pradhan, Pyari Mohan [1 ]
机构
[1] Indian Inst Technol, Dept Elect & Commun Engn, Roorkee 247667, India
关键词
Dynamic gesture recognition; multi-modal recognition; multi-head attention; multiscale pyramid attention; video transformer;
D O I
10.1109/LSP.2023.3241857
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Dynamic gesture recognition is one of the challenging research areas due to variations in pose, size, and shape of the signer's hand. In this letter, Multiscaled Multi-Head Attention Video Transformer Network (MsMHA-VTN) for dynamic hand gesture recognition is proposed. A pyramidal hierarchy of multiscale features is extracted using the transformer multiscaled head attention model. The proposed model employs different attention dimensions for each head of the transformer which enables it to provide attention at the multiscale level. Further, in addition to single modality, recognition performance using multiple modalities is examined. Extensive experiments demonstrate the superior performance of the proposed MsMHA-VTN with an overall accuracy of 88.22% and 99.10% on NVGesture and Briareo datasets, respectively.
引用
收藏
页码:80 / 84
页数:5
相关论文
共 50 条
  • [1] Content-Adaptive and Attention-Based Network for Hand Gesture Recognition
    Cao, Zongjing
    Li, Yan
    Shin, Byeong-Seok
    APPLIED SCIENCES-BASEL, 2022, 12 (04):
  • [2] Multistage Spatial Attention-Based Neural Network for Hand Gesture Recognition
    Miah, Abu Saleh Musa
    Hasan, Md. Al Mehedi
    Shin, Jungpil
    Okuyama, Yuichi
    Tomioka, Yoichi
    COMPUTERS, 2023, 12 (01)
  • [3] Speech recognition based on the transformer's multi-head attention in Arabic
    Mahmoudi O.
    Filali-Bouami M.
    Benchat M.
    International Journal of Speech Technology, 2024, 27 (01) : 211 - 223
  • [4] Multi-Head Structural Attention-Based Vision Transformer with Sequential Views for 3D Object Recognition
    Bao, Jianjun
    Luo, Ke
    Kou, Qiqi
    He, Liang
    Zhao, Guo
    APPLIED SCIENCES-BASEL, 2025, 15 (06):
  • [5] Multi-head attention-based two-stream EfficientNet for action recognition
    Zhou, Aihua
    Ma, Yujun
    Ji, Wanting
    Zong, Ming
    Yang, Pei
    Wu, Min
    Liu, Mingzhe
    MULTIMEDIA SYSTEMS, 2023, 29 (02) : 487 - 498
  • [6] Multi-head attention-based two-stream EfficientNet for action recognition
    Aihua Zhou
    Yujun Ma
    Wanting Ji
    Ming Zong
    Pei Yang
    Min Wu
    Mingzhe Liu
    Multimedia Systems, 2023, 29 : 487 - 498
  • [7] Multi-Head Attention-Based Spectrum Sensing for Radio
    Devarakonda, B. V. Ravisankar
    Nandanavam, Venkateswararao
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (02) : 135 - 143
  • [8] Long-Range Hand Gesture Recognition via Attention-based SSD Network
    Zhou, Liguang
    Du, Chenping
    Sun, Zhenglong
    Lam, Tin Lun
    Xu, Yangsheng
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 1832 - 1838
  • [9] MSKd_Net: Multi-Head Attention-Based Swin Transformer for Kidney Diseases Classification
    Sharen, H.
    Narendra, Modigari
    Anbarasi, L. Jani
    IEEE ACCESS, 2024, 12 : 181975 - 181986
  • [10] Multi-Head Attention-Based Hybrid Deep Neural Network for Aeroengine Risk Assessment
    Li, Jian-Hang
    Gao, Xin-Yue
    Lu, Xiang
    Liu, Guo-Dong
    IEEE ACCESS, 2023, 11 : 113376 - 113389