Isolated Sign Language Recognition with Multi-scale Features using LSTM

被引:18
|
作者
Mercanoglu Sincan, Ozge [1 ]
Tur, Anil Osman [1 ]
Yalim Keles, Hacer [1 ]
机构
[1] Ankara Univ, Bilgisayar Muhendisligi, Ankara, Turkey
关键词
convolutional neural networks; long short-term memory; feature pooling module; sign language recognition;
D O I
10.1109/siu.2019.8806467
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Sign language recognition systems are used to convert signs in video streams to text automatically. In this work, an original isolated sign language recognition model is created using Convolutional Neural Networks (CNNs), Feature Pooling Module and Long Short-Term Memory Networks (LSTMs). In the CNN part, a pre-trained VGG-16 model is used identically in two parallel architectures, after adapting its weights to the dataset; in this architecture, the features from color (RGB) and depth streams are extracted in parallel. The extracted features are directed to FPM to generate multi-scale features. The features matrices are reduced to representative feature vectors, using Global Average Pooling (GAP). The features that are obtained from RGB and depth streams are concatenated and passed to the LSTM architecture after instance normalization. We get 93.15% test accuracy on Montalbano Italian sign language dataset using the proposed model; this result is comparable with the recent state-of-the-art methods.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Isolated Sign Language Recognition with Multi-Scale Spatial-Temporal Graph Convolutional Networks
    Vazquez-Enriquez, Manuel
    Alba-Castro, Jose L.
    Docio-Fernandez, Laura
    Rodriguez-Banga, Eduardo
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3457 - 3466
  • [2] Multi-scale context-aware network for continuous sign language recognition
    XUE, Senhua
    GAO, Liqing
    WAN, Liang
    FENG, Wei
    Virtual Reality and Intelligent Hardware, 2024, 6 (04): : 323 - 337
  • [3] Multi-scale context-aware network for continuous sign language recognition
    Senhua XUE
    Liqing GAO
    Liang WAN
    Wei FENG
    虚拟现实与智能硬件(中英文), 2024, 6 (04) : 323 - 337
  • [4] Compact Multi-scale Periocular Recognition Using SAFE Features
    Alonso-Fernandez, Fernando
    Mikaelyan, Anna
    Bigun, Josef
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 1455 - 1460
  • [5] Continuous Sign Language Recognition With Multi-Scale Spatial-Temporal Feature Enhancement
    Wang, Zhen
    Li, Dongyuan
    Jiang, Renhe
    Okumura, Manabu
    IEEE Access, 13 : 5491 - 5506
  • [6] Multi-scale local-temporal similarity fusion for continuous sign language recognition
    Xie, Pan
    Cui, Zhi
    Du, Yao
    Zhao, Mengyi
    Cui, Jianwei
    Wang, Bin
    Hu, Xiaohui
    PATTERN RECOGNITION, 2023, 136
  • [7] Continuous Sign Language Recognition With Multi-Scale Spatial-Temporal Feature Enhancement
    Wang, Zhen
    Li, Dongyuan
    Jiang, Renhe
    Okumura, Manabu
    IEEE ACCESS, 2025, 13 : 5491 - 5506
  • [8] Traffic Sign Recognition with Multi-Scale Convolutional Networks
    Sermanet, Pierre
    LeCun, Yann
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 2809 - 2813
  • [9] EAR RECOGNITION BASED ON MULTI-SCALE FEATURES
    Zeng, Hui
    Mu, Zhi-Chun
    Yuan, Li
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 2418 - 2422
  • [10] Difference-guided multi-scale spatial-temporal representation for sign language recognition
    Gao, Liqing
    Hu, Lianyu
    Lyu, Fan
    Zhu, Lei
    Wan, Liang
    Pun, Chi-Man
    Feng, Wei
    VISUAL COMPUTER, 2023, 39 (08): : 3417 - 3428