Isolated Sign Language Recognition with Multi-scale Features using LSTM

Cited by: 18

Authors:

Mercanoglu Sincan, Ozge [1]

Tur, Anil Osman [1]

Yalim Keles, Hacer [1]

Affiliations:

[1] Ankara Univ, Bilgisayar Muhendisligi, Ankara, Turkey

Source:

2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU) | 2019

Keywords:

convolutional neural networks; long short-term memory; feature pooling module; sign language recognition;

DOI:

10.1109/siu.2019.8806467

Chinese Library Classification (CLC) codes:

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Sign language recognition systems automatically convert signs in video streams to text. In this work, an original isolated sign language recognition model is built from Convolutional Neural Networks (CNNs), a Feature Pooling Module (FPM), and Long Short-Term Memory networks (LSTMs). In the CNN part, a pre-trained VGG-16 model, with its weights adapted to the dataset, is used identically in two parallel branches that extract features from the color (RGB) and depth streams. The extracted features are passed to the FPM to generate multi-scale features. The feature matrices are reduced to representative feature vectors using Global Average Pooling (GAP). The feature vectors obtained from the RGB and depth streams are concatenated and, after instance normalization, passed to the LSTM. The proposed model achieves 93.15% test accuracy on the Montalbano Italian sign language dataset, which is comparable with recent state-of-the-art methods.
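The fusion step described in the abstract (GAP on each stream, concatenation, then instance normalization) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. The function names, the toy 2 x 2 x 2 feature maps, and the choice to normalize each fused per-frame vector to zero mean and unit variance are all assumptions made for illustration, and the VGG-16, FPM, and LSTM stages are omitted.

```python
import math

def global_average_pool(feature_maps):
    """Reduce a C x H x W feature tensor (nested lists) to a length-C vector."""
    pooled = []
    for channel in feature_maps:
        values = [v for row in channel for v in row]
        pooled.append(sum(values) / len(values))
    return pooled

def instance_normalize(vector, eps=1e-5):
    """Normalize one vector to zero mean and approximately unit variance.

    Assumption: normalization is applied per fused frame vector; the
    abstract does not state the exact axes used in the paper.
    """
    mean = sum(vector) / len(vector)
    var = sum((v - mean) ** 2 for v in vector) / len(vector)
    return [(v - mean) / math.sqrt(var + eps) for v in vector]

def fuse_frame(rgb_maps, depth_maps):
    """GAP each stream, concatenate the vectors, then normalize the result."""
    fused = global_average_pool(rgb_maps) + global_average_pool(depth_maps)
    return instance_normalize(fused)

# Toy per-frame feature maps: 2 channels of 2 x 2 per stream (hypothetical sizes).
rgb_maps = [[[1.0, 3.0], [5.0, 7.0]], [[0.0, 0.0], [2.0, 2.0]]]
depth_maps = [[[4.0, 4.0], [4.0, 4.0]], [[10.0, 6.0], [8.0, 8.0]]]

frame_vector = fuse_frame(rgb_maps, depth_maps)

# One fused vector per frame; a sequence of these would feed the LSTM.
sequence = [frame_vector for _ in range(3)]
```

In this sketch each frame yields one fused vector whose length is the sum of the two streams' channel counts, so a video becomes a sequence of fixed-length vectors suitable as recurrent-network input.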


Pages: 4

