Continuous Chinese Sign Language Recognition with CNN-LSTM

被引:18
作者
Yang, Su [1 ]
Zhu, Qing [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, 100 Ping Leyuan, Beijing 100124, Peoples R China
来源
NINTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2017) | 2017年 / 10420卷
关键词
Sign language recognition; convolutional neural network; recurrent neural network; Long Short-Term Memory;
D O I
10.1117/12.2281671
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
The goal of sign language recognition (SLR) is to translate the sign language into text, and provide a convenient tool for the communication between the deaf-mute and the ordinary. In this paper, we formulate an appropriate model based on convolutional neural network (CNN) combined with Long Short-Term Memory (LSTM) network, in order to accomplish the continuous recognition work. With the strong ability of CNN, the information of pictures captured from Chinese sign language (CSL) videos can be learned and transformed into vector. Since the video can be regarded as an ordered sequence of frames, LSTM model is employed to connect with the fully-connected layer of CNN. As a recurrent neural network (RNN), it is suitable for sequence learning tasks with the capability of recognizing patterns defined by temporal distance. Compared with traditional RNN, LSTM has performed better on storing and accessing information. We evaluate this method on our self-built dataset including 40 daily vocabularies. The experimental results show that the recognition method with CNN-LSTM can achieve a high recognition rate with small training sets, which will meet the needs of real-time SLR system.
引用
收藏
页数:7
相关论文
共 50 条
[41]   Massive MIMO CSI reconstruction using CNN-LSTM and attention mechanism [J].
Zhang, Zufan ;
Zheng, Yue ;
Gan, Chenquan ;
Zhu, Qingyi .
IET COMMUNICATIONS, 2020, 14 (18) :3089-3094
[42]   A Study on the STGCN-LSTM Sign Language Recognition Model Based on Phonological Features of Sign Language [J].
Han, Yuxin ;
Han, Yong ;
Jiang, Qi .
IEEE ACCESS, 2025, 13 :74807-74816
[43]   A Fault Diagnosis Algorithm for the Dedicated Equipment Based on the CNN-LSTM Mechanism [J].
Guo, Zhannan ;
Hao, Yinlin ;
Shi, Hanwen ;
Wu, Zhenyu ;
Wu, Yuhu ;
Sun, Ximing .
ENERGIES, 2023, 16 (13)
[44]   Prediction of sign language recognition based on multi layered CNN [J].
Prasath, G. Arun ;
Annapurani, K. .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) :29649-29669
[45]   Prediction of sign language recognition based on multi layered CNN [J].
G Arun Prasath ;
K Annapurani .
Multimedia Tools and Applications, 2023, 82 :29649-29669
[46]   Predicting catastrophic temperature changes based on past events via a CNN-LSTM regression mechanism [J].
Yusuf, Syed Adnan ;
Alshdadi, Abdulrahman A. ;
Alassafi, Madini O. ;
AlGhamdi, Rayed ;
Samad, Abdul .
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (15) :9775-9790
[47]   Multimodal fusion diagnosis of depression and anxiety based on CNN-LSTM model [J].
Xie, Wanqing ;
Wang, Chen ;
Lin, Zhixiong ;
Luo, Xudong ;
Chen, Wenqian ;
Xu, Manzhu ;
Liang, Lizhong ;
Liu, Xiaofeng ;
Wang, Yanzhong ;
Luo, Hui ;
Cheng, Mingmei .
COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2022, 102
[48]   CNN-LSTM Fusion: An Intelligent Framework for Classifying Heart Failure Severity [J].
Botros, Jad ;
Mourad-Chehade, Farah ;
Laplanche, David .
ADVANCES IN DIGITAL HEALTH AND MEDICAL BIOENGINEERING, VOL 1, EHB-2023, 2024, 109 :555-562
[49]   Predicting catastrophic temperature changes based on past events via a CNN-LSTM regression mechanism [J].
Syed Adnan Yusuf ;
Abdulrahman A. Alshdadi ;
Madini O. Alassafi ;
Rayed AlGhamdi ;
Abdul Samad .
Neural Computing and Applications, 2021, 33 :9775-9790
[50]   An efficient CNN-LSTM model for sentiment detection in #BlackLivesMatter [J].
Ankita ;
Rani, Shalli ;
Bashir, Ali Kashif ;
Alhudhaif, Adi ;
Koundal, Deepika ;
Gunduz, Emine Selda .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 193