Chinese Sign Language Recognition with Sequence to Sequence Learning

被引:11
作者
Mao, Chensi [1 ]
Huang, Shiliang [1 ]
Li, Xiaoxu [1 ]
Ye, Zhongfu [1 ]
机构
[1] Univ Sci & Technol China, Natl Engn Lab Speech & Language Informat Proc, Dept Elect Engn & Informat Sci, Hefei 230027, Anhui, Peoples R China
来源
COMPUTER VISION, PT I | 2017年 / 771卷
关键词
Sign language recognition; Long short-term memory; Convolutional neural network; Trajectory;
D O I
10.1007/978-981-10-7299-4_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we formulate Chinese sign language recognition (SLR) as a sequence to sequence problem and propose an encoder-decoder based framework to handle it. The proposed framework is based on the convolutional neural network (CNN) and recurrent neural network (RNN) with long short-term memory (LSTM). Specifically, CNN is adopted to extract the spatial features of input frames. Two LSTM layers are cascaded to implement the structure of encoder-decoder. The encoder-decoder can not only learn the temporal information of the input features but also can learn the context model of sign language words. We feed the images sequences captured by Microsoft Kinect2.0 into the network to build an end-to-end model. Moreover, we also set up another model by using skeletal coordinates as the input of the encoder-decoder framework. In the recognition stage, a probability combination method is proposed to fuse these two models to get the final prediction. We validate our method on the self-build dataset and the experimental results demonstrate the effectiveness of the proposed method.
引用
收藏
页码:180 / 191
页数:12
相关论文
共 50 条
  • [1] Multimodal Sign Language Recognition via Temporal Deformable Convolutional Sequence Learning
    Papadimitriou, Katerina
    Potamianos, Gerasimos
    INTERSPEECH 2020, 2020, : 2752 - 2756
  • [2] Continuous Chinese Sign Language Recognition with CNN-LSTM
    Yang, Su
    Zhu, Qing
    NINTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2017), 2017, 10420
  • [3] SeqαGAN: Sign Language Sequence Generation Based on Variational and Adversarial Learning
    Xiao, Qinkun
    Li, Lu
    Zhu, Yilin
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (07) : 9320 - 9329
  • [4] Recent Advances on Deep Learning for Sign Language Recognition
    Zhang, Yanqiong
    Jiang, Xianwei
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 139 (03): : 2399 - 2450
  • [5] Sequence recognition of Chinese license plates
    Wang, Jianlin
    Huang, He
    Qian, Xusheng
    Cao, Jinde
    Dai, Yakang
    NEUROCOMPUTING, 2018, 317 : 149 - 158
  • [6] A NEW SYSTEM FOR CHINESE SIGN LANGUAGE RECOGNITION
    Zhang, Jihai
    Zhou, Wengang
    Li, Houqiang
    2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 534 - 538
  • [7] CHINESE SIGN LANGUAGE RECOGNITION WITH ADAPTIVE HMM
    Zhang, Jihai
    Zhou, Wengang
    Xie, Chao
    Pu, Junfu
    Li, Hougiang
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [8] Multimodal Learning for Sign Language Recognition
    Ferreira, Pedro M.
    Cardoso, Jaime S.
    Rebelo, Ana
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2017), 2017, 10255 : 313 - 321
  • [9] On the role of multimodal learning in the recognition of sign language
    Ferreira, Pedro M.
    Cardoso, Jaime S.
    Rebelo, Ana
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (08) : 10035 - 10056
  • [10] On the role of multimodal learning in the recognition of sign language
    Pedro M. Ferreira
    Jaime S. Cardoso
    Ana Rebelo
    Multimedia Tools and Applications, 2019, 78 : 10035 - 10056