Review of Sign Language Recognition Based on Deep Learning

被引:9
作者
Zhang Shujun [1 ]
Zhang Qun [1 ]
Li Hui [1 ]
机构
[1] Qingdao Univ Sci & Technol, Coll Informat Sci & Technol, Qingdao 266061, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Sign language recognition; Convolutional Neural Network (CNN); Recurrent Neural Network (RNN); Long-term temporal sequence modeling; CORPUS; VIDEO;
D O I
10.11999/JEIT190416
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Sign language recognition involves computer vision, pattern recognition, human-computer interaction, etc. It has important research significance and application value. The flourishing of deep learning technology brings new opportunities for more accurate and real-time sign language recognition. This paper reviews the sign language recognition technology based on deep learning in recent years, formulates and analyzes the algorithms from two branches - isolated words and continuous sentences. The isolated-word recognition technology is divided into three structures: Convolutional Neural Network (CNN), Three-Dimensional Convolutional Neural Network (3D-CNN) and Recurrent Neural Network (RNN) based method. The model used for continuous sentence recognition has higher complexity and is usually assisted with certain kind of long-term temporal sequence modeling algorithm. According to the major structure, there are three categories: the bidirectional LSTM, the 3D convolutional network model and the hybrid model. Common sign language datasets at home and abroad are summarized. Finally, the research challenges and development trends of sign language recognition technology are discussed, concluding that the robustness and practicality on the premise of high-precision still requires to be promoted.
引用
收藏
页码:1021 / 1032
页数:12
相关论文
共 69 条
[51]   Multimodal Gesture Recognition Based on the ResC3D Network [J].
Miao, Qiguang ;
Li, Yunan ;
Ouyang, Wanli ;
Ma, Zhenxin ;
Xu, Xin ;
Shi, Weikang ;
Cao, Xiaochun .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :3047-3055
[52]   A Modified LSTM Model for Continuous Sign Language Recognition Using Leap Motion [J].
Mittal, Anshul ;
Kumar, Pradeep ;
Roy, Partha Pratim ;
Balasubramanian, Raman ;
Chaudhuri, Bidyut B. .
IEEE SENSORS JOURNAL, 2019, 19 (16) :7056-7063
[53]  
Molchanov Pavlo, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), P1, DOI 10.1109/CVPRW.2015.7301342
[54]  
Ong EJ, 2012, PROC CVPR IEEE, P2200, DOI 10.1109/CVPR.2012.6247928
[55]  
Oszust M, 2013, C HUM SYST INTERACT, P219, DOI 10.1109/HSI.2013.6577826
[56]   Gesture and Sign Language Recognition with Temporal Residual Networks [J].
Pigou, Lionel ;
Van Herreweghe, Mieke ;
Dambre, Joni .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :3086-3093
[57]  
Prabhavalkar R, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P4839, DOI 10.1109/ICASSP.2018.8461809
[58]  
Pugeault N, 2011, 2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), DOI 10.1109/ICCVW.2011.6130290
[59]  
Rosete A., 2016, 22 CONGRESO ARGENTIN
[60]  
Shi BW, 2018, IEEE W SP LANG TECH, P145, DOI 10.1109/SLT.2018.8639639