Multimodal Learning for Sign Language Recognition

被引:7
作者
Ferreira, Pedro M. [1 ]
Cardoso, Jaime S. [1 ]
Rebelo, Ana [1 ]
机构
[1] INESC TEC, Porto, Portugal
来源
PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2017) | 2017年 / 10255卷
关键词
Sign Language Recognition; Multimodal learning; Convolutional neural networks; Kinect; Leap Motion;
D O I
10.1007/978-3-319-58838-4_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sign Language Recognition (SLR) has becoming one of the most important research areas in the field of human computer interaction. SLR systems are meant to automatically translate sign language into text or speech, in order to reduce the communicational gap between deaf and hearing people. The aim of this paper is to exploit multimodal learning techniques for an accurate SLR, making use of data provided by Kinect and Leap Motion. In this regard, single-modality approaches as well as different multimodal methods, mainly based on convolutional neural networks, are proposed. Experimental results demonstrate that multimodal learning yields an overall improvement in the sign recognition performance.
引用
收藏
页码:313 / 321
页数:9
相关论文
共 10 条
[1]  
Adithya V, 2013, 2013 IEEE CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES (ICT 2013), P1080
[2]  
[Anonymous], 2011, P IEEE WORKSH APPL C, DOI DOI 10.1109/WACV.2011.5711485
[3]  
Cooper H, 2007, LECT NOTES COMPUT SC, V4796, P88
[4]   Combining multiple depth-based descriptors for hand gesture recognition [J].
Dominio, Fabio ;
Donadeo, Mauro ;
Zanuttigh, Pietro .
PATTERN RECOGNITION LETTERS, 2014, 50 :101-111
[5]  
Marin G, 2014, IEEE IMAGE PROC, P1565, DOI 10.1109/ICIP.2014.7025313
[6]   Hand gesture recognition with jointly calibrated Leap Motion and depth sensor [J].
Marin, Giulio ;
Dominio, Fabio ;
Zanuttigh, Pietro .
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (22) :14991-15015
[7]  
Ngiam J., 2011, INT C MACH LEARN, V6
[8]  
Potter LE, 2013, P 25 AUSTR COMP HUM, P175, DOI DOI 10.1145/2541016.2541072
[9]   A Taxonomy of Deep Convolutional Neural Nets for Computer Vision [J].
Srinivas, Suraj ;
Sarvadevabhatla, Ravi Kiran ;
Mopuri, Konda Reddy ;
Prabhu, Nikita ;
Kruthiventi, Srinivas S. S. ;
Babu, R. Venkatesh .
FRONTIERS IN ROBOTICS AND AI, 2016, 2
[10]  
Srivastava N, 2014, J MACH LEARN RES, V15, P1929