Joint space representation and recognition of sign language fingerspelling using Gabor filter and convolutional neural network

被引:13
作者
Luqman, Hamzah [1 ]
El-Alfy, El-Sayed M. [1 ]
BinMakhashen, Galal M. [1 ]
机构
[1] King Fahd Univ Petr & Minerals, Dhahran 31261, Saudi Arabia
关键词
Human-machine interaction; Sign language; Hand gesture; Gabor filter; Deep learning; Multimodal recognition systems; CLASSIFICATION; TRANSFORM; AUGMENTATION; FEATURES; IMAGES;
D O I
10.1007/s11042-020-09994-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we are proposing a new technique for visual recognition of fingerspelling of a sign language by fusing multiple spatial and spectral representations of manual gesture images using a convolutional neural network. This problem is gaining prominence in communication between hearing-impaired people and human-machine interaction. The proposed technique computes Gabor spectral representations of spatial images of hand sign gestures and uses an optimized convolutional neural network to classify the gestures in the joint space into corresponding classes. Various ways to combine both types of modalities are explored to identify the model that improves the robustness and recognition accuracy. The proposed system is evaluated using three databases (MNIST-ASL, ArSL, and MUASL) under different conditions and the attained results outperformed the state-of-the-art techniques.
引用
收藏
页码:10213 / 10234
页数:22
相关论文
共 59 条
[1]   A Review on Systems-Based Sensory Gloves for Sign Language Recognition State of the Art between 2007 and 2017 [J].
Ahmed, Mohamed Aktham ;
Zaidan, Bilal Bahaa ;
Zaidan, Aws Alaa ;
Salih, Mahmood Maher ;
Bin Lakulu, Muhammad Modi .
SENSORS, 2018, 18 (07)
[2]  
Aljahdali S, 2012, INT CONF MULTIMED, P125
[3]  
[Anonymous], 2012, Improving neural networks by preventing co-adaptation of feature detectors
[4]  
[Anonymous], 2012, P ACCV 12
[5]  
[Anonymous], 2017, CoRR abs/1710.06836.
[6]  
Aowal M.A., 2014, TENCON 2014 IEEE REG, P1
[7]  
Barczak ALC, 2011, RES LETT INF MATH SC, V15, P12
[8]  
BinMakhashen GM, 2012, LECT NOTES COMPUT SC, V7667, P410, DOI 10.1007/978-3-642-34500-5_49
[9]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[10]   Trigger Detection System for American Sign Language using Deep Convolutional Neural Networks [J].
Chakraborty, Debasrita ;
Garg, Deepankar ;
Ghosh, Ashish ;
Chan, Jonathan H. .
PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON ADVANCES IN INFORMATION TECHNOLOGY (IAIT2018), 2018,