Spatial Attention-Based 3D Graph Convolutional Neural Network for Sign Language Recognition

被引:14
作者
Al-Hammadi, Muneer [1 ,2 ]
Bencherif, Mohamed A. [1 ,3 ]
Alsulaiman, Mansour [1 ,3 ]
Muhammad, Ghulam [1 ,3 ]
Mekhtiche, Mohamed Amine [1 ,3 ]
Abdul, Wadood [1 ,3 ]
Alohali, Yousef A. [1 ,4 ]
Alrayes, Tareq S. [5 ]
Mathkour, Hassan [1 ,4 ]
Faisal, Mohammed [1 ,6 ]
Algabri, Mohammed [1 ,4 ]
Altaheri, Hamdi [1 ,3 ]
Alfakih, Taha [1 ]
Ghaleb, Hamid [1 ,7 ]
机构
[1] King Saud Univ, Ctr Smart Robot Res CS2R, Riyadh 11543, Saudi Arabia
[2] Norwegian Univ Sci & Technol, Dept Civil & Environm Engn, Fac Engn, Hogskoleringen 1, N-7034 Trondheim, Norway
[3] King Saud Univ, Dept Comp Engn, Coll Comp & Informat Sci, Riyadh 11543, Saudi Arabia
[4] King Saud Univ, Dept Comp Sci, Coll Comp & Informat Sci, Riyadh 11543, Saudi Arabia
[5] King Saud Univ, Dept Special Educ, Coll Educ, Riyadh 11543, Saudi Arabia
[6] Kuwait Coll Sci & Technol KCST, Ctr Robot, Kuwait 35004, Kuwait
[7] King Saud Univ, Software Engn Dept, Coll Comp & Informat Sci, Riyadh 11543, Saudi Arabia
关键词
sign language recognition; graph convolutional neural network (GCN); attention; deep learning; HAND GESTURE RECOGNITION; FUSION;
D O I
10.3390/s22124558
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Sign language is the main channel for hearing-impaired people to communicate with others. It is a visual language that conveys highly structured components of manual and non-manual parameters such that it needs a lot of effort to master by hearing people. Sign language recognition aims to facilitate this mastering difficulty and bridge the communication gap between hearing-impaired people and others. This study presents an efficient architecture for sign language recognition based on a convolutional graph neural network (GCN). The presented architecture consists of a few separable 3DGCN layers, which are enhanced by a spatial attention mechanism. The limited number of layers in the proposed architecture enables it to avoid the common over-smoothing problem in deep graph neural networks. Furthermore, the attention mechanism enhances the spatial context representation of the gestures. The proposed architecture is evaluated on different datasets and shows outstanding results.
引用
收藏
页数:15
相关论文
共 47 条
[1]   A survey on manual and non-manual sign language recognition for isolated and continuous sign [J].
Agrawal, Subhash Chand ;
Jalal, Anand Singh ;
Tripathi, Rajesh Kumar .
INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2016, 3 (02) :99-134
[2]   Deep Learning-Based Approach for Sign Language Gesture Recognition With Efficient Hand Gesture Representation [J].
Al-Hammadi, Muneer ;
Muhammad, Ghulam ;
Abdul, Wadood ;
Alsulaiman, Mansour ;
Bencherif, Mohammed A. ;
Alrayes, Tareq S. ;
Mathkour, Hassan ;
Mekhtiche, Mohamed Amine .
IEEE ACCESS, 2020, 8 :192527-192542
[3]   Hand Gesture Recognition for Sign Language Using 3DCNN [J].
Al-Hammadi, Muneer ;
Muhammad, Ghulam ;
Abdul, Wadood ;
Alsulaiman, Mansour ;
Bencherif, Mohamed A. ;
Mekhtiche, Mohamed Amine .
IEEE ACCESS, 2020, 8 :79491-79509
[4]   Hand Gesture Recognition Using 3D-CNN Model [J].
Al-Hammadi, Muneer ;
Muhammad, Ghulam ;
Abdul, Wadood ;
Alsulaiman, Mansour ;
Hossain, M. Shamim .
IEEE CONSUMER ELECTRONICS MAGAZINE, 2020, 9 (01) :95-101
[5]   Deep learning techniques for classification of electroencephalogram (EEG) motor imagery (MI) signals: a review [J].
Altaheri, Hamdi ;
Muhammad, Ghulam ;
Alsulaiman, Mansour ;
Amin, Syed Umar ;
Altuwaijri, Ghadir Ali ;
Abdul, Wadood ;
Bencherif, Mohamed A. ;
Faisal, Mohammed .
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (20) :14681-14722
[6]   A Multi-Branch Convolutional Neural Network with Squeeze-and-Excitation Attention Blocks for EEG-Based Motor Imagery Signals Classification [J].
Altuwaijri, Ghadir Ali ;
Muhammad, Ghulam ;
Altaheri, Hamdi ;
Alsulaiman, Mansour .
DIAGNOSTICS, 2022, 12 (04)
[7]   Attention-Inception and Long- Short-Term Memory-Based Electroencephalography Classification for Motor Imagery Tasks in Rehabilitation [J].
Amin, Syed Umar ;
Altaheri, Hamdi ;
Muhammad, Ghulam ;
Abdul, Wadood ;
Alsulaiman, Mansour .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (08) :5412-5421
[8]  
[Anonymous], 2013, Int J Comput Appl
[9]  
[Anonymous], 2016, P 22 C ARG CIENC COM
[10]  
[Anonymous], 2017, ADV NEURAL INF PROCE