Multi-modality-based Arabic sign language recognition

Cited by: 19
Authors
Elpeltagy, Marwa [1]
Abdelwahab, Moataz [1]
Hussein, Mohamed E. [2, 3]
Shoukry, Amin [2, 4]
Shoala, Asmaa [5]
Galal, Moustafa [5]
Affiliations
[1] Egypt Japan Univ Sci & Technol, Dept Elect & Commun Engn, Alexandria 21934, Egypt
[2] Alexandria Univ, Comp & Syst Engn Dept, Alexandria 21544, Egypt
[3] Informat Sci Inst, Arlington, VA 22203 USA
[4] Egypt Japan Univ Sci & Technol, Dept Comp Sci & Engn, Alexandria 21934, Egypt
[5] ITAC Res Project, Alexandria 21934, Egypt
Keywords
image motion analysis; image segmentation; feature extraction; sign language recognition; image classification; handicapped aids; principal component analysis; sign language recognition algorithm; hand segmentation; body motion description; sign classification; hand shape segmentation; hand joints; segmented hand shapes; hand shape sequence descriptor; hand states; face properties; motion sequence description; deaf-mute people; Arabic sign language recognition benchmark data sets; ArSL signs; multimodality-based Arabic sign language recognition; histogram of oriented gradients; canonical correlation analysis; random forest classifiers; GESTURE RECOGNITION;
DOI
10.1049/iet-cvi.2017.0598
Chinese Library Classification (CLC) number
TP18 [Theory of artificial intelligence];
Subject classification codes
081104; 0812; 0835; 1405
Abstract
With the increase in the number of deaf-mute people in the Arab world and the lack of Arabic sign language (ArSL) recognition benchmark data sets, there is a pressing need to publish a large-volume and realistic ArSL data set. This study presents such a data set, which consists of 150 isolated ArSL signs. The data set is challenging due to the great similarity among hand shapes and motions in the collected signs. Along with the data set, a sign language recognition algorithm is presented. The authors' proposed method consists of three major stages: hand segmentation, hand shape sequence and body motion description, and sign classification. The hand shape segmentation is based on the depth and position of the hand joints. Histograms of oriented gradients and principal component analysis are applied to the segmented hand shapes to obtain the hand shape sequence descriptor. The covariance of the three-dimensional joints of the upper half of the skeleton, together with the hand states and face properties, is adopted for motion sequence description. Canonical correlation analysis and random forest classifiers are used for classification. The achieved accuracy is 55.57% over 150 ArSL signs, which is considered promising.
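As a rough illustration of the motion description stage mentioned in the abstract, the sketch below computes a covariance descriptor from a sequence of three-dimensional joint positions. It is not the authors' implementation: the function name `joint_covariance_descriptor`, the `(frames, joints, 3)` array layout, and the choice of 10 joints are assumptions made for the example, and the hand states and face properties are left out.

```python
import numpy as np

def joint_covariance_descriptor(joints):
    """Covariance descriptor of a 3D joint sequence (illustrative sketch).

    joints: array of shape (T, J, 3) with T frames, J joints, xyz coordinates.
    Returns the upper-triangular entries of the (3J x 3J) covariance matrix.
    """
    joints = np.asarray(joints, dtype=float)
    T, J, _ = joints.shape
    flat = joints.reshape(T, J * 3)       # one row per frame, one column per coordinate
    cov = np.cov(flat, rowvar=False)      # covariance of the coordinates over time
    iu = np.triu_indices(J * 3)           # matrix is symmetric, keep the upper triangle
    return cov[iu]

# Hypothetical input: 40 frames of 10 upper-body joints (random stand-in data).
rng = np.random.default_rng(0)
demo = rng.normal(size=(40, 10, 3))
desc = joint_covariance_descriptor(demo)
print(desc.shape)
```

The descriptor length depends only on the number of joints, not on the number of frames, which is what makes a covariance summary convenient for signs of varying duration.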
Pages: 1031-1039
Page count: 9
Related papers
30 in total
  • [1] Almohimeed A., 2012, THESIS
  • [2] [Anonymous], P 2013 WORKSH MULT C
  • [3] [Anonymous], 2013, P 33 INT JOINT C ART
  • [4] [Anonymous], 2008, P LREC2008 3 WORKSHO
  • [5] [Anonymous], 2008, P 19 BRIT MACHINE VI
  • [6] Nearest neighbour classification of Indian sign language gestures using kinect camera
    Ansari, Zafar Ahmed
    Harit, Gaurav
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2016, 41 (02): 161-182
  • [7] Bungeroth J., 2008, P INT C LANG RES EV
  • [8] Coogan T, 2006, LECT NOTES COMPUT SC, V4291, P495
  • [9] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005: 886-893
  • [10] Dong C., 2015, P IEEE C COMP VIS PA, P44, DOI 10.1109/CVPRW.2015.7301347