Using Motion History Images With 3D Convolutional Networks in Isolated Sign Language Recognition

被引:22
|
作者
Mercanoglu Sincan, Ozge [1 ]
Keles, Hacer Yalim [2 ]
机构
[1] Ankara Univ, Comp Engn Dept, TR-06830 Ankara, Turkey
[2] Hacettepe Univ, Comp Engn Dept, TR-06800 Ankara, Turkey
关键词
Assistive technologies; Gesture recognition; History; Data models; Image recognition; Faces; Feature extraction; 3D-CNN; attention; deep learning; motion history image; sign language recognition; FEATURES;
D O I
10.1109/ACCESS.2022.3151362
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sign language recognition using computational models is a challenging problem that requires simultaneous spatio-temporal modeling of the multiple sources, i.e. faces, hands, body, etc. In this paper, we propose an isolated sign language recognition model based on a model trained using Motion History Images (MHI) that are generated from RGB video frames. RGB-MHI images represent spatio-temporal summary of each sign video effectively in a single RGB image. We propose two different approaches using this RGB-MHI model. In the first approach, we use the RGB-MHI model as a motion-based spatial attention module integrated into a 3D-CNN architecture. In the second approach, we use RGB-MHI model features directly with the features of a 3D-CNN model using a late fusion technique. We perform extensive experiments on two recently released large-scale isolated sign language datasets, namely AUTSL and BosphorusSign22k. Our experiments show that our models, which use only RGB data, can compete with the state-of-the-art models in the literature that use multi-modal data.
引用
收藏
页码:18608 / 18618
页数:11
相关论文
共 50 条
  • [1] SIGN LANGUAGE RECOGNITION USING 3D CONVOLUTIONAL NEURAL NETWORKS
    Huang, Jie
    Zhou, Wengang
    Li, Houqiang
    Li, Weiping
    2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015,
  • [2] The Efficiency of Sign Language Recognition using 3D Convolutional Neural Networks
    Soodtoetong, Nantinee
    Gedkhaw, Eakbodin
    2018 15TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2018, : 70 - 73
  • [3] 3D Convolutional Neural Networks for Dynamic Sign Language Recognition
    Liang, Zhi-Jie
    Liao, Sheng-Bin
    Hu, Bing-Zhang
    COMPUTER JOURNAL, 2018, 61 (11): : 1724 - 1736
  • [4] Sign Language Recognition Based on 3D Convolutional Neural Networks
    Ramos Neto, Geovane M.
    Braz Junior, Geraldo
    Sousa de Almeida, Joao Dallyson
    de Paiva, Anselmo Cardoso
    IMAGE ANALYSIS AND RECOGNITION (ICIAR 2018), 2018, 10882 : 399 - 407
  • [5] Chinese Sign Language Recognition with 3D Hand Motion Trajectories and Depth Images
    Geng, Lubo
    Ma, Xin
    Wang, Haibo
    Gu, Jason
    Li, Yibin
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 1457 - 1461
  • [6] Isolated sign language recognition through integrating pose data and motion history images
    Akdag, Ali
    Baykan, Omer Kaan
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [7] Isolated sign language recognition through integrating pose data and motion history images
    Akdağ A.
    Baykan Ö.K.
    PeerJ Computer Science, 2024, 10
  • [8] Sign Language Recognition Using Convolutional Neural Networks
    Pigou, Lionel
    Dieleman, Sander
    Kindermans, Pieter-Jan
    Schrauwen, Benjamin
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 572 - 578
  • [9] RECOGNITION OF SIGN LANGUAGE MOTION IMAGES
    TAMURA, S
    KAWASAKI, S
    PATTERN RECOGNITION, 1988, 21 (04) : 343 - 353
  • [10] On using sign method for 3D images recognition and classification
    Ivanko, EE
    Perevalov, DS
    International Conference on Computing, Communications and Control Technologies, Vol 5, Proceedings, 2004, : 248 - 251