Multimodal Learning for Sign Language Recognition

被引:7
|
作者
Ferreira, Pedro M. [1 ]
Cardoso, Jaime S. [1 ]
Rebelo, Ana [1 ]
机构
[1] INESC TEC, Porto, Portugal
来源
PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2017) | 2017年 / 10255卷
关键词
Sign Language Recognition; Multimodal learning; Convolutional neural networks; Kinect; Leap Motion;
D O I
10.1007/978-3-319-58838-4_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sign Language Recognition (SLR) has becoming one of the most important research areas in the field of human computer interaction. SLR systems are meant to automatically translate sign language into text or speech, in order to reduce the communicational gap between deaf and hearing people. The aim of this paper is to exploit multimodal learning techniques for an accurate SLR, making use of data provided by Kinect and Leap Motion. In this regard, single-modality approaches as well as different multimodal methods, mainly based on convolutional neural networks, are proposed. Experimental results demonstrate that multimodal learning yields an overall improvement in the sign recognition performance.
引用
收藏
页码:313 / 321
页数:9
相关论文
共 50 条
  • [1] On the role of multimodal learning in the recognition of sign language
    Ferreira, Pedro M.
    Cardoso, Jaime S.
    Rebelo, Ana
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (08) : 10035 - 10056
  • [2] On the role of multimodal learning in the recognition of sign language
    Pedro M. Ferreira
    Jaime S. Cardoso
    Ana Rebelo
    Multimedia Tools and Applications, 2019, 78 : 10035 - 10056
  • [3] Sign Language Recognition with Multimodal Sensors and Deep Learning Methods
    Lu, Chenghong
    Kozakai, Misaki
    Jing, Lei
    ELECTRONICS, 2023, 12 (23)
  • [4] Multimodal Spatiotemporal Networks for Sign Language Recognition
    Zhang, Shujun
    Meng, Weijia
    Li, Hui
    Cui, Xuehong
    IEEE ACCESS, 2019, 7 : 180270 - 180280
  • [5] Multimodal Sign Language Recognition via Temporal Deformable Convolutional Sequence Learning
    Papadimitriou, Katerina
    Potamianos, Gerasimos
    INTERSPEECH 2020, 2020, : 2752 - 2756
  • [6] Sign Language Recognition Analysis using Multimodal Data
    Hosain, Al Amin
    Santhalingam, Panneer Selvam
    Pathak, Parth
    Kosecka, Jana
    Rangwala, Huzefa
    2019 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2019), 2019, : 203 - 210
  • [7] A multimodal framework for sensor based sign language recognition
    Kumar, Pradeep
    Gauba, Himaanshu
    Roy, Partha Pratim
    Dogra, Debi Prosad
    NEUROCOMPUTING, 2017, 259 : 21 - 38
  • [8] Sign Language Recognition: Learning American Sign Language in a Virtual Environment
    Schioppo, Jacob
    Meyer, Zachary
    Fabiano, Diego
    Canavan, Shaun
    CHI EA '19 EXTENDED ABSTRACTS: EXTENDED ABSTRACTS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
  • [9] Improving Sign Language Recognition Performance Using Multimodal Data
    Nishimura, Tomoe
    Abbasi, Bahareh
    2024 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE, IRI 2024, 2024, : 184 - 189
  • [10] Multimodal Locally Enhanced Transformer for Continuous Sign Language Recognition
    Papadimitriou, Katerina
    Potamianos, Gerasimos
    INTERSPEECH 2023, 2023, : 1513 - 1517