Text2Sign: Towards Sign Language Production Using Neural Machine Translation and Generative Adversarial Networks

被引:104
|
作者
Stoll, Stephanie [1 ]
Camgoz, Necati Cihan [1 ]
Hadfield, Simon [1 ]
Bowden, Richard [1 ]
机构
[1] Ctr Vis Speech & Signal Proc, Guildford, Surrey, England
基金
欧盟地平线“2020”; 英国工程与自然科学研究理事会;
关键词
Generative adversarial networks; Neural machine translation; Sign language production;
D O I
10.1007/s11263-019-01281-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel approach to automatic Sign Language Production using recent developments in Neural Machine Translation (NMT), Generative Adversarial Networks, and motion generation. Our system is capable of producing sign videos from spoken language sentences. Contrary to current approaches that are dependent on heavily annotated data, our approach requires minimal gloss and skeletal level annotations for training. We achieve this by breaking down the task into dedicated sub-processes. We first translate spoken language sentences into sign pose sequences by combining an NMT network with a Motion Graph. The resulting pose information is then used to condition a generative model that produces photo realistic sign language video sequences. This is the first approach to continuous sign video generation that does not use a classical graphical avatar. We evaluate the translation abilities of our approach on the PHOENIX14T Sign Language Translation dataset. We set a baseline for text-to-gloss translation, reporting a BLEU-4 score of 16.34/15.26 on dev/test sets. We further demonstrate the video generation capabilities of our approach for both multi-signer and high-definition settings qualitatively and quantitatively using broadcast quality assessment metrics.
引用
收藏
页码:891 / 908
页数:18
相关论文
共 50 条
  • [1] Text2Sign: Towards Sign Language Production Using Neural Machine Translation and Generative Adversarial Networks
    Stephanie Stoll
    Necati Cihan Camgoz
    Simon Hadfield
    Richard Bowden
    International Journal of Computer Vision, 2020, 128 : 891 - 908
  • [2] American and Russian Sign Language Dactyl Recognition and Text2Sign Translation
    Makarov, Ilya
    Veldyaykin, Nikolay
    Chertkov, Maxim
    Pokoev, Aleksei
    ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, AIST 2019, 2019, 11832 : 309 - 320
  • [3] Sign Language Video Generation from Text Using Generative Adversarial Networks
    Sreemathy, R.
    Chordiya, Param
    Khurana, Soumya
    Turuk, Mousami
    OPTICAL MEMORY AND NEURAL NETWORKS, 2024, 33 (04) : 466 - 476
  • [4] Neural machine translation from text to sign language
    De Martino, Jose Mario
    Silva, Ivani Rodrigues
    Marques, Janice Goncalves Temoteo
    Martins, Antonielle Cantarelli
    Poeta, Enzo Telles
    Christinele, Dener Stassun
    Campos, Joao Pedro Araujo Ferreira
    UNIVERSAL ACCESS IN THE INFORMATION SOCIETY, 2025, 24 (01) : 37 - 50
  • [5] Using Neural Machine Translation Methods for Sign Language Translation
    Angelova, Galina
    Avramidis, Eleftherios
    Moeller, Sebastian
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): STUDENT RESEARCH WORKSHOP, 2022, : 273 - 284
  • [6] Neural Machine Translation Methods for Translating Text to Sign Language Glosses
    Zhu, Dele
    Czehmann, Vera
    Avramidis, Eleftherios
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 12523 - 12541
  • [7] Neural machine translation techniques for English text to Pakistan sign language gloss translation
    Tanwir, Abdul Majid
    Jilani, Muhammad Najeeb
    Khan, Zaviar
    Samad, Abdul
    UNIVERSAL ACCESS IN THE INFORMATION SOCIETY, 2024,
  • [8] ATLASLang NMT: Arabic text language into Arabic sign language neural machine translation
    Brour, Mourad
    Benabbou, Abderrahim
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2021, 33 (09) : 1121 - 1131
  • [9] Sign Language Translation Using Deep Convolutional Neural Networks
    Abiyev, Rahib H.
    Arslan, Murat
    Idok, John Bush
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2020, 14 (02) : 631 - 653
  • [10] Machine translation from text to sign language: a systematic review
    Kahlon, Navroz Kaur
    Singh, Williamjeet
    UNIVERSAL ACCESS IN THE INFORMATION SOCIETY, 2023, 22 (01) : 1 - 35