Motion2language, unsupervised learning of synchronized semantic motion segmentation

Cited by: 1
Authors
Radouane, Karim [1]
Tchechmedjiev, Andon [1]
Lagarde, Julien [2]
Ranwez, Sylvie [1]
Affiliations
[1] Univ Montpellier, IMT Mines Ales, EuroMov Digital Hlth Mot, Ales, France
[2] Univ Montpellier, IMT Mines Ales, EuroMov Digital Hlth Mot, Montpellier, France
Keywords
Unsupervised learning; Semantic segmentation; Synchronized transcription; GRU; Local recurrent attention; Whole-body motion
DOI
10.1007/s00521-023-09227-z
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we investigate building a sequence-to-sequence architecture for motion-to-language translation and synchronization. The aim is to translate motion capture inputs into English natural-language descriptions such that the descriptions are generated synchronously with the actions performed, enabling semantic segmentation as a byproduct, without requiring synchronized training data. We propose a new recurrent formulation of local attention that is suited to synchronous/live text generation, as well as an improved motion encoder architecture better suited to smaller datasets and to synchronous generation. We evaluate both contributions in individual experiments on the KIT motion-language dataset, using the standard BLEU4 metric as well as a simple semantic equivalence measure. In a follow-up experiment, we assess the synchronization quality of the text generated by our proposed approaches through multiple evaluation metrics. We find that the contributions to the attention mechanism and to the encoder architecture additively improve both the quality of the generated text (BLEU and semantic equivalence) and its synchronization.
Pages: 4401-4420
Number of pages: 20
Related papers
50 in total
  • [21] Unsupervised Learning of Monocular Depth and Ego-Motion in Outdoor/Indoor Environments
    Gao, Ruipeng
    Xiao, Xuan
    Xing, Weiwei
    Li, Chi
    Liu, Lei
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (17) : 16247 - 16258
  • [22] Anomaly Detection of Motion Artifact in Photoplethysmography (PPG) Sensors Using Unsupervised Learning
    Kwon, Taerim
    Yoon, Sang Won
    IEEE SENSORS JOURNAL, 2024, 24 (14) : 23163 - 23172
  • [23] Vehicle Visual SLAM in Dynamic Scenes Based on Semantic Segmentation and Motion Consistency Constraints
    Huang S.
    Hu M.
    Zhou Y.
    Yin Z.
    Qin X.
    Bian Y.
    Jia Q.
    Qiche Gongcheng/Automotive Engineering, 2022, 44 (10) : 1503 - 1510
  • [24] Unsupervised deep learning based ego motion estimation with a downward facing camera
    Maximilian Gilles
    Sascha Ibrahimpasic
    The Visual Computer, 2023, 39 : 785 - 798
  • [25] Unsupervised Learning for Large Motion Thoracic CT Follow-Up Registration
    Hering, Alessa
    Heldmann, Stefan
    MEDICAL IMAGING 2019: IMAGE PROCESSING, 2019, 10949
  • [26] Fast Semantic Segmentation on Video Using Block Motion-Based Feature Interpolation
    Jain, Samvit
    Gonzalez, Joseph E.
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 3 - 6
  • [27] Unsupervised deep learning based ego motion estimation with a downward facing camera
    Gilles, Maximilian
    Ibrahimpasic, Sascha
    VISUAL COMPUTER, 2023, 39 (03) : 785 - 798
  • [28] Unsupervised Learning of Depth and Ego-Motion from Cylindrical Panoramic Video
    Sharma, Alisha
    Ventura, Jonathan
    2019 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND VIRTUAL REALITY (AIVR), 2019 : 58 - 65
  • [29] Performance boosting of conventional deep learning-based semantic segmentation leveraging unsupervised clustering
    Ma, Jong Won
    Leite, Fernanda
    AUTOMATION IN CONSTRUCTION, 2022, 136
  • [30] Towards the Target: Self-regularized Progressive Learning for Unsupervised Domain Adaptation on Semantic Segmentation
    Chang, Jui
    Pang, Yu-Ting
    Hsu, Chiou-Ting
    PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 299 - 313