Motion2language, unsupervised learning of synchronized semantic motion segmentation

被引:1
|
作者
Radouane, Karim [1 ]
Tchechmedjiev, Andon [1 ]
Lagarde, Julien [2 ]
Ranwez, Sylvie [1 ]
机构
[1] Univ Montpellier, IMT Mines Ales, EuroMov Digital Hlth Mot, Ales, France
[2] Univ Montpellier, IMT Mines Ales, EuroMov Digital Hlth Mot, Montpellier, France
关键词
Unsupervised learning; Semantic segmentation; Synchronized transcription; GRU; Local recurrent attention; WHOLE-BODY MOTION;
D O I
10.1007/s00521-023-09227-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate building a sequence to sequence architecture for motion-to-language translation and synchronization. The aim is to translate motion capture inputs into English natural-language descriptions, such that the descriptions are generated synchronously with the actions performed, enabling semantic segmentation as a byproduct, but without requiring synchronized training data. We propose a new recurrent formulation of local attention that is suited for synchronous/live text generation, as well as an improved motion encoder architecture better suited to smaller data and for synchronous generation. We evaluate both contributions in individual experiments, using the standard BLEU4 metric, as well as a simple semantic equivalence measure, on the KIT motion-language dataset. In a follow-up experiment, we assess the quality of the synchronization of generated text in our proposed approaches through multiple evaluation metrics. We find that both contributions to the attention mechanism and the encoder architecture additively improve the quality of generated text (BLEU and semantic equivalence), but also of synchronization.
引用
收藏
页码:4401 / 4420
页数:20
相关论文
共 50 条
  • [1] Motion2language, unsupervised learning of synchronized semantic motion segmentation
    Karim Radouane
    Andon Tchechmedjiev
    Julien Lagarde
    Sylvie Ranwez
    Neural Computing and Applications, 2024, 36 : 4401 - 4420
  • [2] Unsupervised learning of human motion
    Song, Y
    Goncalves, L
    Perona, P
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (07) : 814 - 827
  • [3] Multi-stage unsupervised learning for multi-body motion segmentation
    Sugaya, Y
    Kanatani, K
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (07): : 1935 - 1942
  • [4] MOTION RECTIFICATION NETWORK FOR UNSUPERVISED LEARNING OF MONOCULAR DEPTH AND CAMERA MOTION
    Liu, Hong
    Hua, Guoliang
    Huang, Weibo
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2805 - 2809
  • [5] Motion-DVAE: Unsupervised learning for fast human motion denoising
    Fiche, Guenole
    Leglaive, Simon
    Alameda-Pineda, Xavier
    Seguier, Renaud
    15TH ANNUAL ACM SIGGRAPH CONFERENCE ON MOTION, INTERACTION AND GAMES, MIG 2023, 2023,
  • [6] A hybrid domain learning framework for unsupervised semantic segmentation
    Zhang, Yuhang
    Tian, Shishun
    Liao, Muxin
    Zou, Wenbin
    Xu, Chen
    NEUROCOMPUTING, 2023, 516 : 133 - 145
  • [7] Flow2Seg: Motion-Aided Semantic Segmentation
    Li, Xiangtai
    Bai, Jiangang
    Yang, Kuiyuan
    Tong, Yunhai
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: IMAGE PROCESSING, PT III, 2019, 11729 : 225 - 237
  • [8] UNSUPERVISED LEARNING OF MOTION PATTERNS USING GENERATIVE MODELS
    Nascimento, Jacinto C.
    Figueiredo, Mario A. T.
    Marques, Jorge S.
    2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 761 - 764
  • [9] Adversarial Framework for Unsupervised Learning of Motion Dynamics in Videos
    Spampinato, C.
    Palazzo, S.
    D'Oro, P.
    Giordano, D.
    Shah, M.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (05) : 1378 - 1397
  • [10] Adversarial Framework for Unsupervised Learning of Motion Dynamics in Videos
    C. Spampinato
    S. Palazzo
    P. D’Oro
    D. Giordano
    M. Shah
    International Journal of Computer Vision, 2020, 128 : 1378 - 1397