Motion2language, unsupervised learning of synchronized semantic motion segmentation

被引:1
|
作者
Radouane, Karim [1 ]
Tchechmedjiev, Andon [1 ]
Lagarde, Julien [2 ]
Ranwez, Sylvie [1 ]
机构
[1] Univ Montpellier, IMT Mines Ales, EuroMov Digital Hlth Mot, Ales, France
[2] Univ Montpellier, IMT Mines Ales, EuroMov Digital Hlth Mot, Montpellier, France
关键词
Unsupervised learning; Semantic segmentation; Synchronized transcription; GRU; Local recurrent attention; WHOLE-BODY MOTION;
D O I
10.1007/s00521-023-09227-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate building a sequence to sequence architecture for motion-to-language translation and synchronization. The aim is to translate motion capture inputs into English natural-language descriptions, such that the descriptions are generated synchronously with the actions performed, enabling semantic segmentation as a byproduct, but without requiring synchronized training data. We propose a new recurrent formulation of local attention that is suited for synchronous/live text generation, as well as an improved motion encoder architecture better suited to smaller data and for synchronous generation. We evaluate both contributions in individual experiments, using the standard BLEU4 metric, as well as a simple semantic equivalence measure, on the KIT motion-language dataset. In a follow-up experiment, we assess the quality of the synchronization of generated text in our proposed approaches through multiple evaluation metrics. We find that both contributions to the attention mechanism and the encoder architecture additively improve the quality of generated text (BLEU and semantic equivalence), but also of synchronization.
引用
收藏
页码:4401 / 4420
页数:20
相关论文
共 50 条
  • [31] Unsupervised domain adaptive building semantic segmentation network by edge-enhanced contrastive learning
    Yang, Mengyuan
    Yang, Rui
    Tao, Shikang
    Zhang, Xin
    Wang, Min
    NEURAL NETWORKS, 2024, 179
  • [32] Cross-dataset semantic segmentation for composite crack detection using unsupervised transfer learning
    Zhao, Pengchao
    Xu, Wenyuan
    Qi, Dawei
    Yuan, Bo
    COMPOSITE STRUCTURES, 2025, 362
  • [33] Increasing the localization accuracy of visual SLAM with semantic segmentation and motion consistency detection in dynamic scenes
    Shen, Dong
    Fang, Haoyu
    Li, Qiang
    Liu, Jiale
    Guo, Sheng
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (05) : 7501 - 7512
  • [34] Vision-Based Uncertainty-Aware Motion Planning Based on Probabilistic Semantic Segmentation
    Roemer, Ralf
    Lederer, Armin
    Tesfazgi, Samuel
    Hirche, Sandra
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (11) : 7825 - 7832
  • [35] DecoupledPoseNet: Cascade Decoupled Pose Learning for Unsupervised Camera Ego-Motion Estimation
    Zhou, Wenhui
    Zhang, Hua
    Yan, Zhengmao
    Wang, Weisheng
    Lin, Lili
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1636 - 1648
  • [36] A New Neural Computation Scheme of Unsupervised Learning with Applications to Robot Biped Loco motion
    Hidenori, Kimura
    PROCEEDINGS OF THE 27TH CHINESE CONTROL CONFERENCE, VOL 1, 2008, : 5 - 5
  • [37] A unified framework for unsupervised action learning via global-to-local motion transformer
    Kim, Boeun
    Kim, Jungho
    Chang, Hyung Jin
    Oh, Tae-Hyun
    PATTERN RECOGNITION, 2025, 159
  • [38] Geometry perception and motion planning in robotic assembly based on semantic segmentation and point clouds reconstruction
    Jiang, Yuze
    Liu, Guanghui
    Huang, Zhouzhou
    Yang, Bin
    Yang, Wenyu
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 130
  • [39] Improving Unsupervised Learning of Monocular Depth and Ego-Motion via Stereo Network
    He, Mu
    Xie, Jin
    Yang, Jian
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 421 - 433
  • [40] Perceiving Spectral Variation: Unsupervised Spectrum Motion Feature Learning for Hyperspectral Image Classification
    Sun, Yifan
    Liu, Bing
    Yu, Xuchu
    Yu, Anzhu
    Gao, Kuiliang
    Ding, Lei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60