Motion2language, unsupervised learning of synchronized semantic motion segmentation

被引：1

作者：

Radouane, Karim ^{[1
]}

Tchechmedjiev, Andon ^{[1
]}

Lagarde, Julien ^{[2
]}

Ranwez, Sylvie ^{[1
]}

机构：

[1] Univ Montpellier, IMT Mines Ales, EuroMov Digital Hlth Mot, Ales, France

[2] Univ Montpellier, IMT Mines Ales, EuroMov Digital Hlth Mot, Montpellier, France

来源：

NEURAL COMPUTING & APPLICATIONS | 2024年 / 36卷 / 08期

关键词：

Unsupervised learning; Semantic segmentation; Synchronized transcription; GRU; Local recurrent attention; WHOLE-BODY MOTION;

D O I：

10.1007/s00521-023-09227-z

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we investigate building a sequence to sequence architecture for motion-to-language translation and synchronization. The aim is to translate motion capture inputs into English natural-language descriptions, such that the descriptions are generated synchronously with the actions performed, enabling semantic segmentation as a byproduct, but without requiring synchronized training data. We propose a new recurrent formulation of local attention that is suited for synchronous/live text generation, as well as an improved motion encoder architecture better suited to smaller data and for synchronous generation. We evaluate both contributions in individual experiments, using the standard BLEU4 metric, as well as a simple semantic equivalence measure, on the KIT motion-language dataset. In a follow-up experiment, we assess the quality of the synchronization of generated text in our proposed approaches through multiple evaluation metrics. We find that both contributions to the attention mechanism and the encoder architecture additively improve the quality of generated text (BLEU and semantic equivalence), but also of synchronization.

引用

页码：4401 / 4420

页数：20

共 50 条

[1] Motion2language, unsupervised learning of synchronized semantic motion segmentation
Karim Radouane
Andon Tchechmedjiev
Julien Lagarde
Sylvie Ranwez
Neural Computing and Applications, 2024, 36 : 4401 - 4420
[2] Unsupervised learning of human motion
Song, Y
Goncalves, L
Perona, P
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (07) : 814 - 827
[3] Multi-stage unsupervised learning for multi-body motion segmentation
Sugaya, Y
Kanatani, K
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (07): : 1935 - 1942
[4] MOTION RECTIFICATION NETWORK FOR UNSUPERVISED LEARNING OF MONOCULAR DEPTH AND CAMERA MOTION
Liu, Hong
Hua, Guoliang
Huang, Weibo
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2805 - 2809
[5] Motion-DVAE: Unsupervised learning for fast human motion denoising
Fiche, Guenole
Leglaive, Simon
Alameda-Pineda, Xavier
Seguier, Renaud
15TH ANNUAL ACM SIGGRAPH CONFERENCE ON MOTION, INTERACTION AND GAMES, MIG 2023, 2023,
[6] A hybrid domain learning framework for unsupervised semantic segmentation
Zhang, Yuhang
Tian, Shishun
Liao, Muxin
Zou, Wenbin
Xu, Chen
NEUROCOMPUTING, 2023, 516 : 133 - 145
[7] Flow2Seg: Motion-Aided Semantic Segmentation
Li, Xiangtai
Bai, Jiangang
Yang, Kuiyuan
Tong, Yunhai
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: IMAGE PROCESSING, PT III, 2019, 11729 : 225 - 237
[8] UNSUPERVISED LEARNING OF MOTION PATTERNS USING GENERATIVE MODELS
Nascimento, Jacinto C.
Figueiredo, Mario A. T.
Marques, Jorge S.
2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 761 - 764
[9] Adversarial Framework for Unsupervised Learning of Motion Dynamics in Videos
Spampinato, C.
Palazzo, S.
D'Oro, P.
Giordano, D.
Shah, M.
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (05) : 1378 - 1397
[10] Adversarial Framework for Unsupervised Learning of Motion Dynamics in Videos
C. Spampinato
S. Palazzo
P. D’Oro
D. Giordano
M. Shah
International Journal of Computer Vision, 2020, 128 : 1378 - 1397

← 1 2 3 4 5 →