Motion2language, unsupervised learning of synchronized semantic motion segmentation

被引：1

作者：

Radouane, Karim ^{[1
]}

Tchechmedjiev, Andon ^{[1
]}

Lagarde, Julien ^{[2
]}

Ranwez, Sylvie ^{[1
]}

机构：

[1] Univ Montpellier, IMT Mines Ales, EuroMov Digital Hlth Mot, Ales, France

[2] Univ Montpellier, IMT Mines Ales, EuroMov Digital Hlth Mot, Montpellier, France

来源：

NEURAL COMPUTING & APPLICATIONS | 2024年 / 36卷 / 08期

关键词：

Unsupervised learning; Semantic segmentation; Synchronized transcription; GRU; Local recurrent attention; WHOLE-BODY MOTION;

D O I：

10.1007/s00521-023-09227-z

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we investigate building a sequence to sequence architecture for motion-to-language translation and synchronization. The aim is to translate motion capture inputs into English natural-language descriptions, such that the descriptions are generated synchronously with the actions performed, enabling semantic segmentation as a byproduct, but without requiring synchronized training data. We propose a new recurrent formulation of local attention that is suited for synchronous/live text generation, as well as an improved motion encoder architecture better suited to smaller data and for synchronous generation. We evaluate both contributions in individual experiments, using the standard BLEU4 metric, as well as a simple semantic equivalence measure, on the KIT motion-language dataset. In a follow-up experiment, we assess the quality of the synchronization of generated text in our proposed approaches through multiple evaluation metrics. We find that both contributions to the attention mechanism and the encoder architecture additively improve the quality of generated text (BLEU and semantic equivalence), but also of synchronization.

引用

页码：4401 / 4420

页数：20

共 50 条

[21] Unsupervised Learning of Monocular Depth and Ego-Motion in Outdoor/Indoor Environments
Gao, Ruipeng
Xiao, Xuan
Xing, Weiwei
Li, Chi
Liu, Lei
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (17) : 16247 - 16258
[22] Anomaly Detection of Motion Artifact in Photoplethysmography (PPG) Sensors Using Unsupervised Learning
Kwon, Taerim
Yoon, Sang Won
IEEE SENSORS JOURNAL, 2024, 24 (14) : 23163 - 23172
[23] Vehicle Visual SLAM in Dynamic Scenes Based on Semantic Segmentation and Motion Consistency Constraints
Huang S.
Hu M.
Zhou Y.
Yin Z.
Qin X.
Bian Y.
Jia Q.
Qiche Gongcheng/Automotive Engineering, 2022, 44 (10): : 1503 - 1510
[24] Unsupervised deep learning based ego motion estimation with a downward facing camera
Maximilian Gilles
Sascha Ibrahimpasic
The Visual Computer, 2023, 39 : 785 - 798
[25] Unsupervised Learning for Large Motion Thoracic CT Follow-Up Registration
Hering, Alessa
Heldmann, Stefan
MEDICAL IMAGING 2019: IMAGE PROCESSING, 2019, 10949
[26] Fast Semantic Segmentation on Video Using Block Motion-Based Feature Interpolation
Jain, Samvit
Gonzalez, Joseph E.
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 3 - 6
[27] Unsupervised deep learning based ego motion estimation with a downward facing camera
Gilles, Maximilian
Ibrahimpasic, Sascha
VISUAL COMPUTER, 2023, 39 (03) : 785 - 798
[28] Unsupervised Learning of Depth and Ego-Motion from Cylindrical Panoramic Video
Sharma, Alisha
Ventura, Jonathan
2019 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND VIRTUAL REALITY (AIVR), 2019, : 58 - 65
[29] Performance boosting of conventional deep learning-based semantic segmentation leveraging unsupervised clustering
Ma, Jong Won
Leite, Fernanda
AUTOMATION IN CONSTRUCTION, 2022, 136
[30] Towards the Target: Self-regularized Progressive Learning for Unsupervised Domain Adaptation on Semantic Segmentation
Chang, Jui
Pang, Yu-Ting
Hsu, Chiou-Ting
PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 299 - 313

← 1 2 3 4 5 →