Self-supervised Representation Learning for Fine Grained Human Hand Action Recognition in Industrial Assembly Lines

被引:2
作者
Sturm, Fabian [1 ,2 ]
Sathiyababu, Rahul [1 ]
Allipilli, Harshitha [1 ]
Hergenroether, Elke [2 ]
Siegel, Melanie [2 ]
机构
[1] Bosch Rexroth AG, Lise Meitner Str 4, D-89081 Ulm, Germany
[2] Univ Appl Sci Darmstadt, Schoefferstr 3, D-64295 Darmstadt, Germany
来源
ADVANCES IN VISUAL COMPUTING, ISVC 2023, PT I | 2023年 / 14361卷
关键词
Self-Supervised Learning; Human Action Recognition; Industrial Vision;
D O I
10.1007/978-3-031-47969-4_14
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Humans are still indispensable on industrial assembly lines, but in the event of an error, they need support from intelligent systems. In addition to the objects to be observed, it is equally important to understand the fine-grained hand movements of a human to be able to track the entire process. However, these deep learning based hand action recognition methods are very label intensive, which cannot be offered by all industrial companies due to the associated costs. This work therefore presents a self-supervised learning approach for industrial assembly processes that allows a spatio-temporal transformer architecture to be pre-trained on a variety of information from real-world video footage of daily life. Subsequently, this deep learning model is adapted to the industrial assembly task at hand using only a few labels. It is shown which known real-world datasets are best suited for representation learning of these hand actions in a regression task, and to what extent they optimize the subsequent supervised trained classification task.
引用
收藏
页码:172 / 184
页数:13
相关论文
共 50 条
[41]   Self-Supervised Learning via Multi-Transformation Classification for Action Recognition [J].
Duc-Quang Vu ;
Ngan Le ;
Wang, Jia-Ching .
2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS, ICMEW 2024, 2024,
[42]   Comparing self-supervised learning techniques for wearable human activity recognition [J].
Ek, Sannara ;
Presotto, Riccardo ;
Civitarese, Gabriele ;
Portet, Francois ;
Lalanda, Philippe ;
Bettini, Claudio .
CCF TRANSACTIONS ON PERVASIVE COMPUTING AND INTERACTION, 2025,
[43]   Transformer-Based Self-Supervised Multimodal Representation Learning for Wearable Emotion Recognition [J].
Wu, Yujin ;
Daoudi, Mohamed ;
Amad, Ali .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (01) :157-172
[44]   An end-to-end integration of speech separation and recognition with self-supervised learning representation [J].
Masuyama, Yoshiki ;
Chang, Xuankai ;
Zhang, Wangyou ;
Cornell, Samuele ;
Wang, Zhong-Qiu ;
Ono, Nobutaka ;
Qian, Yanmin ;
Watanabe, Shinji .
COMPUTER SPEECH AND LANGUAGE, 2026, 95
[45]   A Novel Self-supervised Representation Learning Model for an Open-Set Speaker Recognition [J].
Ohi, Abu Quwsar ;
Gavrilova, Marina L. .
COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT, CISIM 2023, 2023, 14164 :270-282
[46]   Self-Supervised Representation Learning With Spatial-Temporal Consistency for Sign Language Recognition [J].
Zhao, Weichao ;
Zhou, Wengang ;
Hu, Hezhen ;
Wang, Min ;
Li, Houqiang .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 :4188-4201
[47]   Facial Action Unit Representation Based on Self-Supervised Learning With Ensembled Priori Constraints [J].
Chen, Haifeng ;
Zhang, Peng ;
Guo, Chujia ;
Lu, Ke ;
Jiang, Dongmei .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 :5045-5059
[48]   CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning [J].
Meng, Chutong ;
Ao, Junyi ;
Ko, Tom ;
Wang, Mingxuan ;
Li, Haizhou .
INTERSPEECH 2023, 2023, :2978-2982
[49]   ViewMix: Augmentation for Robust Representation in Self-Supervised Learning [J].
Das, Arjon ;
Zhong, Xin .
IEEE ACCESS, 2024, 12 :8461-8470
[50]   TRIBYOL: TRIPLET BYOL FOR SELF-SUPERVISED REPRESENTATION LEARNING [J].
Li, Guang ;
Togo, Ren ;
Ogawa, Takahiro ;
Haseyama, Miki .
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, :3458-3462