Self-supervised Representation Learning for Fine Grained Human Hand Action Recognition in Industrial Assembly Lines

Cited by: 2
Authors
Sturm, Fabian [1, 2]
Sathiyababu, Rahul [1 ]
Allipilli, Harshitha [1 ]
Hergenroether, Elke [2 ]
Siegel, Melanie [2 ]
Affiliations
[1] Bosch Rexroth AG, Lise Meitner Str 4, D-89081 Ulm, Germany
[2] Univ Appl Sci Darmstadt, Schoefferstr 3, D-64295 Darmstadt, Germany
Source
ADVANCES IN VISUAL COMPUTING, ISVC 2023, PT I | 2023 / Vol. 14361
Keywords
Self-Supervised Learning; Human Action Recognition; Industrial Vision
DOI
10.1007/978-3-031-47969-4_14
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Humans are still indispensable on industrial assembly lines, but when an error occurs they need support from intelligent systems. Besides the objects being observed, it is equally important to understand a worker's fine-grained hand movements in order to track the entire process. However, deep-learning-based hand action recognition methods are very label-intensive, and not every industrial company can afford the associated labeling costs. This work therefore presents a self-supervised learning approach for industrial assembly processes in which a spatio-temporal transformer architecture is pre-trained on a variety of information from real-world video footage of daily life. The model is subsequently adapted to the industrial assembly task at hand using only a few labels. The work shows which well-known real-world datasets are best suited for representation learning of these hand actions in a regression task, and to what extent they improve the subsequent supervised classification task.
Pages: 172-184
Number of pages: 13