Multibranch Adversarial Regression for Domain Adaptative Hand Pose Estimation

Cited by: 6
Authors
Jin, Rui [1]
Zhang, Jing [2]
Yang, Jianyu [1]
Tao, Dacheng [3]
Affiliations
[1] Soochow Univ, Sch Rail Transportat, Suzhou 215131, Peoples R China
[2] Univ Sydney, Fac Engn, Sch Comp Sci, Darlington, NSW 2008, Australia
[3] JD Explore Acad, Beijing 100176, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Training; Pose estimation; Adaptation models; Task analysis; Feature extraction; Training data; Data models; Hand pose estimation; unsupervised domain adaptation; adversarial training; mean teacher;
DOI
10.1109/TCSVT.2022.3158676
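Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808; 0809;
Abstract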
Although hand pose estimation has achieved great success in recent years, RGB-based estimation still faces challenges, the most significant of which is the lack of labeled training data. Synthetic datasets offer plenty of accurately annotated images, but their difference from real-world datasets hurts generalization. A frequent solution is therefore a transfer learning strategy, which tries to transfer knowledge from a labeled source domain to an unlabeled target domain. Existing methods such as mean teacher, CycleGAN, and MCD train models with the help of easily accessible domains such as synthetic data. However, these methods are not guaranteed to work well in real-world settings because of the domain shift. In this paper, we design a new unsupervised domain adaptation method for hand pose estimation, named Multi-branch Adversarial Regressors (MarsDA), which is intended to transfer features more effectively. Specifically, we first generate pseudo-labels for unlabeled target domain data. Then, we introduce a new adversarial training loss between multiple regression branches, designed for hand pose estimation, to narrow the domain gap. In this way, our model can reduce the pseudo-label noise caused by the domain gap and improve pseudo-label accuracy. We evaluate our method on two publicly available real-world datasets, H3D and STB. Experimental results show that our method outperforms existing methods by a large margin.
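The abstract names three ingredients (pseudo-labels, multiple regression branches, and a mean teacher) without giving their formulas. The sketch below is only an illustration of how such pieces are commonly combined and is not the MarsDA loss itself. It assumes a mean-teacher style exponential moving average, pseudo-labels taken as the average of the branch outputs, and an MCD-style mean absolute discrepancy between branches. All function names and the decay value are hypothetical.

```python
from typing import List, Sequence

# A pose is a flattened list of keypoint coordinates, e.g. [x0, y0, x1, y1, ...].
Pose = List[float]


def ema_update(teacher: Sequence[float], student: Sequence[float],
               decay: float = 0.99) -> List[float]:
    """Mean-teacher step (assumed form): teacher weights follow an
    exponential moving average of the student weights."""
    return [decay * t + (1.0 - decay) * s for t, s in zip(teacher, student)]


def pseudo_label(branch_predictions: Sequence[Pose]) -> Pose:
    """Assumed pseudo-label rule: average the branch outputs coordinate-wise
    for one unlabeled target-domain image."""
    n = len(branch_predictions)
    return [sum(values) / n for values in zip(*branch_predictions)]


def branch_discrepancy(branch_predictions: Sequence[Pose]) -> float:
    """MCD-style discrepancy (assumed form): mean absolute difference,
    averaged over every pair of regression branches."""
    total, pairs = 0.0, 0
    for i in range(len(branch_predictions)):
        for j in range(i + 1, len(branch_predictions)):
            a, b = branch_predictions[i], branch_predictions[j]
            total += sum(abs(x - y) for x, y in zip(a, b)) / len(a)
            pairs += 1
    return total / pairs if pairs else 0.0
```

In an MCD-style adversarial scheme, the regression branches would be trained to increase this discrepancy on target images while the shared feature extractor is trained to decrease it. Whether MarsDA follows exactly this schedule is not stated in the abstract.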
Pages: 6125-6136
Page count: 12