Towards Unconstrained Joint Hand-Object Reconstruction From RGB Videos

被引:29
作者
Hasson, Yana [1 ,2 ]
Varol, Gul [3 ]
Schmid, Cordelia [1 ,2 ]
Laptev, Ivan [1 ,2 ]
机构
[1] INRIA, Le Chesnay, France
[2] PSL Res Univ, CNRS, ENS, Dept Informat, Paris, France
[3] Univ Gustave Eiffel, CNRS, Ecole Ponts, LIGM, Champs Sur Marne, France
来源
2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021) | 2021年
关键词
D O I
10.1109/3DV53792.2021.00075
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Our work aims to obtain 3D reconstruction of hands and manipulated objects from monocular videos. Reconstructing hand-object manipulations holds a great potential for robotics and learning from human demonstrations. The supervised learning approach to this problem, however, requires 3D supervision and remains limited to constrained laboratory settings and simulators for which 3D ground truth is available. In this paper we first propose a learning-free fitting approach for hand-object reconstruction which can seamlessly handle two-hand object interactions. Our method relies on cues obtained with common methods for object detection, hand pose estimation and instance segmentation. We quantitatively evaluate our approach and show that it can be applied to datasets with varying levels of difficulty for which training data is unavailable.
引用
收藏
页码:659 / 668
页数:10
相关论文
共 66 条
[1]   Exploiting temporal context for 3D human pose estimation in the wild [J].
Arnab, Anurag ;
Doersch, Carl ;
Zisserman, Andrew .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3390-3399
[2]  
Ballan L, 2012, LECT NOTES COMPUT SC, V7577, P640, DOI 10.1007/978-3-642-33783-3_46
[3]   Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image [J].
Bogo, Federica ;
Kanazawa, Angjoo ;
Lassner, Christoph ;
Gehler, Peter ;
Romero, Javier ;
Black, Michael J. .
COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 :561-578
[4]   3D Hand Shape and Pose from Images in the Wild [J].
Boukhayma, Adnane ;
de Bem, Rodrigo ;
Torr, Philip H. S. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :10835-10844
[5]   ContactPose: A Dataset of Grasps with Object Contact and Hand Pose [J].
Brahmbhatt, Samarth ;
Tang, Chengcheng ;
Twigg, Christopher D. ;
Kemp, Charles C. ;
Hays, James .
COMPUTER VISION - ECCV 2020, PT XIII, 2020, 12358 :361-378
[6]   ContactDB: Analyzing and Predicting Grasp Contact via Thermal Imaging [J].
Brahmbhatt, Samarth ;
Ham, Cusuh ;
Kemp, Charles C. ;
Hays, James .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :8701-8711
[7]   Exploiting Spatial-temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks [J].
Cai, Yujun ;
Ge, Liuhao ;
Liu, Jun ;
Cai, Jianfei ;
Cham, Tat-Jen ;
Yuan, Junsong ;
Thalmann, Nadia Magnenat .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :2272-2281
[8]  
Calli B, 2015, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), P510, DOI 10.1109/ICAR.2015.7251504
[9]   Reconstructing Hand-Object Interactions in the Wild [J].
Cao, Zhe ;
Radosavovic, Ilija ;
Kanazawa, Angjoo ;
Malik, Jitendra .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :12397-12406
[10]  
Chang AX., 2015, ShapeNet: an InformationRich 3D Model Repository, V1512, P03012