Towards Unconstrained Joint Hand-Object Reconstruction From RGB Videos

被引:29
作者
Hasson, Yana [1 ,2 ]
Varol, Gul [3 ]
Schmid, Cordelia [1 ,2 ]
Laptev, Ivan [1 ,2 ]
机构
[1] INRIA, Le Chesnay, France
[2] PSL Res Univ, CNRS, ENS, Dept Informat, Paris, France
[3] Univ Gustave Eiffel, CNRS, Ecole Ponts, LIGM, Champs Sur Marne, France
来源
2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021) | 2021年
关键词
D O I
10.1109/3DV53792.2021.00075
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Our work aims to obtain 3D reconstruction of hands and manipulated objects from monocular videos. Reconstructing hand-object manipulations holds a great potential for robotics and learning from human demonstrations. The supervised learning approach to this problem, however, requires 3D supervision and remains limited to constrained laboratory settings and simulators for which 3D ground truth is available. In this paper we first propose a learning-free fitting approach for hand-object reconstruction which can seamlessly handle two-hand object interactions. Our method relies on cues obtained with common methods for object detection, hand pose estimation and instance segmentation. We quantitatively evaluate our approach and show that it can be applied to datasets with varying levels of difficulty for which training data is unavailable.
引用
收藏
页码:659 / 668
页数:10
相关论文
共 66 条
[11]  
Chao Yu-Wei, 2021, CVPR
[12]   GanHand: Predicting Human Grasp Affordances in Multi-Object Scenes [J].
Corona, Enric ;
Pumarola, Albert ;
Alenya, Guillem ;
Moreno-Noguer, Francesc ;
Rogez, Gregory .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5030-5040
[13]   Scaling Egocentric Vision: The EPIC-KITCHENS Dataset [J].
Damen, Dima ;
Doughty, Hazel ;
Farinella, Giovanni Maria ;
Fidler, Sanja ;
Furnari, Antonino ;
Kazakos, Evangelos ;
Moltisanti, Davide ;
Munro, Jonathan ;
Perrett, Toby ;
Price, Will ;
Wray, Michael .
COMPUTER VISION - ECCV 2018, PT IV, 2018, 11208 :753-771
[14]   HOPE-Net: A Graph-based Model for Hand-Object Pose Estimation [J].
Doosti, Bardia ;
Naha, Shujon ;
Mirbagheri, Majid ;
Crandall, David J. .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6607-6616
[15]   First-Person Hand Action Benchmark with RGB-D Videos and 3D Hand Pose Annotations [J].
Garcia-Hernando, Guillermo ;
Yuan, Shanxin ;
Baek, Seungryul ;
Kim, Tae-Kyun .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :409-419
[16]   A Papier-Mache Approach to Learning 3D Surface Generation [J].
Groueix, Thibault ;
Fisher, Matthew ;
Kim, Vladimir G. ;
Russell, Bryan C. ;
Aubry, Mathieu .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :216-224
[17]   HOnnotate: A method for 3D Annotation of Hand and Object Poses [J].
Hampali, Shreyas ;
Rad, Mahdi ;
Oberweger, Markus ;
Lepetit, Vincent .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :3193-3203
[18]   Resolving 3D Human Pose Ambiguities with 3D Scene Constraints [J].
Hassan, Mohamed ;
Choutas, Vasileios ;
Tzionas, Dimitrios ;
Black, Michael J. .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :2282-2292
[19]   Learning joint reconstruction of hands and manipulated objects [J].
Hasson, Yana ;
Varol, Gul ;
Tzionas, Dimitrios ;
Kalevatykh, Igor ;
Black, Michael J. ;
Laptev, Ivan ;
Schmid, Cordelia .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11799-11808
[20]  
Hasson Yana, 2020, CVPR