Generic 3D Representation via Pose Estimation and Matching

被引:32
|
作者
Zamir, Amir R. [1 ]
Wekel, Tilman [1 ]
Agrawal, Pulkit [2 ]
Wei, Colin [1 ]
Malik, Jitendra [2 ]
Savarese, Silvio [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
来源
COMPUTER VISION - ECCV 2016, PT III | 2016年 / 9907卷
关键词
Generic vision; Representation; Descriptor learning; Pose estimation; Wide-baseline matching; Street view;
D O I
10.1007/978-3-319-46487-9_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Though a large body of computer vision research has investigated developing generic semantic representations, efforts towards developing a similar representation for 3D has been limited. In this paper, we learn a generic 3D representation through solving a set of foundational proxy 3D tasks: object-centric camera pose estimation and wide baseline feature matching. Our method is based upon the premise that by providing supervision over a set of carefully selected foundational tasks, generalization to novel tasks and abstraction capabilities can be achieved. We empirically show that the internal representation of a multi-task ConvNet trained to solve the above core problems generalizes to novel 3D tasks (e.g., scene layout estimation, object pose estimation, surface normal estimation) without the need for fine-tuning and shows traits of abstraction abilities (e.g., cross modality pose estimation). In the context of the core supervised tasks, we demonstrate our representation achieves state-of-the-art wide baseline feature matching results without requiring apriori rectification (unlike SIFT and the majority of learnt features). We also show 6DOF camera pose estimation given a pair local image patches. The accuracy of both supervised tasks come comparable to humans. Finally, we contribute a large-scale dataset composed of object-centric street view scenes along with point correspondences and camera pose information, and conclude with a discussion on the learned representation and open research questions.
引用
收藏
页码:535 / 553
页数:19
相关论文
共 50 条
  • [1] Invariant representation, matching and pose estimation of 3D space curves under similarity transformations
    Li, SZ
    PATTERN RECOGNITION, 1997, 30 (03) : 447 - 458
  • [2] Monocular 3D Pose Estimation via Pose Grammar and Data Augmentation
    Xu, Yuanlu
    Wang, Wenguan
    Liu, Tengyu
    Liu, Xiaobai
    Xie, Jianwen
    Zhu, Song-Chun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 6327 - 6344
  • [3] Improving 3D Human Pose Estimation via 3D Part Affinity Fields
    Liu, Ding
    Zhao, Zixu
    Wang, Xinchao
    Hu, Yuxiao
    Zhang, Lei
    Huang, Thomas S.
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1004 - 1013
  • [4] 3D Ego-Pose Estimation via Imitation Learning
    Yuan, Ye
    Kitani, Kris
    COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 763 - 778
  • [5] New algorithms for 2D and 3D point matching: Pose estimation and correspondence
    Gold, S
    Rangarajan, A
    Lu, CP
    Pappu, S
    Mjolsness, E
    PATTERN RECOGNITION, 1998, 31 (08) : 1019 - 1031
  • [6] 3D ASSISTED FACE RECOGNITION VIA PROGRESSIVE POSE ESTIMATION
    Zhang, Wuming
    Huang, Di
    Samaras, Dimitris
    Morvan, Jean-Marie
    Wang, Yunhong
    Chen, Liming
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 728 - 732
  • [7] 3D Hand Pose Estimation via Graph-Based Reasoning
    Song, Jae-Hun
    Kang, Suk-Ju
    IEEE ACCESS, 2021, 9 : 35824 - 35833
  • [8] Overview of 3D Human Pose Estimation
    Lin, Jianchu
    Li, Shuang
    Qin, Hong
    Wang, Hongchang
    Cui, Ning
    Jiang, Qian
    Jian, Haifang
    Wang, Gongming
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 134 (03): : 1621 - 1651
  • [9] Local shape feature fusion for improved matching, pose estimation and 3D object recognition
    Buch, Anders G.
    Petersen, Henrik G.
    Kruger, Norbert
    SPRINGERPLUS, 2016, 5
  • [10] 3D object recognition: Representation and matching
    Jain, AK
    Dorai, C
    STATISTICS AND COMPUTING, 2000, 10 (02) : 167 - 182