Seeing 3D chairs: exemplar part-based 2D-3D alignment using a large dataset of CAD models

被引:248
作者
Aubry, Mathieu [1 ]
Maturana, Daniel [2 ]
Efros, Alexei A. [3 ]
Russell, Bryan C. [4 ]
Sivic, Josef [1 ]
机构
[1] INRIA, Paris, France
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Univ Calif Berkeley, Berkeley, CA USA
[4] Intel Labs, Cambridge, England
来源
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014年
关键词
D O I
10.1109/CVPR.2014.487
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper poses object category detection in images as a type of 2D-to-3D alignment problem, utilizing the large quantities of 3D CAD models that have been made publicly available online. Using the "chair" class as a running example, we propose an exemplar-based 3D category representation, which can explicitly model chairs of different styles as well as the large variation in viewpoint. We develop an approach to establish part-based correspondences between 3D CAD models and real photographs. This is achieved by (i) representing each 3D model using a set of view-dependent mid-level visual elements learned from synthesized views in a discriminative fashion, (ii) carefully calibrating the individual element detectors on a common dataset of negative images, and (iii) matching visual elements to the test image allowing for small mutual deformations but preserving the viewpoint and style constraints. We demonstrate the ability of our system to align 3D models with 2D objects in the challenging PASCAL VOC images, which depict a wide variety of chairs in complex scenes.
引用
收藏
页码:3762 / 3769
页数:8
相关论文
共 38 条
  • [1] [Anonymous], 2013, CVPR
  • [2] [Anonymous], ICCV
  • [3] [Anonymous], ICCV
  • [4] [Anonymous], ICCV
  • [5] [Anonymous], ACM T GRAPHICS
  • [6] [Anonymous], 2010, IEEE PAMI
  • [7] [Anonymous], IEEE PAMI
  • [8] Arandjelovic Relja, 2011, ICCV
  • [9] Baatz G., 2012, ECCV
  • [10] Poselets: Body Part Detectors Trained Using 3D Human Pose Annotations
    Bourdev, Lubomir
    Malik, Jitendra
    [J]. 2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 1365 - 1372