Image Collection Pop-up: 3D Reconstruction and Clustering of Rigid and Non-Rigid Categories

被引:18
作者
Agudo, Antonio [1 ]
Pijoan, Melcior [1 ]
Moreno-Noguer, Francesc [1 ]
机构
[1] UPC, CSIC, Inst Robot & Informat Ind, Barcelona 08028, Spain
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
关键词
SHAPE; MOTION;
D O I
10.1109/CVPR.2018.00276
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces an approach to simultaneously estimate 3D shape, camera pose, and object and type of deformation clustering, from partial 2D annotations in a multi-instance collection of images. Furthermore, we can indistinctly process rigid and non-rigid categories. This advances existing work, which only addresses the problem for one single object or, if multiple objects are considered, they are assumed to be clustered a priori. To handle this broader version of the problem, we model object deformation using a formulation based on multiple unions of subspaces, able to span from small rigid motion to complex deformations. The parameters of this model are learned via Augmented Lagrange Multipliers, in a completely unsupervised manner that does not require any training data at all. Extensive validation is provided in a wide variety of synthetic and real scenarios, including rigid and non-rigid categories with small and large deformations. In all cases our approach outperforms state-of-the-art in terms of 3D reconstruction accuracy, while also providing clustering results that allow segmenting the images into object instances and their associated type of deformation (or action the object is performing).
引用
收藏
页码:2607 / 2615
页数:9
相关论文
共 41 条
  • [1] Building Rome in a Day
    Agarwal, Sameer
    Snavely, Noah
    Simon, Ian
    Seitz, Steven M.
    Szeliski, Richard
    [J]. 2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 72 - 79
  • [2] Agudo A., 2016, ACCV
  • [3] Agudo A., 2017, CVPR
  • [4] Combining Local-Physical and Global-Statistical Models for Sequential Deformable Shape from Motion
    Agudo, Antonio
    Moreno-Noguer, Francesc
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 122 (02) : 371 - 387
  • [5] Modal Space: A Physics-Based Model for Sequential Estimation of Time-Varying Shape from Monocular Video
    Agudo, Antonio
    Martinez Montiel, Jose M.
    Agapito, Lourdes
    Calvo, Begona
    [J]. JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2017, 57 (01) : 75 - 98
  • [6] Agudo Antonio, 2015, ICCV, P2
  • [7] Akhter Ijaz, 2008, ADV NEURAL INFORM PR, V21
  • [8] [Anonymous], 3DV
  • [9] [Anonymous], 2013, ICCV
  • [10] Bartoli Adrien., 2008, CVPR