PlaneFormers: From Sparse View Planes to 3D Reconstruction

Cited by: 9
Authors
Agarwala, Samir [1]
Jin, Linyi [1]
Rockwell, Chris [1]
Fouhey, David F. [1]
Affiliations
[1] Univ Michigan, Ann Arbor, MI USA
Source
COMPUTER VISION - ECCV 2022, PT III | 2022, Vol. 13663
DOI
10.1007/978-3-031-20062-5_12
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present an approach for the planar surface reconstruction of a scene from images with limited overlap. This reconstruction task is challenging since it requires jointly reasoning about single image 3D reconstruction, correspondence between images, and the relative camera pose between images. Past work has proposed optimization-based approaches. We introduce a simpler approach, the PlaneFormer, that uses a transformer applied to 3D-aware plane tokens to perform 3D reasoning. Our experiments show that our approach is substantially more effective than prior work, and that several 3D-specific design decisions are crucial for its success. Code is available at https://github.com/samiragarwala/PlaneFormers.
Pages: 192-209
Page count: 18