PlaneFormers: From Sparse View Planes to 3D Reconstruction

Cited by: 14

Authors:

Agarwala, Samir [1]

Jin, Linyi [1]

Rockwell, Chris [1]

Fouhey, David F. [1]

Affiliations:

[1] Univ Michigan, Ann Arbor, MI USA

Source:

COMPUTER VISION - ECCV 2022, PT III | 2022 / Vol. 13663

Keywords:

DOI:

10.1007/978-3-031-20062-5_12

Chinese Library Classification:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We present an approach for the planar surface reconstruction of a scene from images with limited overlap. This reconstruction task is challenging since it requires jointly reasoning about single image 3D reconstruction, correspondence between images, and the relative camera pose between images. Past work has proposed optimization-based approaches. We introduce a simpler approach, the PlaneFormer, that uses a transformer applied to 3D-aware plane tokens to perform 3D reasoning. Our experiments show that our approach is substantially more effective than prior work, and that several 3D-specific design decisions are crucial for its success. Code is available at https://github.com/samiragarwala/PlaneFormers.


Pages: 192-209

Page count: 18
