HEAT: Holistic Edge Attention Transformer for Structured Reconstruction

Cited by: 16

Authors:

Chen, Jiacheng [1]

Qian, Yiming [2]

Furukawa, Yasutaka [1]

Affiliations:

[1] Simon Fraser Univ, Burnaby, BC, Canada

[2] Univ Manitoba, Winnipeg, MB, Canada

Source:

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022

Funding:

Natural Sciences and Engineering Research Council of Canada

Keywords:

DOI:

10.1109/CVPR52688.2022.00384

Chinese Library Classification:

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405

Abstract:

This paper presents a novel attention-based neural network for structured reconstruction, which takes a 2D raster image as input and reconstructs a planar graph depicting the underlying geometric structure. The approach detects corners and classifies edge candidates between corners in an end-to-end manner. Our contribution is a holistic edge classification architecture, which 1) initializes the feature of an edge candidate by a trigonometric positional encoding of its end-points; 2) fuses image features into each edge candidate by deformable attention; 3) employs two weight-sharing Transformer decoders to learn holistic structural patterns over the graph edge candidates; and 4) is trained with a masked learning strategy. The corner detector is a variant of the edge classification architecture, adapted to operate on pixels as corner candidates. We conduct experiments on two structured reconstruction tasks: outdoor building architecture and indoor floorplan planar graph reconstruction. Extensive qualitative and quantitative evaluations demonstrate the superiority of our approach over the state of the art. Code and pre-trained models are available at https://heat-structured-reconstruction.github.io/
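Step 1 of the abstract (initializing an edge candidate's feature from a trigonometric positional encoding of its end-points) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the number of frequencies, the temperature constant, and the choice to concatenate the four coordinate encodings are all assumptions made for illustration.

```python
import math


def sinusoidal_encoding(value, num_freqs=4, temperature=10000.0):
    """Encode one scalar coordinate as sin/cos pairs at decreasing frequencies.

    num_freqs and temperature are illustrative assumptions, not values
    taken from the paper.
    """
    feats = []
    for i in range(num_freqs):
        freq = 1.0 / (temperature ** (i / num_freqs))
        feats.append(math.sin(value * freq))
        feats.append(math.cos(value * freq))
    return feats


def edge_candidate_feature(corner_a, corner_b, num_freqs=4):
    """Concatenate the encodings of the end-point coordinates (x1, y1, x2, y2)."""
    feature = []
    for x, y in (corner_a, corner_b):
        feature.extend(sinusoidal_encoding(x, num_freqs))
        feature.extend(sinusoidal_encoding(y, num_freqs))
    return feature


def all_edge_candidates(corners):
    """Enumerate every unordered pair of detected corners as an edge candidate."""
    return [(i, j) for i in range(len(corners)) for j in range(i + 1, len(corners))]


# Hypothetical detected corners in pixel coordinates.
corners = [(10.0, 20.0), (200.0, 20.0), (200.0, 180.0)]
candidates = all_edge_candidates(corners)
features = [edge_candidate_feature(corners[i], corners[j]) for i, j in candidates]
print(len(candidates), len(features[0]))
```

In this sketch the feature depends on the order of the two end-points; a real system would need either a canonical ordering or a symmetric combination, and the abstract does not say which the authors use.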

Citation

Pages: 3856-3865

Page count: 10

33 references in total

